Founded in 2010, Machinetutors provides machine learning consultancy and customized AI software development services. Machinetutors empowers businesses all over the world by solving real-world problems. Machinetutors has two products; mtDATA, a data collection and annotation services platform, and mtAPI, SaaS AI solutions with pre-trained models, customization options, and scalable infrastructure.

TRUBA HPC Center, operating under TUBITAK ULAKBIM, coordinates NCC Turkey. Middle East Technical University (METU), Sabancı University (SU), and Istanbul Technical University National Center for High-Performance Computing (UHeM) are the third parties of the NCC. Our objective is to level up the industry, public sector, and academia in High-Performance Computing (HPC), High-Performance Data Analytics (HPDA), Artificial Intelligence (AI), and prepare for the developing EuroHPC ecosystem. Our competencies include HPC, HPDA, AI, CUDA, Materials Science, and Computational Fluid Dynamics (CFD). The particular third party presenting this success story is TRUBA.

        

Technical/scientific Challenge

This project addresses the problem of large-scale real-time image-based content moderation. The system is deployed to a production environment where tens of thousands of users browse the internet daily. The system must be both accurate and run in real-time to meet the business requirements. Moreover, the model size must be small so that multiple copies of the model can be run simultaneously on a GPU to reduce server costs. A major challenge has been making several models work efficiently together.

Solution

In order to solve the problem defined, we develop three main models. In the first model, we propose a multi-label NSFW classifier that can detect the NSFW levels (light, medium, hard) and predict other labels, such as the real person and clothing characteristics. The second model is a one-stage body -based age & gender detection model. Current age & gender methods are both face based i.e. they use face bounding boxes and are two-stage processes, they first run a face detector and then run the model on these boxes. When multiple faces are present in an image, this approach fails to meet the real-time requirement. The third one is a segmentation model. These three models run in a pipeline via which we can run various scenarios.

Business impact

Our client, a SaaS online content moderation start-up, is currently the number one content filter in their specific target market around the world thanks to the success of AI-supported high-tech features developed during this project.

User feedback on all our models from our clients’ users are positive and they consider it to be the best product on the market. Thousands of users now browse the Internet with their adjusted moderation level. Our client has already reached a breakeven point financially.

HPC’s speed and cost benefits enabled the project to be successfully delivered on time. All of the engineers on Machinetutors team are now proficient in using the TRUBA infrastructure due to this collaboration. We were able to work effectively and efficiently with our colleagues from TRUBA and look forward to the next project.

Benefits

Machinetutors:

  • With this collaboration, we were able to run many experiments in parallel and quickly see the effects of the model updates.
  • With the ability to run large batch size trainings on newer GPUs, our experiments completed much faster.
  • Being able to access many GPUs at the same time enabled us to tune the hyper-parameters of each model to improve the results.
  • The speed and cost-efficiency provided by this support have helped us gain a considerable competitive advantage in the global AI ecosystem.

Keywords

  • Artificial Intelligence, Machine Learning, Deep Learning, Content Moderation, Classification, Segmentation, Object Detection, Data Collection, and Annotation.
  • Industry sector: Computer Science, Artificial Intelligence, Software
  • Technology: HPC, AI