The New Griffeye AI Technology – Trained at Taskforce Argos

These days computers can use high-performance AI technology to optimize energy grids for entire cities, provide accurate sports analysis, create movie trailers, detect the spread of cancer and so on. The areas where it can be used are almost endless. At Griffeye, our AI technology detects child sexual abuse content in massive image and video datasets. Here’s how we trained it.

Training AI technology on the right data, meaning data that is both relevant and large enough, is absolutely necessary for it to work and produce high-quality results. This was one of the most important aspects for us, because our goal was to detect with high precision the distinctive features that characterize child sexual abuse (CSA) images and videos.

Our AI technology has been trained on categorized CSA material at Taskforce Argos of the Queensland Police in Australia, one of the world’s – if not the world’s – most renowned victim identification units.

Taskforce Argos’ database is quality assured, and each image has been reviewed and categorized by police in Australia according to the criteria that collectively describe CSA material. We have also trained our AI technology on adult pornography and other irrelevant content, so that it learns the difference between relevant and irrelevant material.

The training process

The Griffeye AI technology was pre-trained to know what an image is and what is important in an image, such as faces and people. At that stage, our AI technology could understand images but couldn’t recognize sexual abuse of children.


1. Training

The training set at Taskforce Argos consisted of around 300,000 unique images. The algorithm was exposed directly to the illegal images so that it could learn what they look like and find similarities and relationships between them. In addition, legal images were added during training, some of which were visually similar to the illegal images, so that the algorithm learned the specific details that determine whether an image depicts child sexual abuse or not.

2. Validation

In addition to the training set, a validation set of images is used to find out how the training is going. This set is smaller than the training set and is used to validate whether the technology has been properly trained. Does the algorithm understand the relationships? And how specific or general is the classification? As long as the classification performance on the validation set improves, training continues.

3. Test of accuracy


The last step of the training process was a final test to measure how accurately the AI technology identified and classified images that depict CSA. This final test was run on yet another new set of material.
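The three steps above rely on three separate sets of material. As a minimal sketch of that idea, and not a description of how the sets at Taskforce Argos were actually assembled, the Python below shuffles a list of item identifiers and cuts it into disjoint training, validation and test lists. The function name, the 10 percent fractions and the placeholder identifiers are all assumptions made for illustration.

```python
import random


def split_dataset(items, val_fraction=0.1, test_fraction=0.1, seed=0):
    """Shuffle items and split them into disjoint train/validation/test lists.

    The fractions are illustrative assumptions, not figures from the article.
    """
    if not 0 <= val_fraction + test_fraction < 1:
        raise ValueError("val_fraction + test_fraction must be in [0, 1)")

    shuffled = list(items)
    # A seeded generator makes the split reproducible between runs.
    random.Random(seed).shuffle(shuffled)

    n_test = int(len(shuffled) * test_fraction)
    n_val = int(len(shuffled) * val_fraction)

    test = shuffled[:n_test]
    val = shuffled[n_test:n_test + n_val]
    train = shuffled[n_test + n_val:]
    return train, val, test


# Placeholder identifiers stand in for real, categorized material.
ids = [f"item_{i}" for i in range(1000)]
train, val, test = split_dataset(ids)
print(len(train), len(val), len(test))  # 800 100 100
```

Because the test list is never touched during training or validation, the score measured on it is not influenced by any decision made while the model was being tuned.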

The challenges of AI technology training

One of the biggest challenges with machine learning in general is “overfitting”, which means that the algorithm simply becomes too good at classifying the images it is trained on. The classification becomes too specific, instead of capturing similarities and relationships between the objects in the pictures. When that happens, it’s no longer possible to generalize the learned connections to new images. That’s largely why the validation set is so important. However, there is a risk that overfitting will eventually affect the validation set as well, which in turn explains the need for the final accuracy test to give a true answer about how good the technology really is.
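A common way to use the validation set against overfitting is “early stopping”: keep training while the validation score improves, and stop once it has failed to improve for a few rounds. The sketch below illustrates that rule on a list of made-up validation scores. It is an assumption about how such a check can be written, not Griffeye’s actual training code, and the `patience` parameter and the example scores are hypothetical.

```python
def early_stopping_epoch(val_scores, patience=2):
    """Return (best_epoch, stop_epoch) for a sequence of validation scores.

    Training "stops" at the first epoch where the score has not improved
    for `patience` consecutive epochs. `patience` is an illustrative
    assumption; the article does not state a stopping rule in this detail.
    """
    best_score = float("-inf")
    best_epoch = -1
    for epoch, score in enumerate(val_scores):
        if score > best_score:
            # Validation performance improved: remember this epoch.
            best_score, best_epoch = score, epoch
        elif epoch - best_epoch >= patience:
            # No improvement for `patience` epochs: likely overfitting.
            return best_epoch, epoch
    # Ran out of epochs without triggering the stopping rule.
    return best_epoch, len(val_scores) - 1


# Made-up scores: validation performance peaks at epoch 2, then declines.
scores = [0.71, 0.78, 0.83, 0.82, 0.81, 0.80]
best, stopped = early_stopping_epoch(scores, patience=2)
print(best, stopped)  # 2 4
```

The model saved at the best epoch is the one that would then go on to the final test on unseen material.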

Another difficulty – and perhaps the greatest challenge of all when training AI technology on CSA material – is the availability of data to train on. In other cases where you train AI technology, you can find and even create the pictures you need. But with child sexual abuse images it becomes difficult. We can neither create nor collect illegal material to train the technology on. So we had to find different ways to work around the problem.

Feedback from our users

Although relatively few users have access to the AI technology so far (it is currently a beta version), several users and organizations have provided detailed feedback. The general message is clearly positive. Some have already managed to integrate the Griffeye AI into their daily work, and with good results. But as a beta version, there’s still a lot to work on.

We’ve received valuable feedback from users suggesting patterns in the mistakes the technology makes. For example, it’s difficult to classify small images in a collage, because the algorithm can’t find the details in the pictures. At present, we scale down the image because larger images require more computing power, but that makes finding details in collages difficult.
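A little arithmetic shows why downscaling hurts collages in particular. The sketch below assumes the whole image is shrunk, with its aspect ratio preserved, to fit a square model input, and then works out how many pixels each tile of the collage is left with. The 224-pixel input size, the function name and the example collage are assumptions for illustration; the article does not state the resolution Griffeye uses.

```python
def tile_size_after_downscale(collage_size, grid, model_input_size):
    """Return the (width, height) in pixels of one collage tile after the
    whole collage is scaled down to fit a square model input.

    collage_size: (width, height) of the original collage in pixels.
    grid: (columns, rows) of equally sized tiles in the collage.
    model_input_size: side length of the assumed square model input.
    """
    width, height = collage_size
    cols, rows = grid
    # Aspect-preserving scale factor so the longer side fits the input.
    scale = min(model_input_size / width, model_input_size / height)
    tile_w = width / cols * scale
    tile_h = height / rows * scale
    return tile_w, tile_h


# A 1920x1080 collage made of 4 columns and 3 rows of images.
tile_w, tile_h = tile_size_after_downscale((1920, 1080), (4, 3), 224)
print(round(tile_w), round(tile_h))
```

Under these assumptions each tile shrinks from 480 by 360 pixels to about 56 by 42 pixels, which leaves very little detail for a classifier to work with.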

Another example where the classifier sometimes makes mistakes is certain types of images of children, such as holiday pictures on the beach. For the algorithm, these images are often too similar to illegal pictures. However, by training the algorithm on these types of images, the AI technology can learn to understand the difference, and quickly classify what is illegal.
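One generic way to act on this kind of feedback is to collect the legal images that the classifier wrongly scored as likely illegal and add them to the next round of training, a practice often called hard negative mining. The sketch below shows only the selection step, on made-up identifiers and scores. The function name, the 0.5 threshold and the data are hypothetical, and this is not presented as Griffeye’s own procedure.

```python
def select_hard_negatives(scored_legal_items, threshold=0.5):
    """Return ids of known-legal items that the classifier scored at or
    above `threshold`, ordered from highest score to lowest.

    scored_legal_items: iterable of (item_id, score) pairs, where score is
    the classifier's estimated probability that the item is illegal.
    """
    wrong = [
        (score, item_id)
        for item_id, score in scored_legal_items
        if score >= threshold
    ]
    # The most confidently wrong predictions come first.
    wrong.sort(reverse=True)
    return [item_id for _, item_id in wrong]


# Hypothetical scores for images already confirmed as legal.
legal_scores = [
    ("legal_001", 0.05),
    ("legal_002", 0.91),
    ("legal_003", 0.48),
    ("legal_004", 0.67),
]
print(select_hard_negatives(legal_scores))  # ['legal_002', 'legal_004']
```

Retraining with these items labeled as legal gives the algorithm more examples of exactly the cases it currently confuses.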

Future developments

Training AI technology is an ongoing process. The more categorized material we get access to, the better the technology becomes. Therefore, the next step will be to train our technology on new data from other organizations and authorities. In addition to training the technology on more illegal material, we will also train it on larger data sets of legal images.

The ambition is also to train the AI technology to distinguish with great precision how CSA is defined and categorized in different countries. Imagine that you can have a baseline AI technology that can be used straight away in most countries, but also specific AI algorithms for how CSA material is legally defined in different countries.

Research in AI and machine learning is advancing rapidly, but the basis for how the models are trained is the same. The amount and quality of data used to train the algorithm determine how good the results are. The massive volume of child sexual abuse images is deeply concerning, of course, but at least we can turn that to our advantage using AI technology. New and improved GPUs are released yearly, with more focus on machine learning tasks, which makes it possible to keep up with the increased volumes of data. AI technology can help to lower the physical and mental burden for investigators, help in identifying victims and perpetrators more quickly and accurately, and ultimately stop and prevent child sexual abuse.
