New Algorithm to Identify Illegal Websites Listed in Photos, Videos

By Press Trust of India | Updated: 8 September 2015 18:36 IST
A new computer algorithm that can "read" web addresses in images or videos could make blocking pornographic, gambling and other illegal sites easier for parents and law enforcement agencies, scientists say.

Internet marketers of all shades might add a website address, a URL, to a graphic or photo that might then be found through an image search engine.

The user finding such an image may be interested in visiting the site, but will have to type the URL into their browser's address bar to do so. The URL might also point to illicit content, such as pornography, gambling sites, illegal drugs or terrorist propaganda, the researchers said. In that context, those in authority, whether parents and guardians of children or law enforcement, may wish to blacklist such URLs automatically.

Now, Nikolay Neshov of the Technical University of Sofia, Bulgaria and colleagues at the University of Karlstad, Sweden, and the University of Belgrade, Serbia, have developed a computer algorithm that can detect the presence of text overlaid on to an image or a still from a video, extract the text and convert it into an active URL for accessing or blocking a website.
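The last step described here, turning recognised text into a URL that can be visited or blocked, can be illustrated with a minimal Python sketch. It is not the researchers' code: the pattern `URL_PATTERN`, the helpers `extract_urls` and `is_blacklisted`, and the sample host names are all hypothetical, and the text is assumed to have already come out of an OCR step.

```python
import re
from urllib.parse import urlsplit

# Hypothetical pattern: optional scheme, a dotted host name, an optional path.
URL_PATTERN = re.compile(
    r"(?:https?://)?(?:[a-z0-9-]+\.)+[a-z]{2,}(?:/[^\s]*)?",
    re.IGNORECASE,
)


def extract_urls(ocr_text):
    """Return URL-like strings found in OCR output, each with a scheme."""
    urls = []
    for candidate in URL_PATTERN.findall(ocr_text):
        # Drop punctuation that often trails a URL in running text.
        candidate = candidate.rstrip(".,;:!?")
        if not candidate.lower().startswith(("http://", "https://")):
            candidate = "http://" + candidate
        urls.append(candidate)
    return urls


def is_blacklisted(url, blocked_hosts):
    """Check the URL's host (ignoring a leading 'www.') against a block list."""
    host = urlsplit(url).hostname or ""
    if host.startswith("www."):
        host = host[4:]
    return host in blocked_hosts


# Example with made-up data: text as an OCR engine might return it.
found = extract_urls("Visit www.example.com/offers today!")
blocked = {"example.com"}
flags = [is_blacklisted(url, blocked) for url in found]
```

A real filter would need to cope with OCR misreads (for example "0" for "o"), which this sketch ignores.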

Simple optical character recognition (OCR) does not work well with text overlaid on images: the background is usually complex, and the text is likely to be of lower resolution, intensity and contrast than that seen in a scanned document or page. The new approach uses an identification and extraction technique that finds the anomalies that appear in an image when text is overlaid on it.

It then removes the details surrounding those anomalies leaving just the area occupied by any text - the team calls this the binarisation process. This isolated text image can then be fed into an OCR system to convert the image of the text into actual text in the computer.
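The article does not say how the team's binarisation works, so the sketch below substitutes a generic, well-known technique, Otsu's global threshold, purely to show what reducing a greyscale region to text and non-text pixels can look like. The function names `otsu_threshold` and `binarise` and the tiny sample image are assumptions made for illustration, and bright text on a dark background is assumed.

```python
def otsu_threshold(pixels):
    """Return the grey level (0-255) that best splits pixels into two classes."""
    histogram = [0] * 256
    for value in pixels:
        histogram[value] += 1

    total = len(pixels)
    sum_all = sum(level * count for level, count in enumerate(histogram))

    best_threshold, best_variance = 0, -1.0
    weight_bg = 0  # number of pixels at or below the candidate level
    sum_bg = 0     # sum of grey values of those pixels
    for level in range(256):
        weight_bg += histogram[level]
        if weight_bg == 0:
            continue
        weight_fg = total - weight_bg
        if weight_fg == 0:
            break
        sum_bg += level * histogram[level]
        mean_bg = sum_bg / weight_bg
        mean_fg = (sum_all - sum_bg) / weight_fg
        # Between-class variance (up to a constant factor).
        variance = weight_bg * weight_fg * (mean_bg - mean_fg) ** 2
        if variance > best_variance:
            best_variance, best_threshold = variance, level
    return best_threshold


def binarise(image):
    """Map a greyscale grid to 1 (assumed text) and 0 (assumed background)."""
    flat = [value for row in image for value in row]
    threshold = otsu_threshold(flat)
    return [[1 if value > threshold else 0 for value in row] for row in image]


# Made-up 3x3 region: bright pixels stand in for overlaid text.
sample = [
    [20, 22, 200],
    [21, 210, 205],
    [19, 23, 198],
]
mask = binarise(sample)
```

A single global threshold is the simplest choice; on the complex backgrounds the article describes, it would probably need to be applied per detected region or replaced with a local method.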

The team has successfully tested its algorithm on thousands of images with overlaid URLs. Using their approach, the researchers identified 619 URLs in a random selection of 1,000 test images, at a rate of three per second. Conventional OCR was faster but found only 83 URLs in the same 1,000 images, so the new method raises the detection rate from about 8 per cent to more than 60 per cent.
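The percentages follow directly from the reported counts, as the short Python calculation below shows. It assumes one overlaid URL per test image and reads "three per second" as three images processed per second; the article states neither explicitly, and the helper name `detection_rate` is hypothetical.

```python
TOTAL_IMAGES = 1000
FOUND_NEW_METHOD = 619   # URLs identified by the new algorithm
FOUND_PLAIN_OCR = 83     # URLs identified by conventional OCR
IMAGES_PER_SECOND = 3    # assumed reading of "three per second"


def detection_rate(found, total):
    """Share of test images in which a URL was identified, in per cent."""
    return 100.0 * found / total


new_rate = detection_rate(FOUND_NEW_METHOD, TOTAL_IMAGES)   # 61.9 per cent
ocr_rate = detection_rate(FOUND_PLAIN_OCR, TOTAL_IMAGES)    # 8.3 per cent
improvement_factor = FOUND_NEW_METHOD / FOUND_PLAIN_OCR     # roughly 7.5 times
seconds_for_test_set = TOTAL_IMAGES / IMAGES_PER_SECOND     # roughly 333 seconds

print(f"new method: {new_rate:.1f}%, plain OCR: {ocr_rate:.1f}%")
print(f"improvement: {improvement_factor:.1f}x, time: {seconds_for_test_set:.0f} s")
```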

The researchers' initial motivation was to assist computer forensic investigations, in which tens of thousands of illegal and illicit photos must be scanned and any associated websites identified quickly. This is critical in investigations of child pornography and child sexual abuse, the team said, but such work is often stymied by the vast numbers of images involved.

The research was published in the International Journal of Reasoning-based Intelligent Systems.

 
