News/Machine Vision
Roboflow helps 1 million developers make computer vision accessible across industries
Can you feel the flow? Computer vision startup Roboflow is making AI more accessible by democratizing visual technology for organizations across diverse industries. The company's platform has become a critical tool for developers and businesses looking to implement computer vision solutions without extensive technical expertise, highlighting a broader trend of specialized AI tools becoming increasingly user-friendly and practical for solving real-world problems. The big picture: Roboflow is on a mission to make the world programmable through computer vision, building tools that help businesses extract meaningful insights from visual data. The startup has gained impressive traction, with over one million developers...
read Mar 5, 2025Augmented retail: Google morphs online shopping with AI that visualizes clothing before you buy
Google is rapidly transforming search into a personalized shopping experience through AI-powered visual tools. The company's new features allow users to visualize clothing and makeup products before purchasing, addressing a common consumer pain point where shoppers struggle to find items matching their mental vision. These developments represent a significant evolution in Google's AI integration strategy, making online shopping more intuitive while keeping users within Google's ecosystem rather than navigating to specialized retailer sites. The big picture: Google has launched "vision match," an AI tool that generates images based on detailed clothing descriptions and then suggests similar purchasable items. Mobile users...
read Feb 22, 2025Vision Pro finally gets Apple Intelligence, plus new iOS app
The Apple Vision Pro headset first launched in early 2024 as Apple's entry into spatial computing, offering users a mixed reality experience for productivity and entertainment. The upcoming visionOS 2.4 update marks Apple's first major AI integration for the device. Key Features Overview: The update introduces Apple Intelligence, a suite of AI-powered tools that enhance the Vision Pro's native capabilities through natural language processing and computer vision. Vision Pro users will gain access to AI writing assistance across native apps, allowing voice-prompted edits and proofreading A new feature called Genmojis enables the creation of personalized emoji reactions Image Playground integration...
read Feb 21, 2025AI running startup Ochy raises $1.7M, integrates with Adidas adiClub
Running biomechanics startup Ochy has developed an AI-powered platform that analyzes runners' gaits using smartphone cameras, making professional-level biomechanical analysis accessible to everyday athletes. The French company has recently secured new funding and formed a significant partnership with a major athletic brand. Key Investment Details: Ochy has raised USD 1.7 million in pre-seed funding to expand its AI-driven biomechanics platform and enhance its global presence. The funding round was led by Redstone's Social Impact Fund, with participation from Look AI Ventures, BPI France, Berkeley SkyDeck, and strategic investor Agile Physical Therapy The capital will support advancements in computer vision AI...
read Feb 20, 2025Apple brings AI-powered Visual Intelligence to iPhone 15 Pro
Apple is expanding its Visual Intelligence feature beyond its latest flagship devices. This AI-powered tool helps users identify and learn about objects, landmarks, and other items through their iPhone's camera by leveraging both Google search and ChatGPT. Key Development: Apple plans to extend Visual Intelligence functionality to iPhone 15 Pro models through an upcoming iOS 18.4 update, expected to launch in early April 2025. The feature will be accessible through the Action button on iPhone 15 Pro models, offering a different activation method from the Camera Control button used on iPhone 16 devices A new Control Center button will also...
read Feb 20, 2025AI cameras to detect phone and seatbelt offenses in UK Police trial
The growing prevalence of distracted driving and seatbelt violations has prompted UK police forces to explore advanced surveillance technologies. Essex Police is set to become the latest force to trial AI-powered cameras that can detect drivers using mobile phones and not wearing seatbelts. What you need to know: Essex Police will deploy high-definition AI cameras starting April 2025 to identify and document traffic violations related to mobile phone use and seatbelt compliance. The cameras, developed by tech company Acusensus, are currently in use across 19 police regions in England The system will be deployed using relocatable trailers positioned along roadsides...
read Feb 19, 2025Google Lens for iPhone just got a cool new AI upgrade
Artificial intelligence and augmented reality have dramatically enhanced visual search capabilities, with Google Lens serving as a primary tool for identifying objects and text through smartphone cameras. Google has announced significant updates to Google Lens that streamline the search process for iPhone users while expanding AI-powered search results. Key developments: Google is launching two major updates to Google Lens that enhance visual search capabilities on iOS devices through both Chrome and the Google app. Users can now instantly search screen content without taking screenshots or opening new tabs The feature enables quick information lookup by highlighting, drawing over, or tapping...
read Feb 19, 2025Georgia Tech PhD student trains humanoid robots with AR glasses
Call it magnificent mimicry. The rapid advancement in humanoid robotics has been limited by slow, manual data collection methods requiring direct robot operation. Georgia Tech researchers have developed a breakthrough approach using Meta's Project Aria glasses to capture human behaviors that can train robots more efficiently. Key innovation: EgoMimic, developed by PhD student Simar Kareer at Georgia Tech's Robotic Learning and Reasoning Lab, uses egocentric recordings from Aria glasses to create training data for humanoid robots. The framework combines human-recorded data with robot data to teach robots everyday tasks Traditional robot training requires hundreds of manual demonstrations through direct robot...
read Feb 18, 2025AI detects potholes in UK roads before they happen
Pothole detection is moving beyond eyes-on-the-street and frustrated outreach to a local government representative... The development of pothole detection technology continues to advance as local governments seek proactive solutions to road maintenance challenges. Hertfordshire County Council in the UK is testing an AI-powered road scanning system that could transform how municipalities identify and address road surface deterioration before it becomes problematic. Project Overview: Hertfordshire County Council is conducting trials of the ARRES Eye scanner, developed by Robotiz3d, to detect potential road surface issues before they develop into potholes. The AI-powered scanner can be mounted on council vehicles to collect data...
read Feb 13, 2025‘Knife Hunter’ AI tool hopes to cut down on UK knife crimes
Global policing efforts to combat rising knife crime have gained a powerful ally with the development of Knife Hunter, an AI-based tool created by Surrey University's Institute for People-Centred AI in partnership with the Metropolitan Police. Knife crime in England and Wales saw a 4% increase from 2023 to 2024, with over 50,000 offenses recorded during this period. System capabilities and design: Knife Hunter leverages artificial intelligence to identify and catalog knives while tracking their origins and patterns of use in criminal activities. The AI system has been trained on more than 25,000 images spanning 550 different knife types The...
read Feb 11, 2025AI-powered Roborock Saros 10 and 10R take robo-vacuums to new low with ultra-thin products for hard-to-reach places
Roborock has unveiled two new AI-powered robot vacuum cleaners, the Saros 10 and Saros 10R, designed to tackle complex cleaning challenges in larger homes with mixed flooring surfaces. These models represent significant advances in robotic cleaning technology, incorporating machine learning algorithms for both cleaning performance and navigation capabilities. Core Design Features: The Saros models introduce groundbreaking physical specifications and adaptability features that set them apart in the robot cleaner market. At just 3.15 inches high, these are the thinnest Roborock cleaners ever produced, enabling access under low furniture Four independent lifting modes allow the robots to adjust their chassis, mop,...
read Feb 8, 2025Oxford researchers use AI to decode ancient Greek scroll
Latest discoveries; A team of researchers has identified several ancient Greek words including terms for "foolish," "disgust," "fear," and "life" in a newly analyzed Herculaneum scroll housed at Oxford's Bodleian Libraries. The scroll, designated as PHerc. 172, shows evidence of being authored by Epicurean philosopher Philodemus based on its first-century BCE letter forms Initial analysis suggests similarities between this scroll's handwriting and other works attributed to Philodemus The text appears to be continuous throughout the entire scroll, promising substantial readable content once fully decoded Historical context; The scroll comes from the Villa of the Papyri in Herculaneum, a Roman town...
read Feb 7, 2025AI app detects skin cancer with 99.8% accuracy
Mobile AI Tool Shows Promise in Early Skin Cancer Detection Key Innovation: British tech company Skin Analytics has developed DERM, a smartphone-based artificial intelligence system designed to detect skin cancer with remarkable accuracy. • The AI-powered system has secured regulatory approval for clinical use • DERM boasts a 99.8% accuracy rate specifically in ruling out skin cancer • Healthcare technicians can capture photos of suspicious skin lesions and receive near-instant diagnostic results Clinical Significance: Early detection of skin cancer significantly improves patient outcomes and survival rates. • The technology enables rapid preliminary screening of suspicious skin lesions • Healthcare providers...
read Feb 5, 2025AI decodes 2000-year-old Roman scroll damaged by volcano
Mount Vesuvius's eruption preserved a charred Roman scroll that has finally been deciphered after 2000 years through innovative use of AI and advanced X-ray technology. Historical context: The papyrus scroll is one of 1800 artifacts recovered from a villa in Herculaneum (modern-day Ercolano) during the 1750s, all carbonized by the intense heat of volcanic debris. Initially mistaken for firewood by locals, these scrolls were later recognized as valuable historical texts Around 200 scrolls have been carefully opened using mechanical clock-based devices Three of these scrolls were acquired by Oxford's Bodleian Library through an unusual trade involving King George III and...
read Jan 24, 2025Hugging Face shrinks its AI vision models to operate on smartphones
Hugging Face's new SmolVLM vision-language AI models achieve superior performance while running on smartphones and small devices, marking a significant advancement in AI efficiency and accessibility. Key innovation details: SmolVLM represents a dramatic reduction in model size while improving capabilities compared to its predecessors. The SmolVLM-256M model operates on less than 1GB of GPU memory yet outperforms Hugging Face's previous 80 billion parameter Idefics model The technology comes in two sizes: 256M and 500M parameters, representing a 300x reduction from earlier models The smallest version can process 16 examples per second using only 15GB of RAM with a batch size...
read Jan 23, 2025OpenAI’s computer-controlling AI agent has arrived — here’s what it can do
OpenAI has introduced "Operator," a new AI agent that can autonomously perform web-based tasks for ChatGPT Pro subscribers in the United States. Core technology and capabilities: Operator employs a "Computer-Using Agent" (CUA) model that combines GPT-4o's visual processing abilities with reinforcement learning to interact with web interfaces like a human user. The agent can interpret screenshots and perform basic computer actions like typing, clicking, and scrolling Operator navigates web interfaces independently to complete tasks such as ordering groceries and making reservations Unlike traditional AI models, Operator doesn't rely on predefined APIs, allowing for more flexible interaction with websites Strategic partnerships:...
read Jan 23, 2025Hugging Face just made its small AI models even smaller (and multimodal)
Hugging Face has released two new additions to the SmolVLM model family. The new compact Vision Language Models - a 256M parameter version and a 500M parameter version - are designed to deliver efficient multimodal AI capabilities while maintaining a small computational footprint. Core innovations; The new SmolVLM models represent significant architectural improvements over their 2B parameter predecessor, introducing key optimizations for real-world applications. The models now utilize a streamlined 93M parameter SigLIP vision encoder, drastically reduced from the previous 400M version Higher resolution image processing capabilities enable enhanced visual comprehension New tokenization optimizations boost performance in practical applications The...
read Jan 23, 2025Take a picture of your hardest homework problem and this AI mobile app might help you solve it
SpeedTutorAI, a new AI-powered learning assistant app for iOS devices, offers comprehensive educational support through photo-based problem solving and lecture transcription capabilities. Product Overview: SpeedTutorAI Premium combines advanced AI technology with practical learning tools to help students master various academic subjects. The app enables users to photograph homework problems for step-by-step solutions across multiple subjects including algebra, calculus, chemistry, and grammar Built-in lecture recording features provide AI-powered transcription and automatic summarization capabilities An interactive AI tutor component offers real-time assistance and explanations for complex topics Technical Requirements and Accessibility: The platform is exclusively designed for Apple's mobile ecosystem with specific...
read Jan 21, 2025How AI, drones and sensor tech will help fight forest fires in the future
AI and technology companies are developing new tools to detect and combat wildfires in California, spurred by recent devastating fires in Los Angeles. Current landscape of firefighting innovation: Multiple startups and established technology companies are creating AI-powered solutions to address different aspects of wildfire management. Rain, a California startup, is developing software to enable autonomous helicopters and aircraft to detect and suppress fires more efficiently ALERTCalifornia has deployed an AI system that monitors video feeds from over 1,144 cameras across the state, helping fire departments identify fires in their early stages BurnBot is creating large vehicles equipped with propane torches...
read Jan 18, 2025New AI cameras in the UK are catching speeders in droves
AI-powered traffic cameras deployed by Humberside Police in the UK caught 849 traffic violations during a two-week trial in 2023, primarily focusing on mobile phone use and seat belt violations. Trial results and implementation: The innovative camera system was tested over two separate weeks in March and June 2023, demonstrating significant effectiveness in detecting traffic violations. The cameras identified 533 cases of drivers not wearing seat belts and 2 instances of children under 14 without proper restraints 301 violations involved mobile phone use while driving 13 cases were recorded of drivers not maintaining proper vehicle control Technology capabilities and process:...
read Jan 18, 2025AI security camera startup Coram AI secures $13.8M Series A
Coram AI has secured $13.8 million in Series A funding to advance its AI-powered video security platform that integrates with existing IP cameras for real-time analysis and insights. Funding details and leadership: Battery Ventures led the Series A round, with participation from existing investors 8VC and Mosaic Ventures, marking a significant milestone for the AI security startup. The funding round closed on January 16, 2025, bringing total investment to $13.8 million Battery Ventures Partner Marcus Ryu, former co-founder and CEO of Guidewire Software, joins Coram's board The company was founded by former Lyft autonomous driving executives Ashesh Jain and Peter...
read Jan 15, 2025AI traffic cameras in UK city destroyed within hours of installation
An AI traffic camera installed in Southampton was destroyed within hours of its installation on the A3024 Northam Bridge. Installation specifics: The advanced monitoring system was mounted on Tuesday on a central island of the Northam Bridge that previously housed a conventional speed camera. The new system was designed to monitor traffic in both directions across all lanes The installation site already had infrastructure in place from the previous speed camera system Camera capabilities: The AI-powered traffic monitoring system represented a significant upgrade from traditional speed cameras, offering enhanced detection capabilities. The system could identify drivers using mobile phones or...
read Jan 15, 2025Google’s Gemini AI can now process video and image simultaneously — here’s why it’s important
Google's Gemini AI has achieved simultaneous processing of multiple visual streams through an experimental application called AnyChat, marking a significant advancement in AI visual processing capabilities. The breakthrough explained: Through AnyChat, Gemini can now process live video feeds and analyze static images at the same time, a capability previously thought impossible in AI systems. AnyChat creator Ahsen Khaliq revealed that this functionality exists within Gemini's API but hasn't been implemented in Google's official applications The system can maintain conversational coherence while handling multiple visual inputs simultaneously This capability stands in contrast to other AI platforms like ChatGPT, which must disable...
read Jan 14, 2025Wyze’s new AI-powered cameras turn security footage into text notifications
Wyze has launched an AI-powered feature that generates detailed text descriptions of security camera footage for its users. What's new: Wyze's Descriptive Alerts feature provides detailed text summaries of motion events captured by its security cameras, going beyond basic motion detection notifications. The system can describe specific details like colors and actions, such as noting the color of a delivery person's hat or describing vehicles in view Alerts provide contextual information about events occurring in the camera's field of view Users receive these descriptions as notifications, reducing the need to manually review security footage Service details and availability: The new...
read