Machine Vision

Jan 14, 2025

Pebble Beach introduces AI cameras and shot tracing tech to its nine-hole golf course

Pebble Beach Resorts has introduced ReelGOLF®, an AI-powered video capture system, at The Hay's ninth hole, allowing golfers to create and share high-quality videos of their shots. What's new: The ReelGOLF Experience combines AI-enabled cameras, shot tracing technology, and social sharing capabilities to enhance the golfing experience at the Tiger Woods-designed short course. AI-powered Bosch cameras positioned near tee boxes and greens automatically record players' shots. Players activate the system by scanning a QR code, receiving enhanced videos with shot tracing and branding within minutes. Starlink satellite internet ensures reliable video delivery even in remote course locations. Technology implementation: The system...

Jan 6, 2025

Xthings launches new line of AI security cameras at CES 2025

CES 2025 attendees can experience a new lineup of AI-powered security cameras from Xthings (under the Ulticam brand), featuring edge AI processing and subscription-free intelligent monitoring capabilities. Product Overview: The new Ulticam series includes three distinct models targeting different home security needs. The Ulticam IQ Floodlight combines security camera functionality with powerful floodlights for comprehensive outdoor monitoring. The Ulticam IQ offers spotlight capabilities in a more compact outdoor camera design. The Ulticam Dot provides a portable, wireless security solution with extended battery life. Technical Capabilities: Edge AI technology enables advanced features without requiring costly subscription services. Built-in artificial intelligence processors...

Jan 6, 2025

Microsoft’s Copilot Vision watches your online activities — and will even talk to you about them

The Innovation: Microsoft has launched Copilot Vision, an AI-powered visual analysis tool within Microsoft Edge that enables voice-based conversations about web content and images. Key Features and Functionality: Copilot Vision, available to select Copilot Pro subscribers for $20 monthly, combines visual analysis with conversational AI capabilities to enhance web browsing experiences. The tool offers four distinct AI voice options for natural interactions, allowing users to engage in conversations about webpage content. Users can receive detailed descriptions of images, text summaries, and contextual information about the content they're viewing. The system can analyze both visible and non-visible page elements, providing comprehensive...

Jan 3, 2025

Earthcam’s new visual AI tracks construction materials and safety protocols

Earthcam has launched enhanced visual AI capabilities that monitor construction materials and safety protocols, integrating with popular project management platforms like Procore and Autodesk. Core technology advancement: Earthcam's new "Material Analysis" feature represents a significant expansion of their AI capabilities beyond basic safety monitoring. The system automatically tracks material deliveries and installations on construction sites, providing real-time updates and alerts. Integration with Procore and Autodesk enables automatic project schedule updates based on visual confirmations. The AI can generate instant notifications with photographic evidence when materials arrive on site. Color-coding functionality helps visualize different materials and their installation progress. Practical applications:...

Dec 26, 2024

Dareesoft’s vehicle-mounted AI sensors detect road hazards in real time

Tech startup Dareesoft has successfully demonstrated its AI-powered road hazard detection system in Dubai, showing that it can identify over 2,000 road hazards in real time. Key development: The proof of concept (PoC) in Dubai validates Dareesoft's RiaaS (Road Hazard Information as a Service) system, marking a significant step in the company's Middle East expansion plans. The successful demonstration leads Dareesoft to plan expansion across the UAE, Saudi Arabia, Qatar, and Kuwait. The system demonstrated its capability to detect over 2,000 road hazards in real time during the Dubai PoC. The announcement was made on December 25, signaling immediate expansion plans. Technology overview:...

Dec 21, 2024

Stanford research shows how next-gen neural networks may live in hardware

Technical breakthrough: Stanford University researchers have created a novel approach to implementing neural networks directly in computer hardware using logic gates, the fundamental building blocks of computer chips. The new system can identify images significantly faster while consuming only a fraction of the energy compared to traditional neural networks. This innovation makes neural networks more efficient by programming them directly into computer chip hardware rather than running them as software. The technology could be particularly valuable for devices where power consumption and processing speed are critical constraints. Methodology and implementation: Felix Petersen, a Stanford postdoctoral researcher, developed a sophisticated training...

Dec 20, 2024

The UK is using AI to detect drunk and high drivers in ‘world-first’ trial

AI technology is being deployed in Devon and Cornwall to detect intoxicated drivers through sophisticated behavioral analysis, marking a world-first trial of this innovative road safety measure. The technology's core capabilities: The Heads-Up camera system, developed by Acusensus, analyzes driving patterns to identify behaviors consistent with alcohol or drug impairment. The AI-powered camera can detect concerning road use and behavioral patterns that may indicate an impaired driver. Police officers stationed further along the road can then stop flagged vehicles for roadside testing. The system can be rapidly relocated throughout Devon and Cornwall without prior notice. Operational strategy and implementation: The...

Dec 16, 2024

How AI and emerging tech are expanding wildlife conservation efforts

Artificial intelligence and machine learning technologies are transforming wildlife conservation research by enabling scientists to process and analyze massive datasets of animal sounds, images, and behaviors at unprecedented scales. Key technological breakthroughs: Machine learning algorithms are making it possible to monitor wildlife populations and ecosystems with greater accuracy and efficiency than ever before. Conservation Metrics, led by Matthew McKown, has developed AI systems that can analyze thousands of hours of audio recordings from natural habitats, including coral reefs and forests. The technology can identify and track specific animal calls, songs, and behaviors, providing researchers with detailed data about population patterns...

Dec 16, 2024

AI pioneer Fei-Fei Li’s mission to unlock advanced AI through ‘spatial intelligence’

Background and historical context: Stanford professor Fei-Fei Li, known for creating the groundbreaking ImageNet dataset, has launched World Labs, a startup focused on developing AI systems with sophisticated spatial awareness and 3D understanding. Li's ImageNet project and its 2012 competition marked a turning point in AI history when the neural network AlexNet demonstrated unprecedented object recognition capabilities. This breakthrough helped catalyze the deep learning revolution, leveraging vast internet training data and GPU computing power. Li also cofounded Stanford's Institute for Human-Centered AI (HAI) to advance computer vision research. Core technology and innovation: World Labs is developing AI systems that can...

Dec 15, 2024

Practical tasks you can do with ChatGPT’s Advanced Voice with Vision feature

OpenAI has expanded ChatGPT's capabilities with Advanced Voice with Vision, a new feature combining voice interaction and image processing capabilities for Plus and Pro subscribers. Launch details and availability: OpenAI unveiled Advanced Voice with Vision during their '12 Days of OpenAI' demonstration, marking a significant expansion of ChatGPT's interactive capabilities. The feature is exclusively available to ChatGPT Plus and Pro subscribers who pay the $20 monthly fee. A special 'Chat with Santa' feature will be accessible to all users, including those on the free tier. The rollout is happening gradually on a global scale. Core functionality: Advanced Voice with Vision...

Dec 15, 2024

How to try ChatGPT’s new Advanced Voice with Vision feature

The recent integration of voice commands and visual analysis capabilities into ChatGPT marks a significant advancement in making artificial intelligence more accessible and interactive for everyday users. New Feature Overview: OpenAI has unveiled Advanced Voice with Vision as part of their '12 Days of OpenAI' demonstration, combining voice interaction and image analysis capabilities within ChatGPT. The feature enables users to interact with ChatGPT through spoken commands while also analyzing uploaded images and video content. This enhancement is exclusively available to ChatGPT Plus and Pro subscribers, who pay $20 monthly for access. A special 'Chat with Santa' feature has been made...

Dec 10, 2024

Solos unveils $299 ChatGPT-powered smart glasses

The race to dominate the smart glasses market intensifies as Solos launches a direct competitor to Meta's Ray-Ban smart glasses, bringing AI-powered visual recognition capabilities to everyday eyewear. Product Overview: Solos has released the AirGo Vision, a $299 smart eyewear system that integrates OpenAI's GPT-4o AI model for visual recognition and interactive features. The glasses can identify objects, people, and text in the wearer's field of view. Built-in AI capabilities enable real-time translation, navigation assistance, and contextual information delivery. The device supports multiple AI models, including Google Gemini and Anthropic's Claude. Key Features and Design: The AirGo Vision introduces innovative...

Dec 6, 2024

Microsoft Copilot Vision lets AI analyze your online activities

Microsoft's AI assistant Copilot is expanding its capabilities with a new vision feature that allows it to see and analyze web content alongside users as they browse the internet using the Edge browser. Initial rollout and availability: Microsoft has begun previewing Copilot Vision with a select group of Pro subscribers in the United States who are enrolled in the early-access Copilot Labs program. The feature is currently limited to specific websites and requires users to opt in explicitly. Users can activate Copilot Vision while browsing to analyze webpage contents, including both text and images. Microsoft emphasizes that user privacy is protected...

Dec 5, 2024

Google launches PaliGemma 2 vision language models

Google's latest contribution to the field of artificial intelligence combines advanced vision and language capabilities in a powerful new model called PaliGemma 2, representing a significant step forward in multimodal AI technology. Core architecture and capabilities: PaliGemma 2 integrates SigLIP for visual processing with Gemma 2 for text generation, creating a versatile vision-language model that can handle multiple image resolutions and text-based tasks. The model comes in three sizes: 3B, 10B, and 28B parameters, offering flexibility for different computational needs and use cases. Supported image resolutions range from 224x224 to 896x896, enabling analysis of both standard and high-resolution images. The...

Dec 5, 2024

Microsoft’s new Copilot Vision one-ups ChatGPT with web browsing assistance

Microsoft's Copilot Vision introduces a new dimension to AI-assisted web browsing by enabling real-time visual understanding and interaction capabilities within the Microsoft Edge browser. Latest development: Microsoft has launched Copilot Vision in preview, offering Pro subscribers an AI assistant that can view and understand users' online activities in real time through Copilot Labs. The feature enables Copilot to read along with users, discuss browsing issues, and provide contextual insights based on visual information. Users can interact with Copilot Vision through natural verbal communication, making it more accessible and user-friendly. Initial rollout is limited to select websites and Pro subscribers in the...

Dec 5, 2024

Berkeley AI uncovers forgotten oil and gas wells in historical maps

The discovery and management of hundreds of thousands of undocumented oil and gas wells across the United States present significant environmental and safety challenges, with researchers now leveraging artificial intelligence to locate these potentially hazardous sites. The critical challenge: Undocumented orphaned wells (UOWs) pose significant environmental risks through potential chemical leaks into water sources and methane emissions into the atmosphere. Experts estimate between 310,000 and 800,000 undocumented wells exist across the United States. These wells can leak toxic substances like benzene and hydrogen sulfide. Methane emissions from these wells are particularly concerning, as methane is 28 times more potent than...

Dec 3, 2024

OpenAI poaches 3 senior engineers from DeepMind

The highly competitive AI talent market continues to see major moves as leading companies vie for top researchers and engineers to advance their artificial intelligence capabilities. Key development: OpenAI has recruited three senior computer vision and machine learning engineers from Google DeepMind to work in its new Zurich office, focusing on multimodal AI development. Lucas Beyer, Alexander Kolesnikov, and Xiaohua Zhai will join OpenAI's efforts to advance multimodal AI, which combines different types of inputs like text, images, and audio. The three researchers previously worked together at DeepMind and will continue their collaboration at OpenAI. Beyer has been an active...

Dec 3, 2024

AI cameras catch bus lane violators in Seattle

Seattle's King County Metro has initiated a pilot program using AI-powered cameras on buses to monitor and document transit-only lane violations, marking a significant step in leveraging technology to improve public transportation efficiency. Project overview: King County Metro has mounted AI cameras on two bus routes to collect data on drivers illegally using transit-only lanes, which are typically marked in red to distinguish them from regular traffic lanes. The 60-day demonstration program began November 6, focusing on the RapidRide E Line along Aurora Avenue North and Route 7 along Rainier Avenue South. While no tickets will be issued during this...

Dec 3, 2024

Henry Minsky launches AI startup inspired by father’s MIT research

The intersection of legacy and innovation in artificial intelligence takes a new turn as Henry Minsky, son of AI pioneer Marvin Minsky, launches a startup focused on video analysis and efficiency improvements. The founding vision: Leela AI, a Somerville-based startup, comes from a rich heritage in artificial intelligence research, with technology rooted in decades-old principles from MIT's AI Lab. Henry Minsky, serving as chief technology officer, brings forward a newly patented method of visual intelligence. The company's software analyzes video footage to provide recommendations for improving efficiency and safety. Currently in seed stage, Leela AI has three additional patents pending...

Nov 30, 2024

AI in winemaking: Ancient art meets modern tech

The integration of artificial intelligence into traditional winemaking practices is creating new opportunities for precision and quality improvement at prestigious wineries like Chateau Montelena in Napa Valley. Current state of AI in winemaking: While artificial intelligence applications in wine production are still emerging, they are already providing valuable insights across the entire winemaking process. Chateau Montelena's winemaker Matt Crafton notes that AI is in its early stages but showing promise in multiple areas of wine production. The technology is being applied from vineyard management through to final bottling processes. The winery maintains a balance between technological innovation and traditional winemaking...

Nov 27, 2024

Hugging Face claims its new AI model SmolVLM will slash business AI costs

Hugging Face's release of SmolVLM represents a significant advancement in making vision-language AI more accessible and cost-effective for businesses, offering comparable performance to larger models while requiring substantially less computing power. Key innovation details: SmolVLM is a compact vision-language model that can process both images and text while using significantly fewer computational resources than existing alternatives. The model requires only 5.02 GB of GPU RAM, compared to competitors Qwen-VL 2B and InternVL2 2B, which need 13.70 GB and 10.52 GB respectively. SmolVLM utilizes 81 visual tokens to encode image patches of size 384×384, enabling efficient processing of visual information. The...

Nov 26, 2024

This new AI model can detect counterfeit Lacoste items from photos

The race to combat counterfeit goods has gained a powerful new ally with the development of an AI image-recognition model capable of identifying fake products from photographs. Technology breakthrough: Vrai AI, a French company whose name means "true," has developed an AI system that can detect counterfeit products with 99.7% accuracy by analyzing visual details. The system was trained on thousands of images of genuine products to identify subtle discrepancies that may indicate counterfeiting. The technology can distinguish between normal manufacturing variations and actual counterfeits. David G. Stork, the company's chief scientist and Stanford University visiting professor, brings expertise in...

Nov 26, 2024

DoD announces AI upgrades to the US Capitol air defense system

The rapid advancement of security technology has reached the U.S. Capitol as defense officials implement AI-enhanced surveillance systems to protect the nation's capital from airborne threats. Modernizing Capitol defenses: The Department of Defense is upgrading its post-9/11 security network around Washington, D.C. with an advanced AI-powered visual recognition and identification system. The new system incorporates both standard visual and infrared cameras with enhanced range and high-definition capabilities. AI-driven machine learning features enable automatic target identification and tracking. Laser range finders can precisely determine aircraft altitude and distance. Special lasers can illuminate aircraft cockpits when planes deviate from approved flight paths...

Nov 23, 2024

Conservationists in the UK turn to AI to save red squirrels

AI technologies are emerging as crucial tools in wildlife conservation efforts, with a new system called Squirrel Agent demonstrating promising results in protecting endangered red squirrels in the UK. The innovation: Squirrel Agent is an AI-powered system that can distinguish between red and grey squirrels with 97% accuracy, enabling automated control of specialized feeding stations. The technology has been trained on thousands of squirrel images to recognize distinctive features between the two species. The system controls access to feeders, directing red squirrels to food sources while steering grey squirrels toward contraceptive-containing stations. Five wildlife charities are currently testing the system...
