From Gemma 3 270M to FunctionGemma, How Google AI Built a Compact Function Calling Specialist for Edge Workloads
Google has released FunctionGemma, a specialized version of the Gemma 3 270M model that is trained specifically for function calling…
Computer-aided design (CAD) systems are tried-and-true tools used to design many of the physical objects we use each day. But…
This AI Paper from Stanford and Harvard Explains Why Most ‘Agentic AI’ Systems Feel Impressive in Demos and then Completely Fall Apart in Real Use
Agentic AI systems sit on top of large language models and connect to tools, memory, and external environments. They already…
“At MIT, innovation ranges from awe-inspiring technology to down-to-Earth creativity,” noted Chronicle, during a campus visit this year for an episode…
Meta AI Open-Sourced Perception Encoder Audiovisual (PE-AV): The Audiovisual Encoder Powering SAM Audio And Large Scale Multimodal Retrieval
Meta researchers have introduced Perception Encoder Audiovisual (PE-AV), a new family of encoders for joint audio and video understanding.…
A “scientific sandbox” lets researchers explore the evolution of vision systems | MIT News
Why did humans evolve the eyes we have today? While scientists can’t go back in time to study the environmental pressures…
NVIDIA AI Releases Nemotron 3: A Hybrid Mamba Transformer MoE Stack for Long Context Agentic AI
NVIDIA has released the Nemotron 3 family of open models as part of a full stack for agentic AI, including…
Even networks long considered “untrainable” can learn effectively with a bit of a helping hand. Researchers at MIT’s Computer Science…
Meta AI Releases SAM Audio: A State-of-the-Art Unified Model that Uses Intuitive and Multimodal Prompts for Audio Separation
Meta has released SAM Audio, a prompt-driven audio separation model that targets a common editing bottleneck: isolating one sound…
Most languages use word position and sentence structure to extract meaning. For example, “The cat sat on the box,” is…