AI Search
May 10, 2026
1. 3D Generation & Reconstruction
RecGen reconstructs 3D objects from RGBD images by learning from 200K synthetic assets. Fizz Forge generates physics-grounded 3D assets for games and robotics. D-Rex creates photorealistic relightable avatars. Map to World generates explorable 3D worlds from segmentation maps.
2. Image & Video Generation
Hydream O1 Image excels at text rendering and 2K generation without VAE (32GB models). UniVid X generates intrinsic video properties enabling relighting and background replacement. Swift I2V produces 2K videos from single images on RTX 4090. Bach 1.0 video generator ranks #6, generates 30-second 1080p videos with native sound.
3. Language Models & Efficiency
Gemma 4 gains 3.1x speedup via multi-token prediction/speculative decoding. Zia 1 8B, trained on AMD hardware, matches models 40-100x larger using compressed convolutional attention and Markovian RSA reasoning. CDM acceleration reduces diffusion steps from 20-50 to 4 while maintaining quality.
4. Robotics & Dexterous Manipulation
Genesis 2.6.5 enables humanoid robots to cook, solve Rubik's cubes, and perform lab tasks through dexterous hand control. Momo Act 2 (Allen AI) outperforms competitors in real-world zero-shot tests with 700-hour bimanual dataset. Boston Dynamics Atlas demonstrates superhuman range of motion including 180° rotations.
5. Algorithm Evolution & Scientific AI
AlphaEvolve improves real-world algorithms: 30% DNA sequencing error reduction, electricity grid optimization 14%→88%, disaster prediction +5% accuracy, quantum circuits 10x lower error. Lab OS integrates AI with XR glasses for real-time scientific lab guidance and protocol following.
6. Voice & Real-Time Interaction
OpenAI releases GPT Realtime 2 for natural conversational AI, GPT Realtime Translate for live translation across 70 languages into 13 outputs, and GPT Realtime Whisper for real-time transcription. All available via API with per-model pricing.
7. Open-Source & Infrastructure
Program Bench stress-tests AI coding: no models achieved 100% on 200 tasks (reverse-engineering full programs). Sakana AI + Nvidia's TW sparse format achieves 30% inference speedup and 30% energy reduction on H100s through efficient computation of non-zero values.
8. AI Benchmarks & New Labs
Video Rebirth's Bach 1.0 ranked #6 on blind tests. Hydream O1 ranks #8 on Artificial Analysis leaderboard, top open-source. Zia 1 first model trained on AMD hardware, demonstrating GPU vendor diversity. Genesis AI and Engine AI showcase robot fighting demos.