Multimodal AI

News, repos, and tools about Multimodal AI. Auto-updated every 6 hours.

Latest News
Amazon's New AI Product Images Have Shoppers Scratching Their Heads
TechCrunch · Jun 3, 2026
Scorsese, at 82, becomes the most unlikely Hollywood voice for AI
TechCrunch · Jun 2, 2026
Class Action Targets Amazon's Ring Facial Recognition Feature
TechCrunch · Jun 2, 2026
Nvidia chases $200B CPU market with AI agent PCs from Microsoft, Dell, and HP
TechCrunch · Jun 1, 2026
Making sense of the debate over AI psychosis
TechCrunch · May 31, 2026
AI grifters are creating fake Black people to sell Shein junk
The Verge · May 30, 2026
Meta is reportedly developing an AI pendant
TechCrunch · May 30, 2026
Tech companies desperately want to film you doing chores
The Verge · May 29, 2026
This AI startup will clean your home for free to train future robots
The Verge · May 29, 2026
YouTube will let you ask AI to make a custom video feed
The Verge · May 28, 2026
Did the Pope use AI to write about the dangers of AI?
The Verge · May 27, 2026
This startup is betting India’s gig economy can train the world’s robots
TechCrunch · May 26, 2026
Universal Music Group and TikTok renew agreement to combat unauthorized AI music
TechCrunch · May 26, 2026
AI warfare is already here
The Verge · May 26, 2026
Chatbot 'Personalities' Become Hacker Playground With Simple Tricks
The Verge · May 24, 2026
Google’s new anything-to-anything AI model is wild
The Verge · May 23, 2026
Elon Musk has given up on solar power (on Earth)
TechCrunch · May 23, 2026
AI is being used to resurrect the voices of dead pilots
TechCrunch · May 22, 2026
Google's AI Glasses With Displays Almost Ready for Fall Launch
TechCrunch · May 22, 2026
Spotify takes on Google’s NotebookLM with its new app
TechCrunch · May 21, 2026

Trending Repos
harry0703/MoneyPrinterTurbo
Generate high-definition short videos with one click using large AI models.
Python · 78.7k stars
OpenBMB/VoxCPM
VoxCPM2: Tokenizer-Free TTS for Multilingual Speech Generation, Creative Voice Design, and True-to-Life Cloning
Python · 25.5k stars
safishamsi/graphify
AI coding assistant skill (Claude Code, Codex, OpenCode, Cursor, Gemini CLI, and more). Turn any folder of code, SQL sch
Python · 58.8k stars
heygen-com/hyperframes
Write HTML. Render video. Built for agents.
TypeScript · 23.9k stars
OpenMOSS/MOSS-TTS
MOSS‑TTS Family is an open‑source speech and sound generation model family from MOSI.AI and the OpenMOSS team. It is des
Python · 2.9k stars
openclaw/Peekaboo
Peekaboo is a macOS CLI & optional MCP server that enables AI agents to capture screenshots of applications, or the enti
Swift · 4.6k stars
calesthio/OpenMontage
World's first open-source, agentic video production system. 12 pipelines, 52 tools, 500+ agent skills. Turn your AI codi
Python · 4.3k stars
adamlyttleapps/claude-skill-aso-appstore-screenshots
A Claude Skill that automates the creation of App Store screenshots for ASO (App Store Optimization), leveraging AI to g
Python · 1.4k stars

Related Tools
Midjourney
Leading AI art generator known for highly aesthetic, photorealistic, and artistic image outputs. Web app and Discord.
DALL-E
OpenAI image generator integrated into ChatGPT, creating and editing images from natural language prompts.
Stable Diffusion
Open-source AI image model by Stability AI that can run locally or via API for unrestricted generation.
Leonardo
AI image and video generation platform with fine-tuned models for game assets, design, and art.
Ideogram
AI image generator known for exceptional text rendering accuracy within generated images.
Flux
State-of-the-art open-source image generation model by Black Forest Labs with excellent prompt adherence.
Runway
Leading AI video generation platform with Gen-4 Turbo model for creating and editing cinematic video from text and images.
Pika
AI video creation platform that generates and edits video clips from text prompts with cinematic quality.
Kling
Advanced AI video generation model producing high-quality, physics-aware video from text and image inputs.
Sora
OpenAI text-to-video model capable of generating realistic scenes with complex motion and camera movements.