A Curated List of AI Generators
From art to business, generative AI is making waves across the globe. It's set to fundamentally reshape our society. Here's a curated list of AI generators covering text, image, video, music, and code generation. Think of it as your go-to directory for exploring these emerging technologies.
- πΌοΈ Image Generators
- π Text Generators
- π¬ Video Generators
- π€ Voice Generators
- π΅ Music Generators
- π€ Code Generators
- π§ 3D Generators
- π AI Tutors
- 𧬠AI for Science
πΌοΈ Image Generators
General tools (text to image)
- Flux: high quality, exceptional prompt adherence and text rendering capabilities. Access. open source recommended
- Midjourney: high quality, best aesthetics.
- DALL-E: strong prompt following, good for illustration. Access from ChatGPT or Microsoft Copilot.
- Ideogram: exceptional prompt adherence and text rendering capabilities.
- Stable Diffusion: SD 1.5, SDXL, SD3, strong ecosystem and developer community. open source
- Google Imagen: Google's image generator. Access from Gemini.
- Adobe Firefly: integrated in Photoshop, generative fill / inpainting.
Specific Applications
- PortraitArt: photo to art.
- HeadshotPro: AI headshot.
- PhotoRoom: background removal, AI photo editor.
- Playground: AI designer.
π Text Generators
General tools (LLM model, chat bot, question answering, text summarization, writing assistant, code snippet)
- ChatGPT: best known conversational AI by OpenAI, multi-modality. recommended
- Claude: from Anthropic, multi-modality. recommended
- Gemini: from Google DeepMind, multi-modality.
- Llama: a family of advanced open-source language models developed by Meta, designed for tasks like text generation, summarization, and coding. Access from HuggingFace. open source
- Mistral: open and portable generative AI for devs and businesses. Chat bot. open source
- QWen: from Alibaba, multi-modality. Access from HuggingFace open source
Specific Applications
- Search
- Perplexity: AI-powered search engine, concise answers with sources, LLM + web search. recommended
- Writing assistant
- Notion AI: search, generate, analyze, and chat right inside Notion.
- Grammarly: AI writing assistant.
- Jenni: for students, help your reading, writing, and referencing.
- copy.ai: for sales and marketing, generates sales copy, digital ads, blog, email and product descriptions.
- Jasper: for sales and marketing, AI content writing and marketing copy tool.
- Note taking and research assistant
- NotebookLM: developed by Google. Note taking and research assistant powered by AI, even create podcast with one click. recommended
- Help you read PDF documents
π¬ Video Generators
General tools (text to video, image to video)
- Sora: developed by OpenAI, impressive demos but not released.
- Kling AI: good quality, lip sync, available now. recommended
- Meta's Movie Gen: developed by Meta, edit existing video with text, create sound effects, impressive demos but not released.
- Runway: motion brush, camera control.
- Luma Dream Machine: fast generation speed.
- Pika: special effects.
- Stable Video: open source
Specific Applications
- Synthesia: AI-generated videos with virtual avatars and voiceovers.
- HeyGen: create and translate videos with AI.
- AuthoShorts.ai: create faceless short videos on auto-pilot. Auto-post to TikTok and Youtube.
π€ Voice Generators
General tools
- ElevenLabs: high quality text to speech & AI voice generator. recommended
- DeepGram: text to speech, speech to text APIs.
- Wisper: human level speech recognition (speech to text). Access from HuggingFace. open source.
Specific Applications
- Speak: a language learning app that gets you speaking with AI.
- Elsa Speak: practice conversations with AI.
π΅ Music Generators
π€ Code Generators
General tools (code generation from text prompt)
Specific Applications
- Cursor: AI code editor, code in natural language. recommended
- GitHub Copilot: AI coding assistant inside VSCode.
- Replit Ghostwriter: AI for collaborative coding.
π§ 3D Generators
- Meshy: high quality text to 3D, image to 3D. recommended
- Luma Genie: text to 3D.
- Magic3D by NVidia: high-quality 3D mesh models from text prompts, not released.
- OpenAIβs Point-E: 3D point clouds from text prompts, low-resolution. open source
π AI Tutors
- Khanmigo by Khan Academy: AI tutor for K-12 students.
- Speak: a language learning app that gets you speaking with AI.
- Elsa Speak: practice conversations with AI.
𧬠AI for Science
- Biology
- AlphaFold: predicts a protein's 3D structure from its amino acid sequence.
- Math and Reasoning
- GPT o1: reason through complex tasks and solve harder problems. Available in ChatGPT Pro and API for developers.
- AlphaProof and AlphaGeometry: advanced reasoning in mathematics. Not released.
- Chip design
- AlphaChip: chip design assistant. Not released.
If you have questions or suggestions, contact me at aigenerator.cc@gmail.com.