AI & Machine Learning for CGEdit

Generative AI and ML tools across image, video, 3D, texture, audio. AI mocap lives in §3 (dual-listed).

Machine Learning for CG

ML fundamentals and courses for CG artists.

3D Machine Learning. Resource repository for 3D machine learning.
Awesome 3D Human. Curated list of papers and resources on 3D human research, including face, body, hand and motion.
DINOv2. Meta self-supervised vision features. Strong general-purpose image embeddings. Useful as backbone for CG ML tasks.
Generative AI Genius 2024. Free generative AI course material by Aishwarya Naresh Reganti.
Introduction to Generative AI Community Course. Free community course on generative AI fundamentals by iNeuron.
Meta 3D Gen. Meta research pipeline for text-to-3D. Combines AssetGen geometry with TextureGen materials. Paper and results.
PyTorch3D. Meta FAIR library for 3D deep learning. Differentiable rendering, mesh ops, point cloud utils on PyTorch.

Image Generation

text-to-image models and platforms.

Software	Description	License	Tags	Best For
Adobe Firefly 3	Commercial-safe, deep Creative Cloud integration. See also: Software Reference → AI Image & Texture Generation Software	Paid	Commercial-Safe · Adobe CC	Commercial-safe Adobe workflows
Aga Miko/pixel Character Generator	Generating retro pixel game characters with Generative Adversarial Networks. Dataset "TinyHero" included. See also: Software Reference → AI Image & Texture Generation Software	Open Source
FLUX.2 (Black Forest Labs)	Open-weight photorealism with Kontext editing. See also: Software Reference → AI Image & Texture Generation Software	Freemium	Open Weight · Photoreal	Open-weight photorealism
GPT Image (OpenAI)	Instruction-following image model with strong text rendering. GPT Image 1.5. See also: Software Reference → AI Image & Texture Generation Software	Paid	OpenAI · Text Rendering	Instruction following
Grok Aurora (xAI)	Photorealistic image gen integrated into Grok. See also: Software Reference → AI Image & Texture Generation Software	Paid	xAI · Photoreal	Photorealism in Grok
HunyuanDiT (Tencent)	Open-source DiT-based, strong Chinese text. See also: Software Reference → AI Image & Texture Generation Software	Open Source	Tencent · DiT	Chinese text rendering
Ideogram 2.0	Best-in-class typography and text rendering, canvas mode. See also: Software Reference → AI Image & Texture Generation Software	Freemium	Typography · Text Rendering	Typography in images
Imagen	Google photoreal image model via Gemini and Vertex. See also: Software Reference → AI Image & Texture Generation Software	Paid	Photoreal · Google	Photoreal image gen
Jimeng / Dreamina (ByteDance)	High quality, integrated with video pipeline. See also: Software Reference → AI Image & Texture Generation Software	Freemium	ByteDance · Video Pipeline	Integrated image + video pipeline
Kolors (Kuaishou)	Open-source, bilingual Chinese/English. See also: Software Reference → AI Image & Texture Generation Software	Open Source	Open Source · Bilingual	Bilingual open model
Krea AI	Real-time generation, upscaling, and style transfer platform. See also: Software Reference → AI Image & Texture Generation Software	Freemium	Real-time · Upscaling	Real-time + upscaling
Leonardo AI	Real-time generation with artistic control. See also: Software Reference → AI Image & Texture Generation Software	Freemium	Real-time · Controlled	Controlled artistic generation
Midjourney v7	Industry-leading artistic coherence and stylized output. See also: Software Reference → AI Image & Texture Generation Software	Paid	Artistic · Subscription	Artistic and stylized imagery
Nano Banana Pro (Google Gemini)	Instruction-based editing with character and product consistency across edits. See also: Software Reference → AI Image & Texture Generation Software	Freemium	Consistency · Editing	Consistency editing
Playground v3	Mixed-mode design canvas for graphic design. See also: Software Reference → AI Image & Texture Generation Software	Freemium	Design Canvas · Mixed-mode	Graphic design canvas
Recraft V3	Design-oriented gen. Vector, icons, brand assets. See also: Software Reference → AI Image & Texture Generation Software	Freemium	Design Focus · Vector	Design assets, vectors
Reve	Image model with strong prompt adherence. See also: Software Reference → AI Image & Texture Generation Software	Freemium	Prompt Adherence	Prompt adherence
Snowpixel	Generate Images/Videos/Animations/Audio/Music/3D Objects with Text and/or Image. Upload your own data to create custom models. See also: Software Reference → AI Image & Texture Generation Software	Freemium
Stable Diffusion 3.5	Open-source diffusion (Large, Medium, Turbo sizes). See also: Software Reference → AI Image & Texture Generation Software	Open Source	Open Source · Self-Host	Self-hosted image gen
Visual Electric	AI image generator aimed at designers.	Freemium	AI Image

Related:

Canva AI Text-to-Image Generator Freemium. Canva's free built-in text-to-image generator.
RenderNet Freemium. AI tool for generating images and videos with control over character design, composition, and style.
Stable Diffusion Frivolous. Community response to the Stable Diffusion litigation.
Word-As-Image for Semantic Typography. Semantically transforming fonts into illustrations.

Video Generation

AI video generation platforms and models.

Software	Description	License	Tags	Best For
Cog Video X	Tsinghua/Zhipu open-source, multiple sizes. See also: Software Reference → AI Video Generation Software	Open Source	Open Source · Tsinghua	Research open-source video gen
Drawstory	Cinematography and story sequencing with strong character continuity. See also: Software Reference → AI Video Generation Software	Freemium	Cinematography · Continuity	Character continuity
Frameo	Script to cinematic video with scenes, characters, audio, and timeline assembly. See also: Software Reference → AI Video Generation Software	Freemium	Script to Video · Timeline	Structured film pipeline
Genmo Mochi 1	Open-source video gen model. See also: Software Reference → AI Video Generation Software	Open Source	Open Source · Genmo	Open-source video gen
Google Flow	Filmmaking app over Veo with scenebuilder and ingredients-to-video. See also: Software Reference → AI Video Generation Software	Paid	Filmmaking · Scenebuilder	Veo filmmaking app
Hailuo AI / MiniMax	"Director" model. Strong motion and character consistency. See also: Software Reference → AI Video Generation Software	Freemium	Director Model · Character Consist	Directed motion
Haiper	Ex-DeepMind team, animation and video-to-video modes. See also: Software Reference → AI Video Generation Software	Freemium	Ex-DeepMind · V2V	Video-to-video
HeyGen	AI avatar marketing videos with a talking-head focus. See also: Software Reference → AI Video Generation Software	Freemium	Avatars	Avatar videos
HunyuanVideo 1.5	Open-source 8.3B params, runs on 14GB VRAM. See also: Software Reference → AI Video Generation Software	Open Source	Open Source · 14GB VRAM	Self-hosted video gen
InVideo AI	Prompt to video for marketing and social content. See also: Software Reference → AI Video Generation Software	Freemium	Marketing	Marketing and social video
Kling 3.0	Up to 5min clips, strong human motion. Motion Brush control. See also: Software Reference → AI Video Generation Software	Freemium	5min Clips · Motion Brush	Long clips, human motion
LTX Studio	Pre-production to video pipeline with storyboards, character consistency, and timeline. See also: Software Reference → AI Video Generation Software	Freemium	Pre-production · Storyboards	Script to video pipeline
LTX-2 (Lightricks)	Fast open video model. LTX-2 Fast ranks top-3 in the arena. See also: Software Reference → AI Video Generation Software	Open Source	Open Source · Fast	Fast open-source video gen
Luma Dream Machine	Atmospheric image-to-video. Ray3 is the first native 16-bit HDR video model. See also: Software Reference → AI Video Generation Software	Freemium	HDR · Atmospheric	Atmospheric video gen
Mootion	Idea to a 2-minute cinematic video in one tool. See also: Software Reference → AI Video Generation Software	Freemium	All-in-one	Fast all-in-one film
Morphic	AI film studio for story-driven video. See also: Software Reference → AI Video Generation Software	Paid	Film Studio	Story-driven film
Pika 2.0	Extended video gen with improved consistency. See also: Software Reference → AI Video Generation Software	Freemium	Extended Gen · Stylized	Stylized video
PixVerse	Style-specific modes (anime, 3D, realistic), character consistency. See also: Software Reference → AI Video Generation Software	Freemium	Style Modes · Consistency	Style-specific video gen
Runway Gen-4.5	Motion brushes and scene consistency on the GWM-1 world model. See also: Software Reference → AI Video Generation Software	Freemium	Motion Brushes · Consistency	Motion control
Seedance 2.0 (ByteDance)	High raw quality video gen with audio. Tops the Artificial Analysis ranking. See also: Software Reference → AI Video Generation Software	Paid	Top Quality · Native Audio	Top raw quality
Stable Animation SDK	Text-to-animation SDK for developers by Stability AI.	Paid
Veo 3.1 (Google DeepMind)	Top leaderboard, native audio, 60s+ clips. See also: Software Reference → AI Video Generation Software	Paid	Top Leaderboard · Native Audio	Top-quality video gen
Vidu	16s clips, strong human motion (Shengshu). See also: Software Reference → AI Video Generation Software	Freemium	16s · Human Motion	Long human-motion clips
Wan 2.2 (Alibaba)	Cinematic MoE diffusion, 8GB+ VRAM. Open source. See also: Software Reference → AI Video Generation Software	Open Source	Open Source · MoE Diffusion	Cinematic open-source video gen

Related:

Deforum Notebook v0.5 (Stable Diffusion animation). Deforum v0.5 notebook for Stable Diffusion animation with math automation, perspective flips, prompt weights, and video masking.
Deforum Stable Diffusion Animation (Artificial Selections). YouTube channel covering Deforum Stable Diffusion animation techniques.
Emu Video (Meta). Meta's text-to-video research model. Factorized generation, image-then-video. Demos and paper.
Stable Diffusion KLMC2 Animation. Colab notebook by @RiversHaveWings generating animation from scripted prompts via KLMC2 discretization of underdamped Langevin dynamics.
Stable Diffusion KLMC2 Animation (forked). Colab notebook for Stable Diffusion animation, forked by @DigThatData.

3D Generation

AI tools for generating 3D models from text or images.

Software	Description	License	Tags	Best For
3DTopia	Open-source text-to-3D pipeline (coarse → refined). See also: Software Reference → AI 3D Generation Software	Open Source	Coarse→Refined · Open Source	Text-to-3D open-source
Ashawkey/stable Dreamfusion	A pytorch implementation of text-to-3D dreamfusion, powered by stable diffusion.	Open Source
InstantMesh	Fast single-image-to-3D reconstruction. See also: Software Reference → AI 3D Generation Software	Open Source	Fast · Image→3D	Fast image→3D
Intangible	Prompt and drag-drop to 3D scenes for previz. No-code. See also: Software Reference → AI 3D Generation Software	Freemium	Previz · No-code	No-code 3D previz
Kaedim	Image-to-3D with hybrid AI plus artist cleanup for production quality. See also: Software Reference → AI 3D Generation Software	Paid	Hybrid AI+Artist · Production	Production-quality via hybrid AI+artist
Luma Genie	Text/image to 3D, integrated with Dream Machine. See also: Software Reference → AI 3D Generation Software	Freemium	Luma · Integrated	Luma-integrated 3D gen
Meshy v4	Production-reliable, improved topology and PBR textures. See also: Software Reference → AI 3D Generation Software	Freemium	Production-Reliable · PBR	Production-ready AI 3D
Rodin Gen-2 (Hyper3D)	10B params, photorealistic, free generation tier. See also: Software Reference → AI 3D Generation Software	Freemium	10B Params · Free Tier	Photorealistic AI 3D
Sloyd	Procedural 3D generation with parametric control. See also: Software Reference → AI 3D Generation Software	Freemium	Procedural · Parametric	Parametric procedural 3D
Spline AI	Generate 3D objects and textures from text prompts in-editor. See also: Software Reference → AI 3D Generation Software	Freemium	In-Editor · Text→3D	In-editor AI 3D
Stability SPAR3D	Open-source single-image 3D reconstruction. See also: Software Reference → AI 3D Generation Software	Open Source	Stability AI · Open Source	Single-image open-source 3D
TRELLIS.2 (Microsoft)	Full PBR materials, complex topologies. See also: Software Reference → AI 3D Generation Software	Open Source	Microsoft · PBR + Open Source	PBR AI 3D (open source)
Tripo v3.0	Sculpture-level precision, clean quad topology. See also: Software Reference → AI 3D Generation Software	Freemium	Clean Quads · Sculpt-level	Clean topology AI 3D
TripoSR	Tripo/Stability collab. Fast open-source image-to-3D. See also: Software Reference → AI 3D Generation Software	Open Source	Open Source · Fast	Fast open-source image→3D
Unique3D	High-quality mesh from single image (NeurIPS 2024). See also: Software Reference → AI 3D Generation Software	Open Source	Single-Image · NeurIPS	Single-image open-source 3D
Wonder3D++	Cross-domain diffusion, textured meshes in 2-3min. See also: Software Reference → AI 3D Generation Software	Open Source	Fast · Textured Mesh	Fast textured mesh gen

Related:

NativeBlend CLI. AI CLI to generate and edit low-poly 3D game assets in Blender from text prompts.
Pixal3D. Single-image to 3D asset generation via pixel-feature back-projection. Outputs geometry and PBR textures.
Point-E (OpenAI). OpenAI point-cloud diffusion for text and image to 3D synthesis.
Threestudio. A unified framework for 3D content generation.
Transforming 2D Images into 3D with AdaMPI. Guide to using the AdaMPI model to create 3D photos from 2D images.

Texture & Material Generation

AI-generated PBR textures and materials.

Software	Description	License	Tags	Best For
Adobe Firefly Textures	Prompt-to-edit texture workflows. See also: Software Reference → AI Image & Texture Generation Software	Paid	Adobe · Prompt-to-Edit	Adobe-integrated AI textures
InstaMAT	Material authoring with AI workflows (Substance alternative). See also: Software Reference → AI Image & Texture Generation Software	Paid	Substance-alt · AI Workflows	AI-assisted material authoring
Meshy Textures	Integrated with 3D pipeline. See also: Software Reference → AI Image & Texture Generation Software	Freemium	Meshy Pipeline · Integrated	Textures tied to Meshy 3D
Poly	AI-generated PBR textures and materials, tileable with full map sets. See also: Software Reference → AI Image & Texture Generation Software	Freemium	AI PBR · Tileable	AI PBR textures
Ponzu	AI texture gen from text prompts for uploaded meshes. See also: Software Reference → AI Image & Texture Generation Software	Freemium	Mesh-aware · Text→Tex	Mesh-aware AI textures
Scenario	Game-ready PBR materials, full map sets. See also: Software Reference → AI Image & Texture Generation Software	Paid	Game-Ready · Full Maps	Game-ready AI materials

ComfyUI Ecosystem

Node-based AI generation. ComfyUI and its ecosystem.

Software	Description	License	Tags	Best For
ComfyUI	Node-based AI generator for image/video/3D/audio. See also: Software Reference → AI Image & Texture Generation Software	Open Source	Node-based · Multi-modal	Node-based AI workflows

Related:

Awesome ComfyUI. Curated custom nodes collection.
Comflowy. ComfyUI tutorials for ControlNet, SDXL, FLUX workflows.
Comfy UI Manager. Install and manage custom nodes and dependencies.
ComfyUI Registry. Community workflow and subgraph sharing.
ControlNet SD3.5. Blur/Canny/Depth models, free commercial use.

AI-Assisted CG Tools

AI tools that augment traditional CG workflows.

Software	Description	License	Tags	Best For
Autodesk Flow Studio (Wonder Dynamics)	AI VFX, auto CG characters in live footage, USD export. See also: Software Reference → AI-Assisted CG Software	Paid	Auto CG Chars · USD Export	AI CG characters in live plates
Blockade Labs Skybox AI	AI-generated 360° skyboxes/HDRIs from text. See also: Software Reference → AI-Assisted CG Software	Freemium	AI Skybox · 360°	AI skyboxes/HDRIs
Gigapixel AI	AI photo enlargement that adds detail to upscaled images. See also: Software Reference → AI-Assisted CG Software	Paid		Detail-preserving upscaling
Let's Enhance	Photo upscaler up to 16x with a free tier. See also: Software Reference → AI-Assisted CG Software	Freemium	Photo Upscale	Photo upscaling
Magnific AI	Creative upscaler that adds detail. Folded into Freepik plans Apr 2026. See also: Software Reference → AI-Assisted CG Software	Paid	Creative Upscale · Freepik	Creative AI upscaling
Rosebud AI	Vibe coding platform for creating 3D games and interactive web apps with AI. See also: Software Reference → AI-Assisted CG Software	Freemium	AI Game Builder · Vibe Coding
Topaz Photo AI / Video AI	Upscaling, denoising, and sharpening. Topaz Bloom adds unlimited creative upscaling. See also: Software Reference → AI-Assisted CG Software	Paid	Upscale · Denoise	Upscale/denoise
UneeQ Digital Humans	Cloud platform for real-time interactive digital humans.	Paid
Unity Muse	Unity in-editor AI for texture, sprite, and animation generation. See also: Software Reference → AI-Assisted CG Software	Paid	Unity AI Suite
Upscayl	Open-source desktop upscaler that runs locally at no cost. See also: Software Reference → AI-Assisted CG Software	Open Source	Open Source · Local	Free local upscaling

Related:

AI Game Developer. Unity Editor and Unity Runtime AI integration. Unit Test, Coding, C# Roslyn, Reflection, Assets. Helps to create games with AI. And helps to run AI logic during gameplay.
AI Render (Stable Diffusion in Blender). Blender addon that renders an AI image from a text prompt and your scene via Stable Diffusion.
ClipDrop: Image Upscaler Freemium. AI image upscaler and enhancer.
Comfy UI Blender AI. Run ComfyUI workflows inside Blender.
ControlNet. Depth/pose/edge control for precision CG workflows.
Coplay Freemium. AI copilot for Unity.
CorridorKey. Corridor Crew's open keying tool. AI-driven chroma key for VFX work.
CoTracker. Meta FAIR point tracker for video. Tracks arbitrary points jointly across frames. Useful for matchmove and roto prep.
DEVA (Tracking Anything). Decoupled video segmentation. Long-form, multi-object roto with text or click prompts.
Dream Textures (Blender). Generate textures in Blender via Stable Diffusion, locally.
Genetic Drawing. Python library that generates a stylized rendering from an image.
GPT 4. Prompt engineering techniques for GPT-4, including tips, applications, limitations, and additional reading materials.
RIFE for Nuke. Real-Time Intermediate Flow Estimation for video frame interpolation (ML framerate upscaling) for Nuke.
Robust Video Matting (RVM). Real-time human video matting. Strong alpha mattes with no greenscreen.
Rotobot Paid. Paid AI roto plugin for Nuke and After Effects. Auto-mattes people and objects.
SAM 2 (Segment Anything Model 2). Meta's promptable segmentation for images and video. Masks objects across frames from a single click. Useful for rotoscope and matte work.
Sammie Roto 2. SAM-based rotoscope app. Click-to-mask, propagates across frames.
SolidUI. AI-generated visualization prototyping and editing platform, support 2D, 3D models, combined with LLM(Large Language Model) for quick editing.
Track Anything. SAM plus XMem for interactive video object tracking and segmentation. Useful for roto and matte propagation.
Wonder Studio (Autodesk Flow Studio) Freemium. AI character replacement with auto roto, body tracking, and re-lighting from a single video. Now part of Autodesk Flow Studio.

AI Audio & Music

AI music generation, voice, TTS, and SFX.

Software	Description	License	Tags	Best For
ACE Studio	AI singing voice synthesizer with expression control. See also: Software Reference → AI Audio & Music Generation Software	Freemium	Singing · Expression	AI singing synthesis
AIVA	AI composition, good for orchestral/cinematic scoring. See also: Software Reference → AI Audio & Music Generation Software	Freemium	Orchestral · Scoring	Orchestral/cinematic scoring
DiffRhythm	Open-source full-song gen with vocals from lyrics. See also: Software Reference → AI Audio & Music Generation Software	Open Source	Open Source · Lyrics→Song	Open-source song gen with vocals
ElevenLabs	Voice cloning, narration, music generation. See also: Software Reference → AI Audio & Music Generation Software	Freemium	Voice Cloning · TTS	Voice cloning, TTS
F5 TTS	Open-source zero-shot voice cloning TTS. See also: Software Reference → AI Audio & Music Generation Software	Open Source	Zero-Shot · Voice Clone	Zero-shot voice cloning
Fish Audio	Open-source TTS with voice cloning, fast and multilingual. See also: Software Reference → AI Audio & Music Generation Software	Open Source	Open Source · Multilingual	Open-source TTS
Suno v5	Full song generation, 100M+ users. See also: Software Reference → AI Audio & Music Generation Software	Freemium	Song Gen · 100M users	Full song generation
Udio	Strong electronic/pop, licensed for commercial use. See also: Software Reference → AI Audio & Music Generation Software	Freemium	Electronic/Pop · Commercial	Commercial-licensed AI music

Related:

AudioCraft (MusicGen / AudioGen / EnCodec). Meta's open audio generation stack. MusicGen for music, AudioGen for SFX, EnCodec for neural audio compression.
COVAL. Guide to building, scaling, and evaluating voice AI, from speech recognition to emotional intelligence.

Open-Source Models (HuggingFace)

Notable open-source generative models.

Hugging Face Transformers. NLP models and pipelines.
Hunyuan Video I2V. Image-to-video, multilingual.
sd-concepts-library (Stable Diffusion Concepts Library). Community library of Stable Diffusion textual-inversion concepts to browse and use in prompts.
SV3D (Stable Video 3D). Orbital video from single image.

Research Papers

Research papers on ML for CG, generative 3D, neural rendering, and related topics. Each entry: title (the plain-English summary), year, and venue/links in the description.

3D Neural Scene Representations for Visuomotor Control. Li et al., CoRL 2021 Oral
3DALL-E: Integrating Text-to-Image AI in 3D Design Workflows. ArXiv 2022 paper on integrating text-to-image AI in 3D design workflows.
A Higher-Dimensional Representation for Topologically Varying Neural Radiance Fields. Park et al., Arxiv 2021 | github
A Shading-Guided Generative Implicit Model for Shape-Accurate 3D-Aware Image Synthesis. Pan et al., NeurIPS 2021 | github
Align Your Gaussians: Text-to-4D with Dynamic 3D Gaussians and Composed Diffusion Models. Project Page
AligNeRF: High-Fidelity Neural Radiance Fields via Alignment-Aware Training. Jiang et al., CVPR 2023
Animatable Neural Radiance Fields for Modeling Dynamic Human Bodies. Peng et al., ICCV 2021 | github
Animatable Neural Radiance Fields from Monocular RGB Videos. Chen et al., Arxiv 2021 | github
AutoInt: Automatic Integration for Fast Neural Volume Rendering. Lindell et al., CVPR 2021 | github
BeyondPixels: A Review of the Evolution of Neural Radiance Fields. AKM Shahariar Azad Rabby and Chengcui Zhang, Arxiv 2023
Block-NeRF: Scalable Large Scene Neural View Synthesis. Tancik et al., Arxiv 2022
BundleSDF: Neural 6-DoF Tracking and 3D Reconstruction of Unknown Objects. Wen et al., CVPR 2023 | github
CADOps-Net: Jointly Learning CAD Operation Types and Steps from Boundary-Representations. 3DV 2022 | [project]
CLA-NeRF: Category-Level Articulated Neural Radiance Field. Tseng et al., ICRA 2022
CLIP-NeRF: Text-and-Image Driven Manipulation of Neural Radiance Fields. Wang et al., CVPR 2022 | github
CodeNeRF: Disentangled Neural Radiance Fields for Object Categories. Jang et al., ICCV 2021
Color-NeRF. NeRF research paper on color and appearance modelling.
Consolidating Attention Features for Multi-view Image Editing. Project Page
Contrastive Lift: 3D Object Instance Segmentation by Slow-Fast Contrastive Fusion. Bhalgat et al., NeurIPS 2023 (Spotlight)
CROSSFIRE: Camera Relocalization On Self-Supervised Features from an Implicit Representation. Moreau et al., ICCV 2023
D-NeRF: Neural Radiance Fields for Dynamic Scenes. Pumarola et al., CVPR 2021 | github
Deep Generative Models on 3D Representations: A Survey. Project Page
DeepCAD: A Deep Generative Network for Computer-Aided Design Models. ICCV 2021 | [project] [code]
Deformable Neural Radiance Fields. Park et al., Arxiv 2020 | github
Depth-supervised NeRF: Fewer Views and Faster Training for Free. Deng et al., Arxiv 2021 | github
DeRF: Decomposed Radiance Fields. Rebain et al., Arxiv 2020
DFNet: Enhance Absolute Pose Regression with Direct Feature Matching. Chen et al., ECCV 2022 | github
Direct Voxel Grid Optimization: Super-fast Convergence for Radiance Fields Reconstruction. Sun et al., CVPR 2022 | github
DM-NeRF: 3D Scene Geometry Decomposition and Manipulation from 2D Images. Code
DMV3D: Denoising Multi-View Diffusion using 3D Large Reconstruction Model. Project Page
DONeRF: Towards Real-Time Rendering of Compact Neural Radiance Fields using Depth Oracle Networks. Neff et al., CGF 2021 | github
Dream3D: Zero-Shot Text-to-3D Synthesis Using 3D Shape Prior and Text-to-Image Diffusion Models. Project Page
DreamCraft3D: Hierarchical 3D Generation with Bootstrapped Diffusion Prior. Project Page | Code
DreamFusion: Text-to-3D using 2D Diffusion. Project Page | Code
DreamGaussian: Generative Gaussian Splatting for Efficient 3D Content Creation. Project Page | Code
DreamGaussian4D: Generative 4D Gaussian Splatting. Project Page | Code
DreamWaltz: Make a Scene with Complex 3D Animatable Avatars. Project Page | Code
DyLiN: Making Light Field Networks Dynamic. Yu et al., CVPR 2023 | github
Dynamic Neural Radiance Fields for Monocular 4D Facial Avatar Reconstruction. Gafni et al., CVPR 2021 | github
DynIBaR: Neural Dynamic Image-Based Rendering. Li et al., CVPR 2023
Editable Free-viewpoint Video Using a Layered Neural Representation. Zhang et al., SIGGRAPH 2021 | github
Editing Conditional Radiance Fields. Liu et al., Arxiv 2021 | github
En3D: An Enhanced Generative Model for Sculpting 3D Humans from 2D Synthetic Data. Project Page | Code
EndoGaussian: Real-time Gaussian Splatting for Dynamic Endoscopic Scene Reconstruction. Project Page | Code
ENeRF: Efficient Neural Radiance Fields for Interactive Free-viewpoint Video. Lin et al., SIGGRAPH 2022 | github
FastNeRF: High-Fidelity Neural Rendering at 200FPS. Garbin et al., Arxiv 2021
FiG-NeRF: Figure Ground Neural Radiance Fields for 3D Object Category Modelling. Xie et al., Arxiv 2021
From 2D CAD Drawings to 3D Parametric Models: A Vision-Language Approach. AAAI 2025 | [project]
GALA3D: Towards Text-to-3D Complex Scene Generation via Layout-guided Generative Gaussian Splatting. Project Page | Code
GARF: Gaussian Activated Radiance Fields for High Fidelity Reconstruction and Pose Estimation. Chng et al., ECCV 2022
GaussianAvatars: Photorealistic Head Avatars with Rigged 3D Gaussians. Project Page
GaussianObject: High-Quality 3D Object Reconstruction from Four Views with Gaussian Splatting. Project Page | Code
Gemini Robotics 1.5. Embodied reasoning, thinking, and motion transfer.
Generative Occupancy Fields for 3D Surface-Aware Image Synthesis. Xu et al., NeurIPS 2021 | github
GIRAFFE: Representing Scenes as Compositional Generative Neural Feature Fields. Niemeyer et al., CVPR 2021
GNeRF: GAN-based Neural Radiance Field without Posed Camera. Meng et al., Arxiv 2021
GO-NeRF: Generating Objects in Neural Radiance Fields for Virtual Reality Content Creation. Project Page
GRAF: Generative Radiance Fields for 3D-Aware Image Synthesis. Schwarz et al., NeurIPS 2020 | github
GRF: Learning a General Radiance Field for 3D Scene Representation and Rendering. Trevithick and Yang, Arxiv 2020 | github
Ha-NeRF: Hallucinated Neural Radiance Fields in the Wild. Chen et al., CVPR 2022 | github
HumanNeRF: Free-viewpoint Rendering of Moving People from Monocular Video. Weng et al., CVPR 2022 | github
IBRNet: Learning Multi-View Image-Based Rendering. Wang et al., CVPR 2021 | github
IDE-3D: Interactive Disentangled Editing for High-Resolution 3D-aware Portrait Synthesis. Project Page | Code
Image Sculpting: Precise Object Editing with 3D Geometry Control. Project Page | Code
In-Place Scene Labelling and Understanding with Implicit Scene Representation. Zhi et al., Arxiv 2021
iNeRF: Inverting Neural Radiance Fields for Pose Estimation. Yen-Chen et al. IROS 2021
InfNeRF: Towards Infinite Scale NeRF Rendering with O(log n) Space Complexity. Liang et al., SIGGRAPH Asia 2024 | github
Instant3D: Instant Text-to-3D Generation. Project Page | Code
KiloNeRF: Speeding up Neural Radiance Fields with Thousands of Tiny MLPs. Reiser et al., ICCV 2021 | github
KiloNeuS: Implicit Neural Representations with Real-Time Global Illumination. Esposito et al., Arxiv 2022
L2G-NeRF: Local-to-Global Registration for Bundle-Adjusting Neural Radiance Fields. Chen et al., CVPR 2023 | github
Learned Initializations for Optimizing Coordinate-Based Neural Representations. Tancik et al., CVPR 2021 | github
Learning Compositional Radiance Fields of Dynamic Human Heads. Wang et al., Arxiv 2020
Learning Neural Transmittance for Efficient Rendering of Reflectance Fields. Mohammad Shafiei et al., BMVC 2021
Learning Object-Compositional Neural Radiance Field for Editable Scene Rendering. Yang et al., ICCV 2021 | github
Learning the 3D Fauna of the Web. Project Page | Code
Lightning NeRF: Efficient Hybrid Scene Representation for Autonomous Driving. Cao et al. ICRA 2024 | github
Loc-NeRF: Monte Carlo Localization using Neural Radiance Fields. Maggio et al., ICRA 2023 | github
Local 3D Editing via 3D Distillation of CLIP Knowledge. Hyung et al., CVPR 2023| github
M3DBench: Let's Instruct Large Models with Multi-modal 3D Prompts. Project Page | Code
Magic3D: High-Resolution Text-to-3D Content Creation. Project Page
Make-A-Character: High Quality Text-to-3D Character Generation within Minutes. Project Page | Code
Make-It-3D: High-Fidelity 3D Creation from A Single Image with Diffusion Prior. Project Page | Code
MinD-3D: Reconstruct High-quality 3D objects in Human Brain. Project Page | Code
MINE: Towards Continuous Depth MPI with NeRF for Novel View Synthesis. Jiaxin Li et al., ICCV 2021 | github
Mip-NeRF 360: Unbounded Anti-Aliased Neural Radiance Fields. Barron et al., Arxiv 2022
Mip-NeRF: A Multiscale Representation for Anti-Aliasing Neural Radiance Fields. Barron et al., Arxiv 2021 | github
Mixture of Volumetric Primitives for Efficient Neural Rendering. Lombardi et al., SIGGRAPH 2021
MVSNeRF: Fast Generalizable Radiance Field Reconstruction from Multi-View Stereo. Chen et al., ICCV 2021 | github
NeO 360: Neural Fields for Sparse View Synthesis of Outdoor Scenes. Irshad et al., ICCV 2023 | github
NeRF (paper site). Mildenhall et al., ECCV 2020 | github
NeRF in the Dark: High Dynamic Range View Synthesis from Noisy Raw Images. Ben Mildenhall et al, arXiv 2021
NeRF in the Wild: Neural Radiance Fields for Unconstrained Photo Collections. Martin-Brualla et al., CVPR 2021
NeRF--: Neural Radiance Fields Without Known Camera Parameters. Wang et al., Arxiv 2021 | github
NeRF-In: Free-Form NeRF Inpainting with RGB-D Priors. Liu et al., Arxiv 2022
NeRF-MAE: Masked AutoEncoders for Self-Supervised 3D Representation Learning for Neural Radiance Fields. Irshad et al., ECCV 2024
NeRF-SOS: Any-view Self-supervised Object Segmentation on Complex Real-world Scenes. Fan et al., ICLR 2023
NeRF-VAE: A Geometry Aware 3D Scene Generative Model. Kosiorek et al., Arxiv 2021
NeRF: Neural Radiance Field in 3D Vision, Introduction and Review. Kyle Gao, Yina Gao, Hongjie He, Dening Lu, Linlin Xu, Jonathan Li
NeRF++: Analyzing and Improving Neural Radiance Fields. Zhang et al., Arxiv 2020 | github
NerfingMVS: Guided Optimization of Neural Radiance Fields for Indoor Multi-view Stereo. Wei et al., ICCV 2021
NeRV: Neural Reflectance and Visibility Fields for Relighting and View Synthesis. Srinivasan et al. CVPR 2021
Neural 3D Video Synthesis from Multi-view Video. Li et al., CVPR 2022
Neural Body: Implicit Neural Representations with Structured Latent Codes for Novel View Synthesis of Dynamic Humans. Peng et al., CVPR 2021 | github
Neural Radiance Flow for 4D View Synthesis and Video Processing. Du et al., Arxiv 2020
Neural Rays for Occlusion-aware Image-based Rendering. Liu et al., Arxiv 2021
Neural Refinement for Absolute Pose Regression with Feature Synthesis. Chen et al., CVPR 2024 | github
Neural Scene Flow Fields for Space-Time View Synthesis of Dynamic Scenes. Li et al., CVPR 2021 | github
Neural Scene Graphs for Dynamic Scenes. Ost et al., CVPR 2021
Neural Sparse Voxel Fields. Liu et al., NeurIPS 2020 | github
Neural Volume Rendering: NeRF. Dellaert and Yen-Chen, Arxiv 2020 | blog
NeuS: Learning Neural Implicit Surfaces by Volume Rendering for Multi-view Reconstruction. Wang et al., NeurIPS 2021 | github
Non-Rigid Neural Radiance Fields: Reconstruction and Novel View Synthesis of a Deforming Scene from Monocular Video. Tretschk et al., Arxiv 2020 | github
NoPe-NeRF: Optimising Neural Radiance Field with No Pose Prior. Bian et al., CVPR 2023 | github
One-2-3-45++: Fast Single Image to 3D Objects with Consistent Multi-View Generation and 3D Diffusion. Project Page | Code
ParSeNet: A Parametric Surface Fitting Network for 3D Point Clouds. ECCV 2020 | [project] [code]
PC2WF: 3D Wireframe Reconstruction from Raw Point Clouds. ICLR 2021 | [code]
Photo tourism: Exploring photo collections in 3D. Snavely, Seitz, Szeliski. SIGGRAPH 2006.
pi-GAN: Periodic Implicit Generative Adversarial Networks for 3D-Aware Image Synthesis. Chan et al., CVPR 2021
pixelNeRF: Neural Radiance Fields from One or Few Images. Yu et al., CVPR 2021 | github
PlankAssembly: 3D Reconstruction from Three Orthographic Views with Learnt Shape Programs. ICCV 2023 | [project] [code]
PlenOctrees for Real-time Rendering of Neural Radiance Fields. Yu et al., Arxiv 2021 | github
Point-NeRF: Point-based Neural Radiance Fields. Xu et al., CVPR 2022 | github
Point2Cyl: Reverse Engineering 3D Objects from Point Clouds to Extrusion Cylinders. CVPR 2022 | [project] [code]
PolyGen: An Autoregressive Generative Model of 3D Meshes. ICML 2020 | [code]
Ponder: Point Cloud Pre-training via Neural Rendering. Huang et al., ICCV 2023
PonderV2: Pave the Way for 3D Foundation Model with A Universal Pre-training Paradigm. Zhu et al., Arxiv 2023 | github
PoRF: Pose Residual Field for Accurate Neural Surface Reconstruction. Bian et al., ICLR 2024 | github
Portrait Neural Radiance Fields from a Single Image. Gao et al., Arxiv 2020
Progressive3D: Progressively Local Editing for Text-to-3D Content Creation with Complex Semantic Prompts. Project Page | Code
PSAvatar: A Point-based Shape Model for Real-Time Head Avatar Animation with 3D Gaussian Splatting. Code
Putting NeRF on a Diet: Semantically Consistent Few-Shot View Synthesis. Matthew Tancik et al., Arxiv 2021
PVA: Pixel-aligned Volumetric Avatars. Raj et al., CVPR 2021
PVDeconv: Point-voxel deconvolution for autoencoding cad construction in 3D. ICIP 2020 | [project]
Reconstructing Editable Prismatic CAD from Rounded Voxel Models. SIGGRAPH Asia 2022
Recovering HDR Radiance Maps (Debevec & Malik, 1997). SIGGRAPH 1997 paper. Original HDR-from-bracketed-exposures recovery method.
Rendering Synthetic Objects into Real Scenes (Debevec, 1998). SIGGRAPH 1998 paper founding image-based lighting.
RichDreamer: A Generalizable Normal-Depth Diffusion Model for Detail Richness in Text-to-3D. Project Page | Code
Rig3DGS: Creating Controllable Portraits from Casual Monocular Videos. Project Page
Rodin: A Generative Model for Sculpting 3D Digital Avatars Using Diffusion. Wang et al., CVPR 2023 | github
RT-NeRF: Real-Time On-Device Neural Radiance Fields Towards Immersive AR/VR Rendering. Li et al., ICCAD 2022
SEEAvatar: Photorealistic Text-to-3D Avatar Generation with Constrained Geometry and Appearance. Project Page
Self-Calibrating Neural Radiance Fields. Jeong et al., ICCV 2021 | github
SIGNeRF: Scene Integrated Generation for Neural Radiance Fields. Project Page | Code
SINE: Semantic-driven Image-based NeRF Editing with Prior-guided Editing Field. Bao et al., CVPR 2023 | github
SinNeRF: Training Neural Radiance Fields on Complex Scenes from a Single Image. Xu et al., ECCV 2022 | github
Space-time Neural Irradiance Fields for Free-Viewpoint Video. Xian et al., CVPR 2021
Streaming Radiance Fields for 3D Video Synthesis. Li et al. NeurIPS 2022 | github
Street Gaussians: Modeling Dynamic Urban Scenes with Gaussian Splatting. Project Page | Code
StyleNeRF: A Style-based 3D-Aware Generator for High-resolution Image Synthesis. Gu et al., Arxiv 2021
Supervised Fitting of Geometric Primitives to 3D Point Clouds. CVPR 2019 | [code]
SurfelNeRF: Neural Surfel Radiance Fields for Online Photorealistic Reconstruction of Indoor Scenes. Gao et al., CVPR 2023 | github
Switch-NeRF: Learning Scene Decomposition with Mixture of Experts for Large-scale Neural Radiance Fields. Mi et al., ICLR 2023 | github
Taming Mode Collapse in Score Distillation for Text-to-3D Generation. Project Page | Code
Text-To-4D Dynamic Scene Generation. Project Page
Text2CAD: Text to 3D CAD Generation via Technical Drawings. NeurIPS 2024 | [project]
TextField3D: Towards Enhancing Open-Vocabulary 3D Generation with Noisy Text Fields. Project Page
TextureDreamer: Image-guided Texture Synthesis through Geometry-aware Diffusion. Project Page
TiNeuVox: Fast Dynamic Radiance Fields with Time-Aware Neural Voxels. Fang et al., SIGGRAPH Asia 2022 | github
TOSS:High-quality Text-guided Novel View Synthesis from a Single Image. Project Page
Triplane Meets Gaussian Splatting: Fast and Generalizable Single-View 3D Reconstruction with Transformers. Project Page
UV Volumes for Real-time Rendering of Editable Free-view Human Performance. Chen et al., CVPR 2023 | github
ViCA-NeRF: View-Consistency-Aware 3D Editing of Neural Radiance Fields. Code
Volume Rendering of Neural Implicit Surfaces. Yariv et al., NeurIPS 2021 | github
Wonder3D: Single Image to 3D using Cross-Domain Diffusion. Project Page | Code
X-NeRF: Explicit Neural Radiance Field for Multi-Scene 360° Insufficient RGB-D Views. Zhu et al., WACV 2023 | github
Zero123++: a Single Image to Consistent Multi-view Diffusion Base Model. Project Page | Code | Hugging Face

AI Image & Texture Generation Software

Also in Software Reference → AI Image & Texture Generation Software

Software	Description	License	Tags	Best For
Adobe Firefly 3	Commercial-safe, deep Creative Cloud integration.	Paid	Commercial-Safe · Adobe CC	Commercial-safe Adobe workflows
Adobe Firefly Textures	Prompt-to-edit texture workflows.	Paid	Adobe · Prompt-to-Edit	Adobe-integrated AI textures
Aga Miko/pixel Character Generator	Generating retro pixel game characters with Generative Adversarial Networks. Dataset "TinyHero" included.	Open Source
ComfyUI	Node-based AI generator for image/video/3D/audio.	Open Source	Node-based · Multi-modal	Node-based AI workflows
FLUX.2 (Black Forest Labs)	Open-weight photorealism with Kontext editing.	Freemium	Open Weight · Photoreal	Open-weight photorealism
GPT Image (OpenAI)	Instruction-following image model with strong text rendering. GPT Image 1.5.	Paid	OpenAI · Text Rendering	Instruction following
Grok Aurora (xAI)	Photorealistic image gen integrated into Grok.	Paid	xAI · Photoreal	Photorealism in Grok
HunyuanDiT (Tencent)	Open-source DiT-based, strong Chinese text.	Open Source	Tencent · DiT	Chinese text rendering
Ideogram 2.0	Best-in-class typography and text rendering, canvas mode.	Freemium	Typography · Text Rendering	Typography in images
Imagen	Google photoreal image model via Gemini and Vertex.	Paid	Photoreal · Google	Photoreal image gen
InstaMAT	Material authoring with AI workflows (Substance alternative).	Paid	Substance-alt · AI Workflows	AI-assisted material authoring
Jimeng / Dreamina (ByteDance)	High quality, integrated with video pipeline.	Freemium	ByteDance · Video Pipeline	Integrated image + video pipeline
Kolors (Kuaishou)	Open-source, bilingual Chinese/English.	Open Source	Open Source · Bilingual	Bilingual open model
Krea AI	Real-time generation, upscaling, and style transfer platform.	Freemium	Real-time · Upscaling	Real-time + upscaling
Leonardo AI	Real-time generation with artistic control.	Freemium	Real-time · Controlled	Controlled artistic generation
Meshy Textures	Integrated with 3D pipeline.	Freemium	Meshy Pipeline · Integrated	Textures tied to Meshy 3D
Midjourney v7	Industry-leading artistic coherence and stylized output.	Paid	Artistic · Subscription	Artistic and stylized imagery
Nano Banana Pro (Google Gemini)	Instruction-based editing with character and product consistency across edits.	Freemium	Consistency · Editing	Consistency editing
Playground v3	Mixed-mode design canvas for graphic design.	Freemium	Design Canvas · Mixed-mode	Graphic design canvas
Poly	AI-generated PBR textures and materials, tileable with full map sets.	Freemium	AI PBR · Tileable	AI PBR textures
Ponzu	AI texture gen from text prompts for uploaded meshes.	Freemium	Mesh-aware · Text→Tex	Mesh-aware AI textures
Recraft V3	Design-oriented gen. Vector, icons, brand assets.	Freemium	Design Focus · Vector	Design assets, vectors
Reve	Image model with strong prompt adherence.	Freemium	Prompt Adherence	Prompt adherence
Scenario	Game-ready PBR materials, full map sets.	Paid	Game-Ready · Full Maps	Game-ready AI materials
Snowpixel	Generate Images/Videos/Animations/Audio/Music/3D Objects with Text and/or Image. Upload your own data to create custom models.	Freemium
Stable Diffusion 3.5	Open-source diffusion (Large, Medium, Turbo sizes).	Open Source	Open Source · Self-Host	Self-hosted image gen

AI Video Generation Software

Also in Software Reference → AI Video Generation Software

Software	Description	License	Tags	Best For
Cog Video X	Tsinghua/Zhipu open-source, multiple sizes.	Open Source	Open Source · Tsinghua	Research open-source video gen
Drawstory	Cinematography and story sequencing with strong character continuity.	Freemium	Cinematography · Continuity	Character continuity
Frameo	Script to cinematic video with scenes, characters, audio, and timeline assembly.	Freemium	Script to Video · Timeline	Structured film pipeline
Genmo Mochi 1	Open-source video gen model.	Open Source	Open Source · Genmo	Open-source video gen
Google Flow	Filmmaking app over Veo with scenebuilder and ingredients-to-video.	Paid	Filmmaking · Scenebuilder	Veo filmmaking app
Hailuo AI / MiniMax	"Director" model. Strong motion and character consistency.	Freemium	Director Model · Character Consist	Directed motion
Haiper	Ex-DeepMind team, animation and video-to-video modes.	Freemium	Ex-DeepMind · V2V	Video-to-video
HeyGen	AI avatar marketing videos with a talking-head focus.	Freemium	Avatars	Avatar videos
HunyuanVideo 1.5	Open-source 8.3B params, runs on 14GB VRAM.	Open Source	Open Source · 14GB VRAM	Self-hosted video gen
InVideo AI	Prompt to video for marketing and social content.	Freemium	Marketing	Marketing and social video
Kling 3.0	Up to 5min clips, strong human motion. Motion Brush control.	Freemium	5min Clips · Motion Brush	Long clips, human motion
LTX Studio	Pre-production to video pipeline with storyboards, character consistency, and timeline.	Freemium	Pre-production · Storyboards	Script to video pipeline
LTX-2 (Lightricks)	Fast open video model. LTX-2 Fast ranks top-3 in the arena.	Open Source	Open Source · Fast	Fast open-source video gen
Luma Dream Machine	Atmospheric image-to-video. Ray3 is the first native 16-bit HDR video model.	Freemium	HDR · Atmospheric	Atmospheric video gen
Mootion	Idea to a 2-minute cinematic video in one tool.	Freemium	All-in-one	Fast all-in-one film
Morphic	AI film studio for story-driven video.	Paid	Film Studio	Story-driven film
Pika 2.0	Extended video gen with improved consistency.	Freemium	Extended Gen · Stylized	Stylized video
PixVerse	Style-specific modes (anime, 3D, realistic), character consistency.	Freemium	Style Modes · Consistency	Style-specific video gen
Runway Gen-4.5	Motion brushes and scene consistency on the GWM-1 world model.	Freemium	Motion Brushes · Consistency	Motion control
Seedance 2.0 (ByteDance)	High raw quality video gen with audio. Tops the Artificial Analysis ranking.	Paid	Top Quality · Native Audio	Top raw quality
Veo 3.1 (Google DeepMind)	Top leaderboard, native audio, 60s+ clips.	Paid	Top Leaderboard · Native Audio	Top-quality video gen
Vidu	16s clips, strong human motion (Shengshu).	Freemium	16s · Human Motion	Long human-motion clips
Wan 2.2 (Alibaba)	Cinematic MoE diffusion, 8GB+ VRAM. Open source.	Open Source	Open Source · MoE Diffusion	Cinematic open-source video gen

AI 3D Generation Software

Also in Software Reference → AI 3D Generation Software

Software	Description	License	Tags	Best For
3DTopia	Open-source text-to-3D pipeline (coarse → refined).	Open Source	Coarse→Refined · Open Source	Text-to-3D open-source
InstantMesh	Fast single-image-to-3D reconstruction.	Open Source	Fast · Image→3D	Fast image→3D
Intangible	Prompt and drag-drop to 3D scenes for previz. No-code.	Freemium	Previz · No-code	No-code 3D previz
Kaedim	Image-to-3D with hybrid AI plus artist cleanup for production quality.	Paid	Hybrid AI+Artist · Production	Production-quality via hybrid AI+artist
Luma Genie	Text/image to 3D, integrated with Dream Machine.	Freemium	Luma · Integrated	Luma-integrated 3D gen
Meshy v4	Production-reliable, improved topology and PBR textures.	Freemium	Production-Reliable · PBR	Production-ready AI 3D
Rodin Gen-2 (Hyper3D)	10B params, photorealistic, free generation tier.	Freemium	10B Params · Free Tier	Photorealistic AI 3D
Sloyd	Procedural 3D generation with parametric control.	Freemium	Procedural · Parametric	Parametric procedural 3D
Spline AI	Generate 3D objects and textures from text prompts in-editor.	Freemium	In-Editor · Text→3D	In-editor AI 3D
Stability SPAR3D	Open-source single-image 3D reconstruction.	Open Source	Stability AI · Open Source	Single-image open-source 3D
TRELLIS.2 (Microsoft)	Full PBR materials, complex topologies.	Open Source	Microsoft · PBR + Open Source	PBR AI 3D (open source)
Tripo v3.0	Sculpture-level precision, clean quad topology.	Freemium	Clean Quads · Sculpt-level	Clean topology AI 3D
TripoSR	Tripo/Stability collab. Fast open-source image-to-3D.	Open Source	Open Source · Fast	Fast open-source image→3D
Unique3D	High-quality mesh from single image (NeurIPS 2024).	Open Source	Single-Image · NeurIPS	Single-image open-source 3D
Wonder3D++	Cross-domain diffusion, textured meshes in 2-3min.	Open Source	Fast · Textured Mesh	Fast textured mesh gen

AI Audio & Music Generation Software

Also in Software Reference → AI Audio & Music Generation Software

Software	Description	License	Tags	Best For
ACE Studio	AI singing voice synthesizer with expression control.	Freemium	Singing · Expression	AI singing synthesis
AIVA	AI composition, good for orchestral/cinematic scoring.	Freemium	Orchestral · Scoring	Orchestral/cinematic scoring
DiffRhythm	Open-source full-song gen with vocals from lyrics.	Open Source	Open Source · Lyrics→Song	Open-source song gen with vocals
ElevenLabs	Voice cloning, narration, music generation.	Freemium	Voice Cloning · TTS	Voice cloning, TTS
F5 TTS	Open-source zero-shot voice cloning TTS.	Open Source	Zero-Shot · Voice Clone	Zero-shot voice cloning
Fish Audio	Open-source TTS with voice cloning, fast and multilingual.	Open Source	Open Source · Multilingual	Open-source TTS
Suno v5	Full song generation, 100M+ users.	Freemium	Song Gen · 100M users	Full song generation
Udio	Strong electronic/pop, licensed for commercial use.	Freemium	Electronic/Pop · Commercial	Commercial-licensed AI music

AI-Assisted CG Software

Also in Software Reference → AI-Assisted CG Software

Software	Description	License	Tags	Best For
Autodesk Flow Studio (Wonder Dynamics)	AI VFX, auto CG characters in live footage, USD export.	Paid	Auto CG Chars · USD Export	AI CG characters in live plates
Blockade Labs Skybox AI	AI-generated 360° skyboxes/HDRIs from text.	Freemium	AI Skybox · 360°	AI skyboxes/HDRIs
Gigapixel AI	AI photo enlargement that adds detail to upscaled images.	Paid		Detail-preserving upscaling
Let's Enhance	Photo upscaler up to 16x with a free tier.	Freemium	Photo Upscale	Photo upscaling
Magnific AI	Creative upscaler that adds detail. Folded into Freepik plans Apr 2026.	Paid	Creative Upscale · Freepik	Creative AI upscaling
Rosebud AI	Vibe coding platform for creating 3D games and interactive web apps with AI.	Freemium	AI Game Builder · Vibe Coding
Topaz Photo AI / Video AI	Upscaling, denoising, and sharpening. Topaz Bloom adds unlimited creative upscaling.	Paid	Upscale · Denoise	Upscale/denoise
Unity Muse	Unity in-editor AI for texture, sprite, and animation generation.	Paid	Unity AI Suite
Upscayl	Open-source desktop upscaler that runs locally at no cost.	Open Source	Open Source · Local	Free local upscaling

AI Creative Canvas Software

Also in Software Reference → AI Creative Canvas Software

Software	Description	License	Tags	Best For
Flora	Node canvas for image and video gen with 50+ models and Style DNA.	Freemium	Node Canvas · Multi-model	Visual node workflows
Freepik Spaces	Node workflows tied to the Freepik stock library.	Freemium	Node Workflows · Stock Library	Node workflows with stock
Higgsfield Canvas	Node editor chaining prompts, style transfer, motion, and render across image and video.	Freemium	Node Editor · Image + Video	Image to video pipeline
Kaiber	Unified canvas for image and video generation and editing. Superstudio.	Paid	Unified Canvas · Image + Video	Unified gen canvas
Weavy	Node canvas with pro editing for compositing, matte, and relighting. Acquired by Figma.	Freemium	Node Canvas · Pro Editing	Node canvas plus editing
Wireflow	AI workflow canvas with a developer lean.	Freemium	Workflow Canvas	Dev-leaning AI canvas

AI Design Viz Software

Also in Software Reference → AI Design Viz Software

Software	Description	License	Tags	Best For
Arko.ai	Arch-viz AI renderer for sketch and 3D model screenshots.	Freemium	Architecture	Arch-viz rendering
LookX	Arch-viz AI renderer with style references.	Freemium	Architecture	Arch-viz rendering
Mnml.ai	Arch-viz AI renderer from sketch and 3D screenshot input.	Freemium	Architecture	Arch-viz rendering
NewArc	Sketch to render for product, fashion, and automotive design.	Freemium	Sketch to Render	Product and fashion render
SketchPro	Sketch, SketchUp, and massing to render for architecture.	Freemium	Architecture	Architecture massing render
Visualizee	Sketch and 3D screenshot to render for architecture and interiors.	Freemium	Architecture · Sketch to Render	Architecture rendering
Vizcom	Sketch to render for industrial, product, and footwear design with line control.	Freemium	Sketch to Render · Product Design	Industrial design sketching

AI Product Photography Software

Also in Software Reference → AI Product Photography Software

Software	Description	License	Tags	Best For
Claid.ai	Catalog image cleanup and AI product shots at ecommerce scale.	Freemium	Ecommerce · Product Shots	Catalog cleanup at scale
Emersya	3D and AR product experiences from ideation to ecommerce.	Paid	3D + AR	3D and AR product viz
Flair.ai	AI product photography canvas. Drag products into staged scenes.	Freemium	Product Photography	Staged product scenes
Omi	3D digital twins of products for virtual photoshoots. Built for CPG brands.	Paid	Digital Twins · Product Shots	CPG product twins
Photoroom	Marketplace-first mobile app for batch product listing images.	Freemium	Mobile · Batch	Batch listing images
Threekit	Enterprise 3D twins with photoreal renders and configurators from one file.	Paid	Enterprise · Configurators	Enterprise 3D twins