Overview
Welcome to The 3rd Workshop on Synthetic Data for Computer Vision (SynData4CV) at CVPR 2026!
During the last decade, advances in computer vision have been catalyzed by the release of meticulously curated human-labeled datasets. Recently, people have increasingly resorted to synthetic data as an alternative to labor-intensive human-labeled datasets for its scalability, customizability, and cost-effectiveness. Synthetic data offers the potential to generate large volumes of diverse and high-quality vision data, tailored to specific scenarios and edge cases that are hard to capture in real-world data. However, challenges such as the domain gap between synthetic and real-world data, potential biases in synthetic generation, and the generalizability of models trained on synthetic data remain.
This workshop aims to provide a forum for discussion and encouragement of further exploration in these areas.
Topics of interest include, but are not limited to:
- Effectiveness: What is the most effective way to generate and leverage synthetic data? Does synthetic data need to "look" realistic?
- Efficiency and scalability: Can we make synthetic data generation more efficient and scalable without much sacrifice on the quality?
- Benchmark and evaluation: What benchmark and evaluation methods are needed to assess the efficacy of synthetic data for computer vision?
- Risks and ethical considerations: How can we mitigate the risks of generating and using synthetic data? How do we address relevant ethical questions, such as bias amplification in synthetic datasets?
- Applications: What are other tasks in computer vision or other related fields (e.g., robotics, NLP) that could benefit from synthetic data?
- Other open problems: How do we decide which type of data to use, synthetic or real-world data? What is the optimal way to combine both if both are available?
Invited Speakers
Schedule
- 1:00 – 1:10 PM Opening Remarks Opening
- 1:10 – 1:45 PM Invited Talk · Manling Li Talk
- 1:45 – 2:20 PM Invited Talk · Jia Deng Talk
- 2:20 – 2:55 PM Invited Talk · Georgia Gkioxari Talk
- 2:55 – 3:10 PM Break Break
- 3:10 – 3:45 PM Invited Talk · Andrew Owens Talk
- 3:45 – 4:20 PM Invited Talk · Nupur Kumari Talk
- 4:20 – 4:30 PM Closing Remarks Closing
- 4:30 – 5:30 PM Poster Session Poster
Accepted Papers · 54 papers, sorted alphabetically
-
Addressing Data Scarcity in Depth-Based Human Action Recognition via Zero-Shot Depth Estimation
-
AfriST-VQA: Benchmarking MLLMs for Scene-Text Visual Question Answering for African Languages
-
Appreciate the View: A Task-Aware Evaluation Framework for Novel View Synthesis
-
Assessing the Predictive Value of Physics-Grounded Synthetic Data for Computer Vision in Space Environments
-
Auto-Comp: Scalable Controlled Synthetic Benchmarks for VL Compositionality
-
Avatar4D: Synthesizing Domain-Specific 4D Humans for Real-World Pose Estimation
-
Beyond Objects: Contextual Synthetic Data Generation for Fine-Grained Classification
-
Beyond Photorealism: Counterfactual Synthetic Bundles for Invariant Sim-to-Real Vision
-
Beyond Raw Signals: Undecoded Generative Latents as Privileged Synthetic Data
-
Completing Missing Modalities: Synthetic Data for RGB–Infrared–Thermal–Text Person Re-Identification
-
CryoDiff: Cryo-EM Synthesis via Biophysics and Cycle-Consistent Diffusion
-
Diffusion-Augmented Coreset Expansion for Scalable Dataset Distillation
-
Disentangled Anatomy-Disease Diffusion (DADD) for Controllable Ulcerative Colitis Progression Synthesis
-
Durian: Dual Reference Image-Guided Portrait Animation with Attribute Transfer
-
Evaluating the Trade-offs of MDL-to-UsdPreviewSurface Material Simplification in NVIDIA Isaac Sim: Visual Quality, Feature Preservation, and AI Task Performance
-
Few-Shot Synthetic Data Generation with Diffusion Models for Downstream Vision Tasks
-
Fréchet Inception Distance is Failing to Preserve Rank Consistency for Synthetic Out-of-Distribution Samples
-
Generating Synthetic Illumination Variation with Co-Located Relighting
-
Grounding Synthetic Data Generation With Vision and Language Models
-
How Far Can We Go With Synthetic Data for Audio-Visual Sound Source Localization?
-
Hybrid Rendering for Multimodal Autonomous Driving: Merging Neural and Physics-Based Simulation
-
iARCS: Iterative Agentic RL for Controllable 3D Scene Generation
-
Masked Language Prompting for Generative Data Augmentation in Few-shot Fashion Style Recognition
-
Multi-Objective Photoreal Simulation (MOPS) Dataset for Computer Vision in Robot Manipulation
-
Narrowing the Performance Gap in Synthetic VLM Pre-training via Multi-Generator Ensembles
-
Object-Centric Data Synthesis for Category-level Object Detection
-
One Category One Prompt: Dataset Distillation using Diffusion Models
-
OrbitArch
-
Personalized Generative Models for Contextual Debiasing
-
PLLM: Pseudo-Labeling Large Language Models for CAD Program Synthesis
-
Privacy-Aware Synthetic Video Benchmarking and Relational Evaluation for Worker-Under-Suspended-Load Detection
-
ProductConsistency: Improving Product Identity Preservation in Instruction-Based Image Editing via SFT and RL
-
RareCrafter: Controllable Generative Augmentation for Rare Object Detection in Driving Scenes
-
Realiz3D: 3D Generation Made Photorealistic via Domain-Aware Learning
-
Representation-Conditioned Diffusion Models for Guided Training Data Generation
-
Restereo: Unifying diffusion stereo video generation and restoration
-
SAIL: Similarity-Aware Guidance and Inter-Caption Augmentation-based Learning for Weakly-Supervised Dense Video Captioning
-
Scaling Up 3D Forest Vision with Synthetic LiDAR
-
Sea-Mie: Physically-Based Synthetic Fog for Maritime Image Defogging via Curriculum Learning
-
Sequential Dataset for Satellite Pose Estimation and a Frequency-Space Neural Operator for HIL-Free Generalization Benchmarking
-
Sim-to-Real Metrology: Calibrated Digital Twins for Fringe Projection Profilometry
-
SJEPA: Joint Embedding Predictive Architecture for Synthetic-to-Real Alignment
-
Structure-Consistent Joint Image-Mask Synthesis for Data-Scarce Medical Image Segmentation
-
Structure-retained low-rank adapters for weather synthesis
-
StyleText: A Large-Scale Dataset and Benchmark for Stylized Scene Text Inpainting
-
Synthetic Data Generation for Long-Tail Medical Image Classification: A Case Study in Skin Lesions
-
Synthetic Designed Experiments for Diagnosing Vision Model Failures
-
Theory of Space: Evaluating Active Spatial Belief Construction in Foundation Models with Synthetic 3D Environments
-
Vanast: Virtual Try-On with Human Image Animation via Synthetic Triplet Supervision
-
Video-Consistent Synthetic Skiing Trajectories
-
WaterGen: Decoupling Scene and Medium in Underwater Image Generation
-
When Does Synthetic Data Help? A Spectral Theory of Task-Relevant Domain Gap with Applications to Guided Generation and Bias Auditing
-
Why Training with Synthetic Data Fails for OOD: Distribution Gap Amplifies Noise Misalignment in Diffusion Models
-
WireSeg-32K: A Physics-Grounded Synthetic Dataset for Wire Instance Segmentation
Call for Papers
We invite submissions on topics related to synthetic data for computer vision, including but not limited to:
- Novel methods for generating synthetic data
- Techniques for bridging the domain gap between synthetic and real data
- Benchmarks and evaluation metrics for synthetic data
- Applications of synthetic data in various computer vision tasks
- Ethical considerations and bias mitigation in synthetic data generation
- Efficient and scalable synthetic data generation pipelines
- Papers should be formatted according to the CVPR 2026 template
- Short papers: 4 pages (excluding references); Long papers: 8 pages (excluding references)
- Submissions should be made through OpenReview
- All submissions will be double-blind reviewed
- Accepted papers will NOT be included in CVPR proceedings, so there are no double submission concerns.
Important Workshop Dates
-
Submission OpensFebruary 25, 2026
-
Submission DeadlineMarch 17, 2026 · 11:59 AM UTC
-
NotificationTBD
-
Camera ReadyTBD
-
Workshop DateJune 4, 2026 · Room 607
Related Workshops
- Synthetic Data for Computer Vision @ CVPR 2025
- Synthetic Data for Computer Vision @ CVPR 2024
- Machine Learning with Synthetic Data @ CVPR 2022
- Synthetic Data for Autonomous Systems @ CVPR 2023
- Synthetic Data Generation with Generative AI @ NeurIPS 2023