Quality Assurance
U-Gen uses AI vision models to evaluate every generated keyframe and video segment before proceeding to the next pipeline stage. If content doesn't meet the quality threshold, it's automatically regenerated.
QA models
Image QA (keyframes)
Primary: GPT-4o-mini
Fallback: Gemini 2.5 Flash
Product visibility, persona consistency, prohibited content (phones), interaction mode, composition.
Video QA (segments)
Primary: Gemini 2.5 Flash
Fallback: Gemini 3 Flash Preview
Motion quality, visual consistency, artifacts, product positioning throughout the segment.
Strictness modes
Choose a QA mode when creating a job to control how strictly content is evaluated:
| Mode | Threshold |
|---|---|
| Lenient | 0.55 |
| Standard | 0.70 |
| Strict | 0.80 |
Retries are configured separately with sliders: keyframe retries (0–5, default 1) and video retries (0–5, default 0). When QA is enabled, keyframe retries minimum is 1.
Default Mode
Fail categories
When QA detects an issue, it's classified into one of these categories:
PHONE_DETECTEDPhone, camera, or recording device visible in frame.
MODE_VIOLATIONInteraction mode not respected (e.g., product not held when holdable).
PRODUCT_INTEGRITYProduct missing, distorted, or incorrectly represented.
CRITICAL_DEFECTMajor visual defect that makes the content unusable.
MOTION_ARTIFACTUnnatural motion, jitter, or warping in video segments.
TEMPORAL_INCONSISTENCYVisual inconsistency across frames (flickering, identity shift).
VISUAL_ARTIFACTMinor rendering artifacts (slight blur, compression noise).
BELOW_THRESHOLDOverall quality score fell below the configured threshold.
Phone Detection
Failure notifications
If you configure a Telegram chat ID in your settings, the system sends a notification via Telegram when a job fails QA after exhausting all retries. The message includes failure details and job ID. This is optional and disabled by default.