AI Pipeline
The AI pipeline is the core of article generation. It takes clustered source articles and produces verified, structured content through a generate-critique-revise loop.
Pipeline Flow
Clustered Sources (2+ unique sites required)
|
v
+------------------+
| Generator | Nemotron Super 49B
| (type detection) | Detects: deal, summary, roundup
+--------+---------+
|
v
+------------------+
| Critic #1 | Llama 3.3 70B Instruct
| Factual, source, |
| attribution, |
| hallucination |
+--------+---------+
|
issues found?
/ \
no yes
| |
| +------v-------+
| | Auto-Revise | Generator rewrites
| +------+-------+ with issue list
| |
| +------v-------+
| | Re-Verify | Critic scores again
| +------+-------+ Keep better version
| |
+------+-----+
|
v
+------------------+
| Detail Extraction| Extracts key details
| (if needed) | and links via AI
+--------+---------+
|
v
+------------------+
| Scout's Take | Brief editorial opinion
+--------+---------+
|
v
+------------------+
| Image Pipeline | See Image Pipeline docs
+--------+---------+
|
v
PipelineResultModels
| Role | Model | Purpose |
|---|---|---|
| Generator | nvidia/llama-3.3-nemotron-super-49b-v1.5 | Article generation and revision |
| Critic | meta/llama-3.3-70b-instruct | Factual verification and scoring |
| Vision | meta/llama-3.2-90b-vision-instruct | Image relevance scoring (Phase 2) |
All models are accessed through the NVIDIA API at https://integrate.api.nvidia.com/v1.
Generation Process
The generator receives clustered source articles and produces structured output. Article type is auto-detected from source content:
- deal – time-limited offers (bonuses, sales, discounts, signup offers)
- summary – news, changes, analysis, reviews
- roundup – multiple related stories grouped together
The generator prompt enforces a specific writing style: informative and practical, not promotional. The output uses structural markers for parsing:
Title line
TLDR: One-sentence summary
KEY_DETAILS_START
- Label: Value
- Label: Value
KEY_DETAILS_END
LINKS_START
- Link Text | URL
LINKS_END
Article body content...Critique Process
The critic evaluates generated articles against the original sources on four dimensions:
| Check | What It Verifies |
|---|---|
FactualCheck | All facts match the source material |
AttributionCheck | Claims are properly attributed to sources |
CompletenessCheck | Key information from sources is included |
HallucinationCheck | No invented facts or unsupported claims |
The critic produces a score from 0-100 which is then adjusted:
- Hallucination failure caps the score at 60
- Each other failed check deducts 10 points
- Any failure caps the score at 85
- Two or more failures cap the score at 70
Articles scoring at or above critic_threshold (default 75) are marked as passing.
Revision Flow
When the critic finds issues, the pipeline automatically triggers one revision cycle:
- The generator receives the original content plus the list of issues
- It rewrites the article addressing each issue
- The critic re-evaluates the revised version
- The higher-scoring version is kept
Admins can also trigger manual revisions through the dashboard by providing feedback text.
Detail Extraction
If the structured output parsing does not yield key details, a separate AI call extracts them. The extraction prompt asks for:
- Key details – label/value pairs summarizing the most important facts
- Links – relevant URLs mentioned in the article
Scout’s Take
Each article includes a brief editorial opinion from the “Scout” persona. This is generated separately and stored in the scouts_take field. It provides a practical recommendation or perspective beyond the factual reporting.
Image Pipeline
After article generation, the image pipeline finds and scores a relevant image. See the dedicated page for details.