Align VLM caption model, fix V2 ingest() example, document run_pipeline; misc README/Helm fixes
Updating files per bugs 5966185, 5966211, and 5966281 #1742

Merged
sosahi merged 2 commits into NVIDIA:26.03 from kheiss-uwzoo:kheiss/qa-review8 on Mar 31, 2026

Conversation

@kheiss-uwzoo
Collaborator

Align VLM caption model, fix V2 ingest() example, document run_pipeline; misc README/Helm fixes

Summary

This change brings documentation and a pipeline default in line with the deployed VLM stack (Nemotron Nano 12B v2 VL), fixes a broken V2 Python quickstart snippet, and adds accurate library-mode reference for run_pipeline (parameters, defaults, and return types). Follow-up edits correct the stable-release branch link, image-caption fallback in the custom summarization pipeline, and a few Helm README details.

Internal refs: NVBug 5966185, 5966211, 5966281 (per commit message).

What changed

Documentation (docs/docs/extraction/)

| Area | Change |
| --- | --- |
| VLM / caption | Default and example model_name updated from nvidia/llama-3.1-nemotron-nano-vl-8b-v1 to nvidia/nemotron-nano-12b-v2-vl in python-api-reference.md and nv-ingest-python-api.md. Quickstart profile table: the vlm row now points to Nemotron Nano 12B v2 VL and the matching model card URL in quickstart-guide.md. |
| V2 API | The minimal example in v2-api-guide.md now uses .ingest(return_full_response=True) and prints results[0]['metadata']['total_pages'], so the sample matches the returned list of per-job responses, each with top-level metadata. |
| Library mode | quickstart-library-mode.md: new run_pipeline section with a parameter table (optional defaults per pipeline_runners.py), a return-type table (RayPipelineInterface vs RayPipelineSubprocessInterface vs float), and a pointer to the source docstring. |

Config

  • config/custom_summarization_pipeline.yaml: image caption model_name fallback updated to nvidia/nemotron-nano-12b-v2-vl when VLM_CAPTION_MODEL_NAME is unset (matches default_pipeline.yaml / compose defaults).

README / Helm (local only; commit or squash before merge)

  • README.md: stable release branch link target set to NVIDIA/nv-ingest 26.03.
  • helm/README.md: sample kubectl get pods output refreshed; grammar fix (“its process”); MIG example link corrected to mig/nv-ingest-mig-values.yaml.

Testing / verification
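
The V2 API item above describes the corrected sample as reading results[0]['metadata']['total_pages'] from a list of per-job responses. A minimal sketch of that shape follows, assuming each response is a dict with an optional top-level "metadata" dict. The helper name total_pages_per_job and the sample data are hypothetical stand-ins for illustration; they are not part of the client library or of this PR.

```python
from __future__ import annotations

from typing import Any


def total_pages_per_job(results: list[dict[str, Any]]) -> list[int | None]:
    """Return total_pages from each per-job response's top-level metadata.

    Assumes each response is a dict that may carry a top-level "metadata"
    dict, as results[0]['metadata']['total_pages'] in the corrected sample
    implies. Responses without that key yield None instead of raising.
    """
    return [job.get("metadata", {}).get("total_pages") for job in results]


# Hypothetical stand-in for the list returned by
# .ingest(return_full_response=True); real responses carry more fields.
sample_results = [
    {"metadata": {"total_pages": 12}, "data": []},
    {"metadata": {"total_pages": 3}, "data": []},
    {"data": []},  # a response with no top-level metadata
]

print(total_pages_per_job(sample_results))  # [12, 3, None]
```

Using .get() rather than direct indexing keeps the check from raising KeyError on a response that lacks metadata, which may or may not occur in practice.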

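The Config item above says the caption model_name falls back to nvidia/nemotron-nano-12b-v2-vl when VLM_CAPTION_MODEL_NAME is unset. The same rule can be sketched in Python as below. This only illustrates the fallback behavior, not how the YAML loader implements it; resolve_caption_model is a hypothetical helper name, and how the real pipeline treats an empty (but set) variable is not stated in the PR.

```python
import os
from typing import Mapping, Optional

# Fallback value named in the PR for config/custom_summarization_pipeline.yaml.
DEFAULT_VLM_CAPTION_MODEL = "nvidia/nemotron-nano-12b-v2-vl"


def resolve_caption_model(env: Optional[Mapping[str, str]] = None) -> str:
    """Return VLM_CAPTION_MODEL_NAME if it is set, else the fallback.

    Hypothetical helper: only an unset variable triggers the fallback here;
    an empty string is returned as-is.
    """
    env = os.environ if env is None else env
    return env.get("VLM_CAPTION_MODEL_NAME", DEFAULT_VLM_CAPTION_MODEL)


# With no override, the fallback applies.
print(resolve_caption_model({}))
# An explicit override wins over the fallback.
print(resolve_caption_model({"VLM_CAPTION_MODEL_NAME": "my-org/my-vlm"}))
```

Passing the environment as a mapping keeps the helper easy to check without mutating os.environ.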
kheiss-uwzoo requested review from jdye64 and sosahi on March 27, 2026 19:37
kheiss-uwzoo requested a review from a team as a code owner on March 27, 2026 19:37
kheiss-uwzoo added the doc (Improvements or additions to documentation) label on Mar 27, 2026
Collaborator

@jdye64 can you check these changes please?

sosahi merged commit c5e1c22 into NVIDIA:26.03 on Mar 31, 2026
4 of 6 checks passed
Labels

doc Improvements or additions to documentation

3 participants