Skip to content

Added method for prompt generation from raw data#272

Merged
dmjoy merged 6 commits into
mainfrom
dev/unstructured-translation
May 19, 2026
Merged

Added method for prompt generation from raw data#272
dmjoy merged 6 commits into
mainfrom
dev/unstructured-translation

Conversation

@alright-code
Copy link
Copy Markdown

Two phase approach for prompt generation:

  1. Extract the relevant information from the raw dataset
  2. Simplify extracted entries into a concise list
    phase2_direct_gen.yaml shows how this can be configured for medical kdma.

Comment thread align_system/prompt_engineering/direct_regression_system_prompt.py
Comment thread align_system/algorithms/direct_regression_adm_component.py Outdated
if callable(system_prompt_template):
system_prompt = call_with_coerced_args(
system_prompt_template,
{'model':self.structured_inference_engine.model}
Copy link
Copy Markdown
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Wondering why we aren't initializing this component with structured_inference_engine and instead opting to pass in the model and create a new generator (looks like we're just passing in the structured_inference_engine.model anyway). Are there parameters that need to be different or are we otherwise invoking this differently that makes this necessary? The main detractor to doing it this way is that we can't freely swap out the inference engine backend.

Comment thread align_system/prompt_engineering/direct_regression_system_prompt.py
@dmjoy
Copy link
Copy Markdown
Contributor

dmjoy commented May 19, 2026

Merging as-is, but when / if we need to revisit we should opt to use the structured_inference_engine rather than pass the model instance.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants