LLM-Based Automated Code Synthesis & Evaluation Pipeline
Python, Java, pytest, Large Language Models (LLMs), JSON, Pandas, SciPy, Shell Scripting, Git, Prompt Engineering
- Developed an end-to-end automation pipeline using LLMs to generate programming-problem datasets and select a reproducible subset for evaluation.
- Engineered systems to interact with large language models for both code synthesis and translation tasks, including experiments with advanced quantization configurations.
- Implemented automated LLM prompting to detect bugs and generate detailed code-coverage reports, improving code quality.
- Designed integrated validation routines to verify the integrity of generated datasets, model outputs, and essential project artifacts.
- Built command-line scripts and automated workflows to orchestrate dataset preparation, model prompting, and result evaluation, ensuring reproducibility and reliability.
- Streamlined the code evaluation and translation process, enabling reproducible experimental research and development in automated code synthesis.
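
A minimal sketch of the kind of dataset-validation routine described above, assuming a JSONL dataset format; the field names (`problem_id`, `prompt`, `reference_solution`, `tests`) are illustrative assumptions, not the project's actual schema:

```python
import json

# Hypothetical record schema, assumed for illustration only.
REQUIRED_FIELDS = {"problem_id", "prompt", "reference_solution", "tests"}

def validate_record(record: dict) -> list:
    """Return a list of validation errors for one dataset record."""
    errors = []
    missing = REQUIRED_FIELDS - record.keys()
    if missing:
        errors.append(f"missing fields: {sorted(missing)}")
    if not str(record.get("prompt", "")).strip():
        errors.append("empty prompt")
    return errors

def validate_dataset(lines: list) -> dict:
    """Validate a JSONL dataset; map line index -> errors (empty dict = clean)."""
    report = {}
    for i, line in enumerate(lines):
        try:
            record = json.loads(line)
        except json.JSONDecodeError as exc:
            # Malformed JSON is reported instead of crashing the pipeline.
            report[i] = [f"invalid JSON: {exc}"]
            continue
        errs = validate_record(record)
        if errs:
            report[i] = errs
    return report

# Example usage with one valid and one incomplete record.
good = json.dumps({"problem_id": 1, "prompt": "Reverse a string.",
                   "reference_solution": "s[::-1]",
                   "tests": ["assert f('ab') == 'ba'"]})
bad = json.dumps({"problem_id": 2, "prompt": ""})
print(validate_dataset([good, bad]))
```

In a pipeline like the one above, a routine of this shape would run before model prompting so that malformed records are flagged early rather than silently skewing evaluation results.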