• Led the extraction of key data from IRS instruction documents using large language models (LLMs), integrating the GPT-4 and Gemini APIs
• Designed and iteratively refined prompts to improve extraction accuracy, significantly improving data retrieval efficiency
• Converted extracted data into RDF statements that conform to predefined schemas, ensuring consistency and interoperability
• Validated and quality-checked the RDF data for accuracy and completeness