- Transformed unstructured project documents into structured data using Python
- Extracted embedded text from PDFs with pdfplumber and applied OCR with pytesseract to scanned pages and images
- Designed and implemented prompts that guide machine learning models to identify and extract key data
- Streamlined the data extraction and analysis process, improving construction project assessment accuracy and reducing manual errors
- Accuracy: Achieved 82.35% extraction accuracy across different test samples and conditions
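
The text-extraction step described above can be sketched roughly as follows. This is a minimal illustration, not the project's actual code: the function names `page_text_with_ocr_fallback` and `extract_pdf_text`, the 20-character threshold, and the 300 DPI rendering resolution are all assumptions made for the example. The idea is to prefer a PDF page's embedded text layer and fall back to OCR only when that layer is missing or nearly empty, as is typical for scanned pages.

```python
from typing import Callable, List, Optional


def page_text_with_ocr_fallback(
    text_layer: Optional[str],
    run_ocr: Callable[[], str],
    min_chars: int = 20,  # assumed threshold; tune for the documents at hand
) -> str:
    """Return the embedded text if it looks substantial, otherwise run OCR."""
    text = (text_layer or "").strip()
    if len(text) >= min_chars:
        return text
    # Little or no embedded text: treat the page as a scan and OCR it.
    return run_ocr().strip()


def extract_pdf_text(path: str) -> List[str]:
    """Hypothetical helper: return one text string per page of the PDF."""
    # Imported lazily so the fallback logic above is usable without them.
    import pdfplumber
    import pytesseract

    pages: List[str] = []
    with pdfplumber.open(path) as pdf:
        for page in pdf.pages:
            def run_ocr(page=page) -> str:
                # Render the page to a PIL image, then pass it to Tesseract.
                image = page.to_image(resolution=300).original
                return pytesseract.image_to_string(image)

            pages.append(page_text_with_ocr_fallback(page.extract_text(), run_ocr))
    return pages
```

Passing the OCR step in as a callable means the comparatively slow Tesseract call runs only for pages that need it, and the fallback decision can be checked without a real PDF. The per-page strings returned by `extract_pdf_text` would then be the input to the prompt-based extraction stage.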