Job Summary
A company is looking for a Machine Learning Engineer to enhance and optimize its data extraction pipeline for commercial real estate lease processing.
Key Responsibilities
- Improve and maintain the data extraction pipeline for lease document processing
- Fine-tune and retrain existing ML models for text categorization
- Own the QA process for ML outputs and continuously optimize model performance
Required Qualifications
- Proven experience in machine learning, focusing on text classification and document processing
- Strong proficiency in Python and core NLP libraries (e.g., spaCy, NLTK, scikit-learn, transformers)
- Experience with TF-IDF vectorization and traditional ML techniques for text classification
- Familiarity with OCR technologies and PDF parsing tools
- Experience deploying models on AWS and working with LLM APIs such as those for OpenAI models and Claude