Andrei Goponenko
Skill
Experience
Vetting results
Machine learning
Certifications
Experience
Skills
Languages: Python, SQL
Tools: Pytorch, TensorFlow, Keras, NumPy, SciPy, Scikit-Learn, OpenCV, Skimage, Pandas, Flask
Work Experience
Machine Learning Engineer, QuantumSoft LLC, 2022 - present
Clearlaw project / AI - based contract-review and management
Korn Ferry project / Recommendation platform for HR specialists
Achieved:
- Developed ML and service parts of graph for career transitions which are
used in recommendations.
- Created service to extract candidate skills from the raw text.
Technology Stack: Python (Pandas, Numpy, Scikit-learn, spaCy, Jupyter), Azure (Databricks, Pipelines, Devops), PySpark, Git, YAML.
Minnow project / Aggregator for series and movies from the most popular streaming platforms
Achieved:
- Developed matching algorithm for links from streaming platforms and corresponding movies / shows on Minnow's site: brought matching rate 1.7x higher for movies and 2.1x higher for shows. Matching algorithm included more thorough data preprocessing, search by several similarity metrics, fuzzy matching by several fields, validation top-k candidates.
- Trained language models (FastText) on domain-specific texts.
- Worked on data analysis to enhance overall data quality.
Technology Stack: Python (Pandas, Numpy, Scikit-learn, FastText, Plotly, Jupyter), PostgreSQL.
Anagram project / Insurance billing automation SaaS for doctors and patients /
Achieved:
- Built predictive models for set of benefits based on prior data about patients: list of benefits and coverage, copay, quantity for each benefit.
- Cluster analysis of insurance plans for better understanding of patient segments being verified in Anagram app.
- Built churn prediction model and tested the bunch of hypotheses for key components of customer health scoring system.
Technology Stack: Python (Pandas, Numpy, Scipy, Scikit-learn, imbalanced-learn, XGBoost, Lightgbm, Catboost, UMAP, SHAP, LIME, Matplotlib, Seaborn, Plotly, Jupyter), Linux, Git.
Clearlaw project / AI - based contract-review and management
Achieved:
- Developed an algorithmic part of system to help legal teams exchange and analyze documents at fast pace: it included classification models for short and long texts, sequential text classification, NER.
- Took part in deployment trained models to microservice infrastructure.
- Training language models (GloVe, fastText, ELMo) on domain-specific texts, fine-tuning pretrained models in downstream tasks.
- Implemented and deployed brat-based annotation pipeline for text classification task with auto-tagging.
- Ad-hoc analysis and EDA for business hypotheses.
Technology Stack: Python (Pandas, Numpy, Scipy, Scikit-learn, XGBoost, Lightgbm, Catboost, PyTorch, Keras, Spacy, NLTK, Matplotlib, Seaborn, Plotly, Jupyter), Linux, Git, AWS, Docker, SQLLite, Brat.
Data Scientist, SIBUR, 2017 - 2018
Worked on projects for several plants: optimization of technological processes, predictive maintenance.
Achieved:
- Created ML models in frames of operation efficiency projects on several SIBUR plants. ML models were to forecast quality of products, energy consumption and material fluxes.
- Deployment of trained ML models to production for use by plant personnel: gathering requirements, optimization of models output, visualization of
dashboards.
- Support of the existing solutions: retraining models due to new process conditions, improvement of dashboards, complex rebuilding of solution framework.
- IT infrastrucure maintenance and development for data science projects.
- Support of users in questions of data mining software.
Technology Stack: Python (Pandas, Numpy, Scipy, Scikit-learn, XGBoost, Lightgbm, Matplotlib, Seaborn, Plotly, Jupyter), Proficy CSense, Proficy Historian, Advanced Excel (VBA scripts, Pivot, Analysis ToolPak), R (Regression, Visualization).
Field Engineer Intern, Schlumberger, 2015 - 2015
Working with reservoir monitoring and control equipment: downhole pressure and temperature gauges, fiber optics equipment. Participation in workshop and lab activities.