Skip to content

Data-Driven Approach to Predict Polymer Glass Transition Temperature: Interpretability and Extrapolation

License

Notifications You must be signed in to change notification settings

Tamina2022/Tamina-ML-Tg-project

Repository files navigation

Tamina-ML-Tg-project

This repository contains code for predicting the glass transition temperature (Tg) of polymers using machine learning techniques. The model utilizes Extended Connectivity Fingerprints (ECFP) for feature representation and implements various regression algorithms, including LightGBM, XGBoost, and Random Forest.

We utilized the dataset from https://doi.org/10.1038/s41524-023-01088-3, but instead of the OPSIN data used in the reference, we generated canonical SMILES and ECFP4 fingerprints for the polymers to enhance our analysis. Features Data Preparation: Converts SMILES representations of polymers to ECFP4. Model Training: Implements multiple regression models for Tg prediction. Evaluation Metrics: Evaluates model performance using RMSE and R². Requirements Python 3.x Pandas NumPy Scikit-learn LightGBM XGBoost CatBoost RDKit Usage Clone this repository. Install the required packages: bash Copy code pip install -r requirements.txt Run the main script to train the models and make predictions.

About

Data-Driven Approach to Predict Polymer Glass Transition Temperature: Interpretability and Extrapolation

Topics

Resources

License

Stars

Watchers

Forks

Releases

No releases published