Extrapolative Machine Learning for Accurate Efficiency Prediction in Non-Fullerene Ternary Organic Solar Cells: Leveraging Computable Molecular Descriptors in High-Throughput Virtual Screening

Jian Ming Liao, Hui Hsu Gavin Tsai

Research output: Contribution to journalArticlepeer-review

1 Scopus citations

Abstract

Adding a third component to binary organic solar cells (OSCs) enhances ternary OSCs, boosting power conversion efficiency (PCE). However, developing and optimizing appropriate donors, acceptors, and ternary materials remains a complex and demanding task. This study presents four machine-learning (ML) predictive models using XGBoost and ANN approaches, utilizing both experimental and DFT-derived HOMO and LUMO levels for efficient high-throughput virtual screening (HTVS) of top candidates based on PCE. Two distinct latent databases were employed for HTVS: one consisting of 429 413 uniquely recombined ternary OSC systems from experimentally available data, and another comprising ≈2.3 million unique donor molecules from the Harvard Clean Energy Project database (CEPDB). The four ML models demonstrated notable predictive accuracy for PCE on a test dataset containing molecules closely aligned with the training set (interpolation). However, the XGBoost model showed constrained extrapolative ability for molecules significantly divergent from those in the training dataset. In contrast, the ANN models displayed a robust extrapolative capacity in HTVS, successfully predicting new potential ternary OSC systems and leading donors with PCE values exceeding 20%. Our ML models use HOMO and LUMO inputs for donors, acceptors, and ternaries, facilitating efficient optimization via rapid HTVS of high-performance ternary materials.

Original languageEnglish
Article number2400287
JournalSolar RRL
Volume8
Issue number13
DOIs
StatePublished - Jul 2024

Keywords

  • clean energy
  • computable molecular descriptor
  • high throughput virtual screening
  • predictive machine learning model
  • ternary organic solar cells

Fingerprint

Dive into the research topics of 'Extrapolative Machine Learning for Accurate Efficiency Prediction in Non-Fullerene Ternary Organic Solar Cells: Leveraging Computable Molecular Descriptors in High-Throughput Virtual Screening'. Together they form a unique fingerprint.

Cite this