Pre-training Molecular Graph Representation with 3D Geometry

Pre-training Molecular Graph Representation with 3D Geometry
-- Rethinking Self-Supervised Learning on Structured Data
ICLR 2022

¹Mila
²Université de Montréal
³University of Cambridge
⁴MPI for Intelligent Systems, Tübingen

⁵National Research Council Canada
⁶HEC Montréal
⁷CIFAR AI Chair

Abstract

Molecular graph representation learning is a fundamental problem in modern drug and material discovery. Molecular graphs are typically modeled by their 2D topological structures, but it has been recently discovered that 3D geometric information plays a more vital role in predicting molecular functionalities. However, the lack of 3D information in real-world scenarios has significantly impeded the learning of geometric graph representation. To cope with this challenge, we propose the Graph Multi-View Pre-training (GraphMVP) framework where self-supervised learning (SSL) is performed by leveraging the correspondence and consistency between 2D topological structures and 3D geometric views. GraphMVP effectively learns a 2D molecular graph encoder that is enhanced by richer and more discriminative 3D geometry. We further provide theoretical insights to justify the effectiveness of GraphMVP. Finally, comprehensive experiments show that GraphMVP can consistently outperform existing graph SSL methods.

Method: GraphMVP

We start by aiming at maximizing the lower bound for MI: $$I(X;Y) \ge \mathcal{L}_{\text{MI}} = \frac{1}{2} \mathbb{E}_{p(x,y)} \big[ \log p(y|x) + \log p(x|y) \big].$$

This is essentially two conditional log-likelihood terms. Then we formulate each conditional term with:

In Sec 3.2, we describe how to framing it as an energy-based model (EBM).
- We propose EBM-NCE, a contrastive SSL objective with noise contrastive estimation (NCE).
- Thus, we successfully connect EBM and SSL, especially with the latest contrastive learning method.
- This opens a big track, where other EBM methods (contrastive divergence, score matching) can also be applied. We leave this for future exploration.
In Sec 3.3, we describe a variational lower bound.
- The reconstruction is hard for structured data, like molecular graph. Thus, we propose a variational representation reconstruction (VRR), a generative SSL objective.
- VRR provides another perspective of explaining the intuition behind non-contrastive SSL method (BYOL, SimSiam).

Citation

@inproceedings{liu2022pretraining,
    title={Pre-training Molecular Graph Representation with 3D Geometry},
    author={Shengchao Liu and Hanchen Wang and Weiyang Liu and Joan Lasenby and Hongyu Guo and Jian Tang},
    booktitle={International Conference on Learning Representations},
    year={2022},
    url={https://openreview.net/forum?id=xQUe1pOKPam}
}