Machine learning-driven data integration for drug discovery