An approach that identifies chemical and enzyme-sequence spaces, and predicts the compatibility between them, can streamline and reduce risk in the discovery of enzymes that catalyse a desired reaction. The strategy uses high-throughput experiments to generate data about enzyme-mediated reactions, and includes a tool that predicts compatible substrate–enzyme pairs.