Thanks to the recent progress in numerical methods and computer technology, the application fields of artificial intelligence (AI) and machine learning methods (ML) are growing at a very fast pace. The field of geochemistry for nuclear waste management has recently started using ML for the acceleration of numerical simulations of reactive transport processes, for the improvement of multiscale and multiphysics couplings efficiency, and for uncertainty quantification and sensitivity analysis. Several case studies indicate that the use of ML based approaches brings an overall acceleration of geochemical and reactive transport simulations between one and four orders of magnitude. This paper presents a benchmarking exercise that aims at providing a set of reference data and models for developing and applying ML techniques for geochemical and reactive transport simulations. Several well-known geochemical speciation codes are used to generate systematically a consistent set of high-quality chemical equilibrium data, to be used as input for the training of several ML methods. Two benchmarks are formulated, each with multiple levels of gradually increasing degree of complexity. The first benchmark focuses on cement chemistry, while the second one considers uranium sorption on a clay mineral. The performance of different ML techniques is then evaluated in terms of their numerical efficiency and accuracy. A speedup of several orders of magnitude is observed. The benefits and the limitations of different ML based techniques are then analysed and highlighted.