Big biomedical data is often not large enough for advanced machine learning techniques. Platforms for sharing data are problematic as each operates independently with different data formats and tools. Computation power alone cannot solve these issues. The document discusses solutions like allowing larger datasets to be assembled using standards, developing interoperable "meta-platforms", and choosing machine learning approaches wisely based on the problem and dataset. It provides examples of projects that have successfully combined datasets and used techniques like Spark clustering to better leverage available data.