This document discusses infrastructure and tools for data scientists. It covers databases, notebooks/IDEs like Jupyter Notebook and Jupyter Lab. Popular software includes R, Python, and deep learning tools like TensorFlow. Programming languages used include R, Python, Java and Scala. Visualization tools mentioned are Tableau, Plotly and Matplotlib. Computing resources discussed are NVIDIA GPUs, Intel CPUs, and cloud services from Amazon, Azure and Google. Docker and HPC clusters are covered. The presenter's company, FlyElephant, provides a platform for data science projects across computing resources and clouds.