This document introduces basic data analytics tools. It discusses the data analytics pipeline of collecting, refining, storing, analyzing, and presenting data. It describes tools for each step including Requests and BeautifulSoup for data acquisition, Pandas and SQLAlchemy for data processing and storage, R and RStudio for data analysis, and Plotly and Matplotlib for data visualization. Apache Superset is highlighted as a tool for data visualization and exploration. Challenges of data analytics like data quality, privacy, and scaling are also outlined.
Related topics: