The objective of this training course is to increase scientists’ expertise on scientific data analysis at scale applied to all HiDALGO domains, using high-performance data analytics tools available from the open source market. The training covers from simple analytics tasks to workflows and applications (e.g., Python-based) and provides best practices and guidelines on dealing with massive scientific datasets on HPC architectures. The training foresees hands-on exercises carried out through Jupyter Notebooks.