Pandas: Data structure and data analysis tool for Python
Hadoop: Distributed processing of large scale data
Competitions related to machine learning problems and more at Kaggle.
Learning data science: Cognitive class by IBM.