Tag: pandas

iPython Notebooks

/ /

From GitHub: “This repo contains various iPython notebooks I’ve created to experiment with libraries and work through exercises, and explore subjects that I find interesting.” The notebooks include: Popular Python data science libraries NumPy SciPy Matplotlib Pandas Statsmodels Scikit-learn Seaborn NetworkX PyMC NLTK DEAP Genism Machine Learning Exercises Tensorflow Deep Learning Exercises Spark Big Data Labs Miscellaneous

Python for Data Science and Machine Learning Bootcamp


From Udemy: “Learn how to use NumPy, Pandas, Seaborn, Matplotlib, Plotly, Scikit-Learn, Machine Learning, Tensorflow, and more! What Will I Learn? Use Python for Data Science and Machine Learning Use Spark for Big Data Analysis Implement Machine Learning Algorithms Learn to use NumPy for Numerical Data Learn to use Pandas for Data Analysis Learn to use Matplotlib for Python Plotting Learn to use Seaborn for statistical plots Use Plotly for interactive … Continue Reading

Pandas Cheat Sheet – Python for Data Science


“It’s common when first learning pandas to have trouble remembering all the functions and methods that you need, and while we at Dataquest advocate getting used to consulting the pandas documentation, sometimes it’s nice to have a handy reference, so we’ve put together this cheat sheet to help you out!” This cheat sheet covers the following topics: Key and Imports Importing Data Exporting Data Create Test Objects Veiwing/Inspecting Data Selection … Continue Reading

Comprehensive learning path – Data Science in Python


From Analytics Vidhya: “So, you want to become a data scientist or may be you are already one and want to expand your tool repository. You have landed at the right place. The aim of this page is to provide a comprehensive learning path to people new to python for data analysis. This path provides a comprehensive overview of steps you need to learn to use Python for data analysis. … Continue Reading

Python Data Science Handbook: Essential Tools for Working with Data


From Amazon: “For many researchers, Python is a first-class tool mainly because of its libraries for storing, manipulating, and gaining insight from data. Several resources exist for individual pieces of this data science stack, but only with the Python Data Science Handbook do you get them all—IPython, NumPy, Pandas, Matplotlib, Scikit-Learn, and other related tools. Working scientists and data crunchers familiar with reading and writing Python code will find this … Continue Reading

DataCamp – Merging DataFrames with pandas


From DataCamp: “As a Data Scientist, you’ll often find that the data you need is not in a single file. It may be spread across a number of text files, spreadsheets, or databases. You want to be able to import the data of interest as a collection of DataFrames and figure out how to combine them to answer your central questions. This course is all about the act of combining, or … Continue Reading

DataCamp – Manipulating DataFrames with pandas


From DataCamp: “In this course, you’ll learn how to leverage pandas’ extremely powerful data manipulation engine to get the most out of your data. It is important to be able to extract, filter, and transform data from DataFrames in order to drill into the data that really matters. The pandas library has many techniques that make this process efficient and intuitive. You will learn how to tidy, rearrange, and restructure your … Continue Reading

DataCamp – pandas Foundations


From DataCamp: “Many real-world data sets include strings, integers, time-stamps and unstructured data. How do you store data like this so that you can manipulate it and easily retrieve important information? The answer is in a pandas DataFrame! In this course, you’ll learn how to use the industry-standard pandas library to import, build, and manipulate DataFrames. With pandas, you’ll always be able to convert your data into a form that permits … Continue Reading

Data Analysis with Pandas and Python


From Udemy: “The most comprehensive Pandas course available on Udemy! An excellent choice for both beginners and experts looking to refresh their technical knowledge on one of the most popular Python libraries in the world! Data Analysis with Pandas and Python offers 19+ hours of in-depth tutorials on one of the most powerful data analysis toolkits available today. Lessons include: installing sorting filtering grouping aggregating de-duplicating pivoting munging deleting merging text cleaning visualizing If you’ve spent time in … Continue Reading

DataCamp – Intermediate Python for Data Science


From DataCamp: “The intermediate python course is crucial to your data science curriculum. Learn to visualize real data with matplotlib’s functions and get to know new data structures such as the dictionary and the Pandas DataFrame. After covering key concepts such as boolean logic, control flow and loops in Python, you’re ready to blend together everything you’ve learned to solve a case study using hacker statistics.” This course consists of 5 … Continue Reading