In One Minute : Pandas

Pandas is a Python library for PANel DAta manipulation and analysis, example: multidimensional time series and cross-sectional data sets commonly found in statistics, experimental science results, econometrics, or finance.
Pandas is implemented primar…


This content originally appeared on DEV Community 👩‍💻👨‍💻 and was authored by Rakesh KR

Pandas is a Python library for PANel DAta manipulation and analysis, example: multidimensional time series and cross-sectional data sets commonly found in statistics, experimental science results, econometrics, or finance.
Pandas is implemented primarily using NumPy and Cython; it is intended to be able to integrate very easily with NumPy-based scientific libraries, such as statsmodels.

Pandas is one of the main data science libraries in Python.

Pandas allows importing data from various file formats such as comma-separated values, JSON, Parquet, SQL database tables or queries, and Microsoft Excel.
Pandas allows various data manipulation operations such as merging, reshaping, selecting, as well as data cleaning, and data wrangling features.

Main Features:

  • Data structures: for one- and two-dimensional labeled datasets (respectively Series and DataFrames). Some of their main features include:
  1. Automatically aligning data and interpolation
  2. Handling missing observations in calculations
  3. Convenient slicing and reshaping ("reindexing") functions
  4. Categorical data types
  5. Provide 'group by' aggregation or transformation functionality
  6. Tools for merging and joining together data sets
  7. Simple Matplotlib integration for plotting and graphing
  8. Multi-Indexing providing structure to indices that allow for representation of an arbitrary number of dimensions.
  • Date tools: objects for expressing date offsets or generating date ranges. Dates can be aligned to a specific time zone and converted or compared at will
  • Statistical models: convenient ordinary least squares and panel OLS implementations for in-sample or rolling time series and cross-sectional regressions. These will hopefully be the starting point for implementing models
  • Intelligent Cython offloading; complex computations are performed rapidly due to these optimizations.
  • Static and moving statistical tools: mean, standard deviation, correlation, and covariance
  • Rich User Documentation, using Sphinx

Resources and Tutorials:

Books:


This content originally appeared on DEV Community 👩‍💻👨‍💻 and was authored by Rakesh KR


Print Share Comment Cite Upload Translate Updates
APA

Rakesh KR | Sciencx (2022-11-05T10:30:00+00:00) In One Minute : Pandas. Retrieved from https://www.scien.cx/2022/11/05/in-one-minute-pandas/

MLA
" » In One Minute : Pandas." Rakesh KR | Sciencx - Saturday November 5, 2022, https://www.scien.cx/2022/11/05/in-one-minute-pandas/
HARVARD
Rakesh KR | Sciencx Saturday November 5, 2022 » In One Minute : Pandas., viewed ,<https://www.scien.cx/2022/11/05/in-one-minute-pandas/>
VANCOUVER
Rakesh KR | Sciencx - » In One Minute : Pandas. [Internet]. [Accessed ]. Available from: https://www.scien.cx/2022/11/05/in-one-minute-pandas/
CHICAGO
" » In One Minute : Pandas." Rakesh KR | Sciencx - Accessed . https://www.scien.cx/2022/11/05/in-one-minute-pandas/
IEEE
" » In One Minute : Pandas." Rakesh KR | Sciencx [Online]. Available: https://www.scien.cx/2022/11/05/in-one-minute-pandas/. [Accessed: ]
rf:citation
» In One Minute : Pandas | Rakesh KR | Sciencx | https://www.scien.cx/2022/11/05/in-one-minute-pandas/ |

Please log in to upload a file.




There are no updates yet.
Click the Upload button above to add an update.

You must be logged in to translate posts. Please log in or register.