Tutorial Example

A Beginner Guide to Python Pandas Read CSV – Python Pandas Tutorial

Python pandas is a good library to read csv file. We will introduce how to read csv data in this tutorial for python beginners. You can learn how to do by following our tutorial.

Install pandas

You can use pip to install pandas library first.

pip install pandas

Analysis csv file

You should notice the header and separation character of a csv file. Here is an example.

Read data from a csv file using python pandas

Create a csv file and write some data. You can use code below to read csv file using pandas.

import pandas as pd

file = r'data/601988.csv'

csv = pd.read_csv(file, sep=',', encoding='gbk')
print(csv)

Run this code, you will find csv is a DataFrame object. All data is contained in csv.

<class 'pandas.core.frame.DataFrame'>

If you have encountered UnicodeDecodeError, you can read this tutorial to fix.

Fix Python Pandas Read CSV File: UnicodeDecodeError

Get the data size

How much data is containd in this csv file? To get this size, you can use code below.

size = csv.shape
print(size)

The size is: (197, 15), which means this csv file contains 197 rows * 15 columns data.

Get the header name of a csv file

The header name of a csv file is important, which can help us to read data by column.

columns = csv.columns.values
print(columns)

The header name of this csv file is:

['日期' '股票代码' '名称' '收盘价' '最高价' '最低价' '开盘价' '前收盘' '涨跌额' '涨跌幅' '换手率' '成交量'
 '成交金额' '总市值' '流通市值']

Read data by column

We can read csv data by column name, here is an example.

print(csv['收盘价'].values)

Read data by row

We also can read csv data by row number. Here is an example.

print(csv.iloc[0].values)

csv.iloc[0] means we will read the first row data. The result is:

['2020-03-23' "'601988" '中国银行' 3.44 3.48 3.43 3.48 3.5 -0.06 -1.7143 0.039
 82294152 283847805.0 1012694001870.0 725033371070.0]

If you plan to read second or third row data, you can do like this:

print(csv.iloc[1].values)
print(csv.iloc[2].values)