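[Study Notes] Datawhale Zero-Basics Introduction to NLP Competition -- Tianchi News Text Classification -- Day 2: Data Understanding and Data Analysis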

Today's Goals

  1. Learn to read the competition data with Pandas
  2. Analyze how the competition data is distributed

About Pandas

Intro to Pandas

Basic Concepts

pandas is a column-oriented data analysis API.

Primary data structures: DataFrame and Series

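  1. Series: a single column of data
  2. DataFrame: a table that contains one or more Series, each with a name
  3. After loading an entire file into a DataFrame (e.g. with pandas.read_csv), a few methods are useful for a first look:
    1. DataFrame.describe: shows summary statistics (count, mean, std, min, quartiles, max) for the numeric columns
    2. DataFrame.head: displays the first few records of a DataFrame (five by default)
    3. DataFrame.hist: plots histograms to quickly study the distribution of values in a column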
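
To make these concepts concrete, here is a minimal sketch of loading and inspecting data. The file contents are made up: I am assuming a tab-separated layout with a `label` column and a `text` column of space-separated token ids, and the `text_len` column is a helper added only for illustration. For a real file, replace the in-memory `raw` buffer with the file path.

```python
import io

import pandas as pd

# Hypothetical stand-in for a data file (an assumption, not the real data):
# tab-separated, one `label` column and one `text` column of token ids.
raw = io.StringIO(
    "label\ttext\n"
    "2\t2967 6758 339 2021\n"
    "11\t4464 486 6352\n"
    "2\t7346 4068 5074 3747 5681\n"
    "3\t3659 3370\n"
)

# Load the entire file into a DataFrame.
df = pd.read_csv(raw, sep="\t")

# Selecting a single column of a DataFrame gives a Series.
labels = df["label"]

# head: display the first few records.
print(df.head(2))

# Illustrative helper column: number of tokens in each text.
df["text_len"] = df["text"].str.split().str.len()

# describe: summary statistics for a numeric column.
stats = df["text_len"].describe()
print(stats)

# value_counts: how many records fall under each label.
label_counts = df["label"].value_counts()
print(label_counts)

# hist: plot the distribution of a column (requires matplotlib, so it is
# skipped here if matplotlib is not installed).
try:
    import matplotlib

    matplotlib.use("Agg")  # non-interactive backend, no window needed
    df["text_len"].hist(bins=3)
except ImportError:
    pass
```

On real data, the same three calls (`head`, `describe`, `hist`) give a quick view of text lengths, and `value_counts` on the label column shows whether the classes are balanced.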
--------------------------------------------Check-in divider----------------------------------------------------------
