当前位置:网站首页>pycaret源码分析:下载数据集\Lib\site-packages\pycaret\datasets.py
pycaret源码分析:下载数据集\Lib\site-packages\pycaret\datasets.py
2022-08-01 00:00:00 【人工智能曾小健】

def get_data(
dataset="index",
save_copy=False,
profile=False,
verbose=True,
address="https://raw.githubusercontent.com/pycaret/pycaret/master/datasets/",
):
"""
This function loads sample datasets from git repository. List of available
datasets can be checked using ``get_data('index')``.
Example
-------
>>> from pycaret.datasets import get_data
>>> all_datasets = get_data('index')
>>> juice = get_data('juice')
dataset: str, default = 'index'
Index value of dataset.
save_copy: bool, default = False
When set to true, it saves a copy in current working directory.
profile: bool, default = False
When set to true, an interactive EDA report is displayed.
verbose: bool, default = True
When set to False, head of data is not displayed.
address: string, default = "https://raw.githubusercontent.com/pycaret/pycaret/master/datasets/"
Download url of dataset. For people have difficulty linking to github, they can change
the default address to their own (e.g. "https://gitee.com/IncubatorShokuhou/pycaret/raw/master/datasets/")
Returns:
pandas.DataFrame
Warnings
--------
- Use of ``get_data`` requires internet connection.
"""
import pandas as pd
import os.path
from IPython.display import display, HTML, clear_output, update_display
extension = ".csv"
filename = str(dataset) + extension
complete_address = address + filename
if os.path.isfile(filename):
data = pd.read_csv(filename)
else:
data = pd.read_csv(complete_address)
# create a copy for pandas profiler
data_for_profiling = data.copy()
if save_copy:
save_name = filename
data.to_csv(save_name, index=False)
if dataset == "index":
display(data)
else:
if profile:
import pandas_profiling
pf = pandas_profiling.ProfileReport(data_for_profiling)
display(pf)
else:
if verbose:
display(data.head())
return data
边栏推荐
猜你喜欢

类和对象:中

UOS统信系统 - WindTerm使用

SVN server construction + SVN client + TeamCity integrated environment construction + VS2019 development
Mysql environment installation under Linux (centos)

【Acwing】第62场周赛 题解

浏览器下载快捷方式到桌面(PWA)

类和对象:上

Kyoto University:Masaki Waga | 黑箱环境中强化学习的动态屏蔽

基于单片机GSM的防火防盗系统的设计

C# Rectangle basic usage and picture cutting
随机推荐
thymeleaf iterates the map collection
新产品如何进行网络推广?
lua入门案例实战123DIY
Program processes and threads (concurrency and parallelism of threads) and basic creation and use of threads
Thinking and Implementation of Object Cache Service
The difference between /usr/local/bin and /usr/bin
MySQL数据库‘反斜杠\’ ,‘单引号‘’,‘双引号“’,‘null’无法存储
(26) About menu of the top menu of Blender source code analysis
How to import a Golang external package and use it?
内核对设备树的处理
数据分析(一)——matplotlib
10大主流3D建模技术
C# Rectangle basic usage and picture cutting
@JsonFormat(pattern="yyyy-MM-dd") time difference problem
SQL注入 Less38(堆叠注入)
日常--Kali开启SSH(详细教程)
cobaltstrike
SQL注入 Less54(限制次数的SQL注入+union注入)
力扣2326、197
开源好用的 流程图绘制工具 drawio