Pandas Profiling funktioniert nicht

Wenn du dir nicht sicher bist, in welchem der anderen Foren du die Frage stellen sollst, dann bist du hier im Forum für allgemeine Fragen sicher richtig.
Antworten
Je1ns
User
Beiträge: 5
Registriert: Montag 21. August 2023, 19:21

Hi zusammen,
ich habe jetzt schon vieles verschiedenes ausprobiert. Aber ich bekomme pandas profiling einfach nicht unter jupyter notebook zum Laufen:(... Ich nutze jupyter notebook über anaconda. Und habe bsp. diesen Code probiert:
pip install "pydantic==1.*"

import numpy as np
!pip install pandas_profiling
import pandas as pd
from pandas_profiling import ProfileReport

Falls jemand hier helfen kann wäre das super, ich komme leider nicht voran:( Habe schon viel im Forum geschaut und andere Codes ausprobiert, bisher ohne Erfolg

Ganz lieben Dank, Jens :geek: :D

Der Error-Output:


Requirement already satisfied: pandas_profiling in c:\users\je1ns\anaconda3\lib\site-packages (3.2.0)
Requirement already satisfied: joblib~=1.1.0 in c:\users\je1ns\anaconda3\lib\site-packages (from pandas_profiling) (1.1.1)
Requirement already satisfied: scipy>=1.4.1 in c:\users\je1ns\anaconda3\lib\site-packages (from pandas_profiling) (1.10.1)
Requirement already satisfied: pandas!=1.0.0,!=1.0.1,!=1.0.2,!=1.1.0,>=0.25.3 in c:\users\je1ns\anaconda3\lib\site-packages (from pandas_profiling) (1.5.3)
Requirement already satisfied: matplotlib>=3.2.0 in c:\users\je1ns\anaconda3\lib\site-packages (from pandas_profiling) (3.7.1)
Requirement already satisfied: pydantic>=1.8.1 in c:\users\je1ns\anaconda3\lib\site-packages (from pandas_profiling) (1.10.13)
Requirement already satisfied: PyYAML>=5.0.0 in c:\users\je1ns\anaconda3\lib\site-packages (from pandas_profiling) (6.0)
Requirement already satisfied: jinja2>=2.11.1 in c:\users\je1ns\anaconda3\lib\site-packages (from pandas_profiling) (3.1.2)
Requirement already satisfied: markupsafe~=2.1.1 in c:\users\je1ns\anaconda3\lib\site-packages (from pandas_profiling) (2.1.1)
Requirement already satisfied: visions[type_image_path]==0.7.4 in c:\users\je1ns\anaconda3\lib\site-packages (from pandas_profiling) (0.7.4)
Requirement already satisfied: numpy>=1.16.0 in c:\users\je1ns\anaconda3\lib\site-packages (from pandas_profiling) (1.24.3)
Requirement already satisfied: htmlmin>=0.1.12 in c:\users\je1ns\anaconda3\lib\site-packages (from pandas_profiling) (0.1.12)
Requirement already satisfied: missingno>=0.4.2 in c:\users\je1ns\anaconda3\lib\site-packages (from pandas_profiling) (0.5.2)
Requirement already satisfied: phik>=0.11.1 in c:\users\je1ns\anaconda3\lib\site-packages (from pandas_profiling) (0.12.3)
Requirement already satisfied: tangled-up-in-unicode==0.2.0 in c:\users\je1ns\anaconda3\lib\site-packages (from pandas_profiling) (0.2.0)
Requirement already satisfied: requests>=2.24.0 in c:\users\je1ns\anaconda3\lib\site-packages (from pandas_profiling) (2.31.0)
Requirement already satisfied: tqdm>=4.48.2 in c:\users\je1ns\anaconda3\lib\site-packages (from pandas_profiling) (4.65.0)
Requirement already satisfied: seaborn>=0.10.1 in c:\users\je1ns\anaconda3\lib\site-packages (from pandas_profiling) (0.12.2)
Requirement already satisfied: multimethod>=1.4 in c:\users\je1ns\anaconda3\lib\site-packages (from pandas_profiling) (1.10)
Requirement already satisfied: attrs>=19.3.0 in c:\users\je1ns\anaconda3\lib\site-packages (from visions[type_image_path]==0.7.4->pandas_profiling) (22.1.0)
Requirement already satisfied: networkx>=2.4 in c:\users\je1ns\anaconda3\lib\site-packages (from visions[type_image_path]==0.7.4->pandas_profiling) (3.1)
Requirement already satisfied: imagehash in c:\users\je1ns\anaconda3\lib\site-packages (from visions[type_image_path]==0.7.4->pandas_profiling) (4.3.1)
Requirement already satisfied: Pillow in c:\users\je1ns\anaconda3\lib\site-packages (from visions[type_image_path]==0.7.4->pandas_profiling) (9.4.0)
Requirement already satisfied: contourpy>=1.0.1 in c:\users\je1ns\anaconda3\lib\site-packages (from matplotlib>=3.2.0->pandas_profiling) (1.0.5)
Requirement already satisfied: cycler>=0.10 in c:\users\je1ns\anaconda3\lib\site-packages (from matplotlib>=3.2.0->pandas_profiling) (0.11.0)
Requirement already satisfied: fonttools>=4.22.0 in c:\users\je1ns\anaconda3\lib\site-packages (from matplotlib>=3.2.0->pandas_profiling) (4.25.0)
Requirement already satisfied: kiwisolver>=1.0.1 in c:\users\je1ns\anaconda3\lib\site-packages (from matplotlib>=3.2.0->pandas_profiling) (1.4.4)
Requirement already satisfied: packaging>=20.0 in c:\users\je1ns\anaconda3\lib\site-packages (from matplotlib>=3.2.0->pandas_profiling) (23.0)
Requirement already satisfied: pyparsing>=2.3.1 in c:\users\je1ns\anaconda3\lib\site-packages (from matplotlib>=3.2.0->pandas_profiling) (3.0.9)
Requirement already satisfied: python-dateutil>=2.7 in c:\users\je1ns\anaconda3\lib\site-packages (from matplotlib>=3.2.0->pandas_profiling) (2.8.2)
Requirement already satisfied: pytz>=2020.1 in c:\users\je1ns\anaconda3\lib\site-packages (from pandas!=1.0.0,!=1.0.1,!=1.0.2,!=1.1.0,>=0.25.3->pandas_profiling) (2022.7)
Requirement already satisfied: typing-extensions>=4.2.0 in c:\users\je1ns\anaconda3\lib\site-packages (from pydantic>=1.8.1->pandas_profiling) (4.7.1)
Requirement already satisfied: charset-normalizer<4,>=2 in c:\users\je1ns\anaconda3\lib\site-packages (from requests>=2.24.0->pandas_profiling) (2.0.4)
Requirement already satisfied: idna<4,>=2.5 in c:\users\je1ns\anaconda3\lib\site-packages (from requests>=2.24.0->pandas_profiling) (3.4)
Requirement already satisfied: urllib3<3,>=1.21.1 in c:\users\je1ns\anaconda3\lib\site-packages (from requests>=2.24.0->pandas_profiling) (1.26.16)
Requirement already satisfied: certifi>=2017.4.17 in c:\users\je1ns\anaconda3\lib\site-packages (from requests>=2.24.0->pandas_profiling) (2023.7.22)
Requirement already satisfied: colorama in c:\users\je1ns\anaconda3\lib\site-packages (from tqdm>=4.48.2->pandas_profiling) (0.4.6)
Requirement already satisfied: six>=1.5 in c:\users\je1ns\anaconda3\lib\site-packages (from python-dateutil>=2.7->matplotlib>=3.2.0->pandas_profiling) (1.16.0)
Requirement already satisfied: PyWavelets in c:\users\je1ns\anaconda3\lib\site-packages (from imagehash->visions[type_image_path]==0.7.4->pandas_profiling) (1.4.1)
---------------------------------------------------------------------------
ImportError Traceback (most recent call last)
Cell In[6], line 4
2 get_ipython().system('pip install pandas_profiling')
3 import pandas as pd
----> 4 from pandas_profiling import ProfileReport

File ~\anaconda3\Lib\site-packages\pandas_profiling\__init__.py:6
1 """Main module of pandas-profiling.
2
3 .. include:: ../../README.md
4 """
----> 6 from pandas_profiling.controller import pandas_decorator
7 from pandas_profiling.profile_report import ProfileReport
8 from pandas_profiling.version import __version__

File ~\anaconda3\Lib\site-packages\pandas_profiling\controller\pandas_decorator.py:4
1 """This file add the decorator on the DataFrame object."""
2 from pandas import DataFrame
----> 4 from pandas_profiling.profile_report import ProfileReport
7 def profile_report(df: DataFrame, **kwargs) -> ProfileReport:
8 """Profile a DataFrame.
9
10 Args:
(...)
15 A ProfileReport of the DataFrame.
16 """

File ~\anaconda3\Lib\site-packages\pandas_profiling\profile_report.py:15
13 from pandas_profiling.config import Config, Settings
14 from pandas_profiling.expectations_report import ExpectationsReport
---> 15 from pandas_profiling.model.alerts import AlertType
16 from pandas_profiling.model.describe import describe as describe_df
17 from pandas_profiling.model.sample import Sample

File ~\anaconda3\Lib\site-packages\pandas_profiling\model\alerts.py:10
7 import pandas as pd
9 from pandas_profiling.config import Settings
---> 10 from pandas_profiling.model.correlations import perform_check_correlation
13 @unique
14 class AlertType(Enum):
15 """Alert types"""

File ~\anaconda3\Lib\site-packages\pandas_profiling\model\correlations.py:8
6 import pandas as pd
7 from multimethod import multimethod
----> 8 from pandas.core.base import DataError
10 from pandas_profiling.config import Settings
13 class Correlation:

ImportError: cannot import name 'DataError' from 'pandas.core.base' (C:\Users\...)
Sirius3
User
Beiträge: 18226
Registriert: Sonntag 21. Oktober 2012, 17:20

pandas_profiling wird nicht mehr gewartet: https://pypi.org/project/pandas-profiling/
Statt dessen soll man laut Anleitung ydata_profiling benutzen.
Je1ns
User
Beiträge: 5
Registriert: Montag 21. August 2023, 19:21

Sirius3 hat geschrieben: Sonntag 1. Oktober 2023, 12:30 pandas_profiling wird nicht mehr gewartet: https://pypi.org/project/pandas-profiling/
Statt dessen soll man laut Anleitung ydata_profiling benutzen.
Hi, danke für den Hinweis interessant... ich habe probiert ydata zu installieren- leider auch ohne Erfolg. Ich habe folgenden Code genutzt:
!jupyter nbextension enable --py widgetsnbextension

import numpy as np
import pandas as pd

pip install ydata-profiling

from ydata_profiling import ProfileReport
profile = ProfileReport(df)

Fehlermeldung:

TypeCheckError Traceback (most recent call last)
Cell In[12], line 1
----> 1 profile = ProfileReport(df)

File ~\anaconda3\Lib\site-packages\ydata_profiling\profile_report.py:59, in ProfileReport.__init__(self, df, minimal, tsmode, sortby, sensitive, explorative, dark_mode, orange_mode, sample, config_file, lazy, typeset, summarizer, config, type_schema, **kwargs)
56 _json = None
57 config: Settings
---> 59 def __init__(
60 self,
61 df: Optional[Union[pd.DataFrame, sDataFrame]] = None,
62 minimal: bool = False,
63 tsmode: bool = False,
64 sortby: Optional[str] = None,
65 sensitive: bool = False,
66 explorative: bool = False,
67 dark_mode: bool = False,
68 orange_mode: bool = False,
69 sample: Optional[dict] = None,
70 config_file: Union[Path, str] = None,
71 lazy: bool = True,
72 typeset: Optional[VisionsTypeset] = None,
73 summarizer: Optional[BaseSummarizer] = None,
74 config: Optional[Settings] = None,
75 type_schema: Optional[dict] = None,
76 **kwargs,
77 ):
78 """Generate a ProfileReport based on a pandas or spark.sql DataFrame
79
80 Config processing order (in case of duplicate entries, entries later in the order are retained):
(...)
98 **kwargs: other arguments, for valid arguments, check the default configuration file.
99 """
100 self.__validate_inputs(df, minimal, tsmode, config_file, lazy)

File ~\anaconda3\Lib\site-packages\typeguard\_functions.py:138, in check_argument_types(func_name, arguments, memo)
135 raise exc
137 try:
--> 138 check_type_internal(value, annotation, memo)
139 except TypeCheckError as exc:
140 qualname = qualified_name(value, add_class_prefix=True)

File ~\anaconda3\Lib\site-packages\typeguard\_checkers.py:759, in check_type_internal(value, annotation, memo)
757 checker = lookup_func(origin_type, args, extras)
758 if checker:
--> 759 checker(value, origin_type, args, memo)
760 return
762 if isclass(origin_type):

File ~\anaconda3\Lib\site-packages\typeguard\_checkers.py:408, in check_union(value, origin_type, args, memo)
403 errors[get_type_name(type_)] = exc
405 formatted_errors = indent(
406 "\n".join(f"{key}: {error}" for key, error in errors.items()), " "
407 )
--> 408 raise TypeCheckError(f"did not match any element in the union:\n{formatted_errors}")

TypeCheckError: argument "config_file" (None) did not match any element in the union:
pathlib.Path: is not an instance of pathlib.Path
str: is not an instance of str
Benutzeravatar
__blackjack__
User
Beiträge: 13933
Registriert: Samstag 2. Juni 2018, 10:21
Wohnort: 127.0.0.1
Kontaktdaten:

Das ist ziemlich eindeutig ein Fehler von ydata_profiling. Solltest Du dort melden, falls es nicht schon jemand getan hat.

Edit: Wurde schon gemeldet und zumindest im Repository korrigiert: https://github.com/ydataai/ydata-profiling/issues/1439
“Java is a DSL to transform big Xml documents into long exception stack traces.”
— Scott Bellware
Antworten