Seite 1 von 1

Pandas Profiling funktioniert nicht

Verfasst: Sonntag 1. Oktober 2023, 11:54
von Je1ns
Hi zusammen,
ich habe jetzt schon vieles verschiedenes ausprobiert. Aber ich bekomme pandas profiling einfach nicht unter jupyter notebook zum Laufen:(... Ich nutze jupyter notebook über anaconda. Und habe bsp. diesen Code probiert:
pip install "pydantic==1.*"

import numpy as np
!pip install pandas_profiling
import pandas as pd
from pandas_profiling import ProfileReport

Falls jemand hier helfen kann wäre das super, ich komme leider nicht voran:( Habe schon viel im Forum geschaut und andere Codes ausprobiert, bisher ohne Erfolg

Ganz lieben Dank, Jens :geek: :D

Der Error-Output:


Requirement already satisfied: pandas_profiling in c:\users\je1ns\anaconda3\lib\site-packages (3.2.0)
Requirement already satisfied: joblib~=1.1.0 in c:\users\je1ns\anaconda3\lib\site-packages (from pandas_profiling) (1.1.1)
Requirement already satisfied: scipy>=1.4.1 in c:\users\je1ns\anaconda3\lib\site-packages (from pandas_profiling) (1.10.1)
Requirement already satisfied: pandas!=1.0.0,!=1.0.1,!=1.0.2,!=1.1.0,>=0.25.3 in c:\users\je1ns\anaconda3\lib\site-packages (from pandas_profiling) (1.5.3)
Requirement already satisfied: matplotlib>=3.2.0 in c:\users\je1ns\anaconda3\lib\site-packages (from pandas_profiling) (3.7.1)
Requirement already satisfied: pydantic>=1.8.1 in c:\users\je1ns\anaconda3\lib\site-packages (from pandas_profiling) (1.10.13)
Requirement already satisfied: PyYAML>=5.0.0 in c:\users\je1ns\anaconda3\lib\site-packages (from pandas_profiling) (6.0)
Requirement already satisfied: jinja2>=2.11.1 in c:\users\je1ns\anaconda3\lib\site-packages (from pandas_profiling) (3.1.2)
Requirement already satisfied: markupsafe~=2.1.1 in c:\users\je1ns\anaconda3\lib\site-packages (from pandas_profiling) (2.1.1)
Requirement already satisfied: visions[type_image_path]==0.7.4 in c:\users\je1ns\anaconda3\lib\site-packages (from pandas_profiling) (0.7.4)
Requirement already satisfied: numpy>=1.16.0 in c:\users\je1ns\anaconda3\lib\site-packages (from pandas_profiling) (1.24.3)
Requirement already satisfied: htmlmin>=0.1.12 in c:\users\je1ns\anaconda3\lib\site-packages (from pandas_profiling) (0.1.12)
Requirement already satisfied: missingno>=0.4.2 in c:\users\je1ns\anaconda3\lib\site-packages (from pandas_profiling) (0.5.2)
Requirement already satisfied: phik>=0.11.1 in c:\users\je1ns\anaconda3\lib\site-packages (from pandas_profiling) (0.12.3)
Requirement already satisfied: tangled-up-in-unicode==0.2.0 in c:\users\je1ns\anaconda3\lib\site-packages (from pandas_profiling) (0.2.0)
Requirement already satisfied: requests>=2.24.0 in c:\users\je1ns\anaconda3\lib\site-packages (from pandas_profiling) (2.31.0)
Requirement already satisfied: tqdm>=4.48.2 in c:\users\je1ns\anaconda3\lib\site-packages (from pandas_profiling) (4.65.0)
Requirement already satisfied: seaborn>=0.10.1 in c:\users\je1ns\anaconda3\lib\site-packages (from pandas_profiling) (0.12.2)
Requirement already satisfied: multimethod>=1.4 in c:\users\je1ns\anaconda3\lib\site-packages (from pandas_profiling) (1.10)
Requirement already satisfied: attrs>=19.3.0 in c:\users\je1ns\anaconda3\lib\site-packages (from visions[type_image_path]==0.7.4->pandas_profiling) (22.1.0)
Requirement already satisfied: networkx>=2.4 in c:\users\je1ns\anaconda3\lib\site-packages (from visions[type_image_path]==0.7.4->pandas_profiling) (3.1)
Requirement already satisfied: imagehash in c:\users\je1ns\anaconda3\lib\site-packages (from visions[type_image_path]==0.7.4->pandas_profiling) (4.3.1)
Requirement already satisfied: Pillow in c:\users\je1ns\anaconda3\lib\site-packages (from visions[type_image_path]==0.7.4->pandas_profiling) (9.4.0)
Requirement already satisfied: contourpy>=1.0.1 in c:\users\je1ns\anaconda3\lib\site-packages (from matplotlib>=3.2.0->pandas_profiling) (1.0.5)
Requirement already satisfied: cycler>=0.10 in c:\users\je1ns\anaconda3\lib\site-packages (from matplotlib>=3.2.0->pandas_profiling) (0.11.0)
Requirement already satisfied: fonttools>=4.22.0 in c:\users\je1ns\anaconda3\lib\site-packages (from matplotlib>=3.2.0->pandas_profiling) (4.25.0)
Requirement already satisfied: kiwisolver>=1.0.1 in c:\users\je1ns\anaconda3\lib\site-packages (from matplotlib>=3.2.0->pandas_profiling) (1.4.4)
Requirement already satisfied: packaging>=20.0 in c:\users\je1ns\anaconda3\lib\site-packages (from matplotlib>=3.2.0->pandas_profiling) (23.0)
Requirement already satisfied: pyparsing>=2.3.1 in c:\users\je1ns\anaconda3\lib\site-packages (from matplotlib>=3.2.0->pandas_profiling) (3.0.9)
Requirement already satisfied: python-dateutil>=2.7 in c:\users\je1ns\anaconda3\lib\site-packages (from matplotlib>=3.2.0->pandas_profiling) (2.8.2)
Requirement already satisfied: pytz>=2020.1 in c:\users\je1ns\anaconda3\lib\site-packages (from pandas!=1.0.0,!=1.0.1,!=1.0.2,!=1.1.0,>=0.25.3->pandas_profiling) (2022.7)
Requirement already satisfied: typing-extensions>=4.2.0 in c:\users\je1ns\anaconda3\lib\site-packages (from pydantic>=1.8.1->pandas_profiling) (4.7.1)
Requirement already satisfied: charset-normalizer<4,>=2 in c:\users\je1ns\anaconda3\lib\site-packages (from requests>=2.24.0->pandas_profiling) (2.0.4)
Requirement already satisfied: idna<4,>=2.5 in c:\users\je1ns\anaconda3\lib\site-packages (from requests>=2.24.0->pandas_profiling) (3.4)
Requirement already satisfied: urllib3<3,>=1.21.1 in c:\users\je1ns\anaconda3\lib\site-packages (from requests>=2.24.0->pandas_profiling) (1.26.16)
Requirement already satisfied: certifi>=2017.4.17 in c:\users\je1ns\anaconda3\lib\site-packages (from requests>=2.24.0->pandas_profiling) (2023.7.22)
Requirement already satisfied: colorama in c:\users\je1ns\anaconda3\lib\site-packages (from tqdm>=4.48.2->pandas_profiling) (0.4.6)
Requirement already satisfied: six>=1.5 in c:\users\je1ns\anaconda3\lib\site-packages (from python-dateutil>=2.7->matplotlib>=3.2.0->pandas_profiling) (1.16.0)
Requirement already satisfied: PyWavelets in c:\users\je1ns\anaconda3\lib\site-packages (from imagehash->visions[type_image_path]==0.7.4->pandas_profiling) (1.4.1)
---------------------------------------------------------------------------
ImportError Traceback (most recent call last)
Cell In[6], line 4
2 get_ipython().system('pip install pandas_profiling')
3 import pandas as pd
----> 4 from pandas_profiling import ProfileReport

File ~\anaconda3\Lib\site-packages\pandas_profiling\__init__.py:6
1 """Main module of pandas-profiling.
2
3 .. include:: ../../README.md
4 """
----> 6 from pandas_profiling.controller import pandas_decorator
7 from pandas_profiling.profile_report import ProfileReport
8 from pandas_profiling.version import __version__

File ~\anaconda3\Lib\site-packages\pandas_profiling\controller\pandas_decorator.py:4
1 """This file add the decorator on the DataFrame object."""
2 from pandas import DataFrame
----> 4 from pandas_profiling.profile_report import ProfileReport
7 def profile_report(df: DataFrame, **kwargs) -> ProfileReport:
8 """Profile a DataFrame.
9
10 Args:
(...)
15 A ProfileReport of the DataFrame.
16 """

File ~\anaconda3\Lib\site-packages\pandas_profiling\profile_report.py:15
13 from pandas_profiling.config import Config, Settings
14 from pandas_profiling.expectations_report import ExpectationsReport
---> 15 from pandas_profiling.model.alerts import AlertType
16 from pandas_profiling.model.describe import describe as describe_df
17 from pandas_profiling.model.sample import Sample

File ~\anaconda3\Lib\site-packages\pandas_profiling\model\alerts.py:10
7 import pandas as pd
9 from pandas_profiling.config import Settings
---> 10 from pandas_profiling.model.correlations import perform_check_correlation
13 @unique
14 class AlertType(Enum):
15 """Alert types"""

File ~\anaconda3\Lib\site-packages\pandas_profiling\model\correlations.py:8
6 import pandas as pd
7 from multimethod import multimethod
----> 8 from pandas.core.base import DataError
10 from pandas_profiling.config import Settings
13 class Correlation:

ImportError: cannot import name 'DataError' from 'pandas.core.base' (C:\Users\...)

Re: Pandas Profiling funktioniert nicht

Verfasst: Sonntag 1. Oktober 2023, 12:30
von Sirius3
pandas_profiling wird nicht mehr gewartet: https://pypi.org/project/pandas-profiling/
Statt dessen soll man laut Anleitung ydata_profiling benutzen.

Re: Pandas Profiling funktioniert nicht

Verfasst: Montag 2. Oktober 2023, 05:52
von Je1ns
Sirius3 hat geschrieben: Sonntag 1. Oktober 2023, 12:30 pandas_profiling wird nicht mehr gewartet: https://pypi.org/project/pandas-profiling/
Statt dessen soll man laut Anleitung ydata_profiling benutzen.
Hi, danke für den Hinweis interessant... ich habe probiert ydata zu installieren- leider auch ohne Erfolg. Ich habe folgenden Code genutzt:
!jupyter nbextension enable --py widgetsnbextension

import numpy as np
import pandas as pd

pip install ydata-profiling

from ydata_profiling import ProfileReport
profile = ProfileReport(df)

Fehlermeldung:

TypeCheckError Traceback (most recent call last)
Cell In[12], line 1
----> 1 profile = ProfileReport(df)

File ~\anaconda3\Lib\site-packages\ydata_profiling\profile_report.py:59, in ProfileReport.__init__(self, df, minimal, tsmode, sortby, sensitive, explorative, dark_mode, orange_mode, sample, config_file, lazy, typeset, summarizer, config, type_schema, **kwargs)
56 _json = None
57 config: Settings
---> 59 def __init__(
60 self,
61 df: Optional[Union[pd.DataFrame, sDataFrame]] = None,
62 minimal: bool = False,
63 tsmode: bool = False,
64 sortby: Optional[str] = None,
65 sensitive: bool = False,
66 explorative: bool = False,
67 dark_mode: bool = False,
68 orange_mode: bool = False,
69 sample: Optional[dict] = None,
70 config_file: Union[Path, str] = None,
71 lazy: bool = True,
72 typeset: Optional[VisionsTypeset] = None,
73 summarizer: Optional[BaseSummarizer] = None,
74 config: Optional[Settings] = None,
75 type_schema: Optional[dict] = None,
76 **kwargs,
77 ):
78 """Generate a ProfileReport based on a pandas or spark.sql DataFrame
79
80 Config processing order (in case of duplicate entries, entries later in the order are retained):
(...)
98 **kwargs: other arguments, for valid arguments, check the default configuration file.
99 """
100 self.__validate_inputs(df, minimal, tsmode, config_file, lazy)

File ~\anaconda3\Lib\site-packages\typeguard\_functions.py:138, in check_argument_types(func_name, arguments, memo)
135 raise exc
137 try:
--> 138 check_type_internal(value, annotation, memo)
139 except TypeCheckError as exc:
140 qualname = qualified_name(value, add_class_prefix=True)

File ~\anaconda3\Lib\site-packages\typeguard\_checkers.py:759, in check_type_internal(value, annotation, memo)
757 checker = lookup_func(origin_type, args, extras)
758 if checker:
--> 759 checker(value, origin_type, args, memo)
760 return
762 if isclass(origin_type):

File ~\anaconda3\Lib\site-packages\typeguard\_checkers.py:408, in check_union(value, origin_type, args, memo)
403 errors[get_type_name(type_)] = exc
405 formatted_errors = indent(
406 "\n".join(f"{key}: {error}" for key, error in errors.items()), " "
407 )
--> 408 raise TypeCheckError(f"did not match any element in the union:\n{formatted_errors}")

TypeCheckError: argument "config_file" (None) did not match any element in the union:
pathlib.Path: is not an instance of pathlib.Path
str: is not an instance of str

Re: Pandas Profiling funktioniert nicht

Verfasst: Montag 2. Oktober 2023, 08:01
von __blackjack__
Das ist ziemlich eindeutig ein Fehler von ydata_profiling. Solltest Du dort melden, falls es nicht schon jemand getan hat.

Edit: Wurde schon gemeldet und zumindest im Repository korrigiert: https://github.com/ydataai/ydata-profiling/issues/1439