{"id":15754,"date":"2022-04-23T17:41:38","date_gmt":"2022-04-23T16:41:38","guid":{"rendered":"https:\/\/complex-systems-ai.com\/?page_id=15754"},"modified":"2022-04-23T18:32:41","modified_gmt":"2022-04-23T17:32:41","slug":"analyse-de-donnees-sous-sweetviz","status":"publish","type":"page","link":"https:\/\/complex-systems-ai.com\/en\/descriptive-analysis\/data-analysis-under-sweetviz\/","title":{"rendered":"Data analysis under Sweetviz"},"content":{"rendered":"\t\t<div data-elementor-type=\"wp-page\" data-elementor-id=\"15754\" class=\"elementor elementor-15754\">\n\t\t\t\t\t\t<section class=\"elementor-section elementor-top-section elementor-element elementor-element-94caa6e elementor-section-boxed elementor-section-height-default elementor-section-height-default\" data-id=\"94caa6e\" data-element_type=\"section\" data-e-type=\"section\">\n\t\t\t\t\t\t<div class=\"elementor-container elementor-column-gap-default\">\n\t\t\t\t\t<div class=\"elementor-column elementor-col-33 elementor-top-column elementor-element elementor-element-6f3edb2\" data-id=\"6f3edb2\" data-element_type=\"column\" data-e-type=\"column\">\n\t\t\t<div class=\"elementor-widget-wrap elementor-element-populated\">\n\t\t\t\t\t\t<div class=\"elementor-element elementor-element-dec1435 elementor-align-justify elementor-widget elementor-widget-button\" data-id=\"dec1435\" data-element_type=\"widget\" data-e-type=\"widget\" data-widget_type=\"button.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t\t\t\t\t<div class=\"elementor-button-wrapper\">\n\t\t\t\t\t<a class=\"elementor-button elementor-button-link elementor-size-sm\" href=\"https:\/\/complex-systems-ai.com\/analyse-descriptive\/\">\n\t\t\t\t\t\t<span class=\"elementor-button-content-wrapper\">\n\t\t\t\t\t\t\t\t\t<span class=\"elementor-button-text\">Analyse descriptive<\/span>\n\t\t\t\t\t<\/span>\n\t\t\t\t\t<\/a>\n\t\t\t\t<\/div>\n\t\t\t\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t\t<\/div>\n\t\t<\/div>\n\t\t\t\t<div class=\"elementor-column elementor-col-33 elementor-top-column elementor-element elementor-element-cf1bf06\" data-id=\"cf1bf06\" data-element_type=\"column\" data-e-type=\"column\">\n\t\t\t<div class=\"elementor-widget-wrap elementor-element-populated\">\n\t\t\t\t\t\t<div class=\"elementor-element elementor-element-49b4cf4 elementor-align-justify elementor-widget elementor-widget-button\" data-id=\"49b4cf4\" data-element_type=\"widget\" data-e-type=\"widget\" data-widget_type=\"button.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t\t\t\t\t<div class=\"elementor-button-wrapper\">\n\t\t\t\t\t<a class=\"elementor-button elementor-button-link elementor-size-sm\" href=\"https:\/\/complex-systems-ai.com\/\">\n\t\t\t\t\t\t<span class=\"elementor-button-content-wrapper\">\n\t\t\t\t\t\t\t\t\t<span class=\"elementor-button-text\">Page d'accueil<\/span>\n\t\t\t\t\t<\/span>\n\t\t\t\t\t<\/a>\n\t\t\t\t<\/div>\n\t\t\t\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t\t<\/div>\n\t\t<\/div>\n\t\t\t\t<div class=\"elementor-column elementor-col-33 elementor-top-column elementor-element elementor-element-f1fa96a\" data-id=\"f1fa96a\" data-element_type=\"column\" data-e-type=\"column\">\n\t\t\t<div class=\"elementor-widget-wrap elementor-element-populated\">\n\t\t\t\t\t\t<div class=\"elementor-element elementor-element-8b58edb elementor-align-justify elementor-widget elementor-widget-button\" data-id=\"8b58edb\" data-element_type=\"widget\" data-e-type=\"widget\" data-widget_type=\"button.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t\t\t\t\t<div class=\"elementor-button-wrapper\">\n\t\t\t\t\t<a class=\"elementor-button elementor-button-link elementor-size-sm\" href=\"https:\/\/en.wikipedia.org\/wiki\/Descriptive_statistics\" target=\"_blank\" rel=\"noopener\">\n\t\t\t\t\t\t<span class=\"elementor-button-content-wrapper\">\n\t\t\t\t\t\t\t\t\t<span class=\"elementor-button-text\">Wiki<\/span>\n\t\t\t\t\t<\/span>\n\t\t\t\t\t<\/a>\n\t\t\t\t<\/div>\n\t\t\t\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t\t<\/div>\n\t\t<\/div>\n\t\t\t\t\t<\/div>\n\t\t<\/section>\n\t\t\t\t<section class=\"elementor-section elementor-top-section elementor-element elementor-element-851c67c elementor-section-boxed elementor-section-height-default elementor-section-height-default\" data-id=\"851c67c\" data-element_type=\"section\" data-e-type=\"section\">\n\t\t\t\t\t\t<div class=\"elementor-container elementor-column-gap-default\">\n\t\t\t\t\t<div class=\"elementor-column elementor-col-100 elementor-top-column elementor-element elementor-element-e531231\" data-id=\"e531231\" data-element_type=\"column\" data-e-type=\"column\">\n\t\t\t<div class=\"elementor-widget-wrap elementor-element-populated\">\n\t\t\t\t\t\t<div class=\"elementor-element elementor-element-85ed25c elementor-widget elementor-widget-text-editor\" data-id=\"85ed25c\" data-element_type=\"widget\" data-e-type=\"widget\" data-widget_type=\"text-editor.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t\t\t\t\t<p>L&rsquo;analyse exploratoire des donn\u00e9es (EDA) est une premi\u00e8re \u00e9tape essentielle dans la plupart des projets de science des donn\u00e9es et consiste souvent \u00e0 suivre les m\u00eames \u00e9tapes pour caract\u00e9riser un ensemble de donn\u00e9es (par exemple, trouver les types de donn\u00e9es, les informations manquantes, la distribution des valeurs, les corr\u00e9lations, etc.). L&rsquo;une des derni\u00e8res est une nouvelle biblioth\u00e8que Python open-source appel\u00e9e Sweetviz.<\/p><p><img decoding=\"async\" class=\"aligncenter wp-image-11096 size-full\" src=\"https:\/\/complex-systems-ai.com\/wp-content\/uploads\/2020\/09\/cropped-Capture.png\" alt=\"Sweetviz\" width=\"97\" height=\"97\" title=\"\"><\/p>\t\t\t\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t\t<\/div>\n\t\t<\/div>\n\t\t\t\t\t<\/div>\n\t\t<\/section>\n\t\t\t\t<section class=\"elementor-section elementor-top-section elementor-element elementor-element-9dc5a32 elementor-section-boxed elementor-section-height-default elementor-section-height-default\" data-id=\"9dc5a32\" data-element_type=\"section\" data-e-type=\"section\">\n\t\t\t\t\t\t<div class=\"elementor-container elementor-column-gap-default\">\n\t\t\t\t\t<div class=\"elementor-column elementor-col-100 elementor-top-column elementor-element elementor-element-416a338\" data-id=\"416a338\" data-element_type=\"column\" data-e-type=\"column\">\n\t\t\t<div class=\"elementor-widget-wrap elementor-element-populated\">\n\t\t\t\t\t\t<div class=\"elementor-element elementor-element-9bc5e08 elementor-widget elementor-widget-heading\" data-id=\"9bc5e08\" data-element_type=\"widget\" data-e-type=\"widget\" data-widget_type=\"heading.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t<div id=\"ez-toc-container\" class=\"ez-toc-v2_0_82_2 counter-hierarchy ez-toc-counter ez-toc-grey ez-toc-container-direction\">\n<div class=\"ez-toc-title-container\">\n<p class=\"ez-toc-title\" style=\"cursor:inherit\">Contenus<\/p>\n<span class=\"ez-toc-title-toggle\"><a href=\"#\" class=\"ez-toc-pull-right ez-toc-btn ez-toc-btn-xs ez-toc-btn-default ez-toc-toggle\" aria-label=\"Toggle Table of Content\"><span class=\"ez-toc-js-icon-con\"><span class=\"\"><span class=\"eztoc-hide\" style=\"display:none;\">Toggle<\/span><span class=\"ez-toc-icon-toggle-span\"><svg style=\"fill: #999;color:#999\" xmlns=\"http:\/\/www.w3.org\/2000\/svg\" class=\"list-377408\" width=\"20px\" height=\"20px\" viewBox=\"0 0 24 24\" fill=\"none\"><path d=\"M6 6H4v2h2V6zm14 0H8v2h12V6zM4 11h2v2H4v-2zm16 0H8v2h12v-2zM4 16h2v2H4v-2zm16 0H8v2h12v-2z\" fill=\"currentColor\"><\/path><\/svg><svg style=\"fill: #999;color:#999\" class=\"arrow-unsorted-368013\" xmlns=\"http:\/\/www.w3.org\/2000\/svg\" width=\"10px\" height=\"10px\" viewBox=\"0 0 24 24\" version=\"1.2\" baseProfile=\"tiny\"><path d=\"M18.2 9.3l-6.2-6.3-6.2 6.3c-.2.2-.3.4-.3.7s.1.5.3.7c.2.2.4.3.7.3h11c.3 0 .5-.1.7-.3.2-.2.3-.5.3-.7s-.1-.5-.3-.7zM5.8 14.7l6.2 6.3 6.2-6.3c.2-.2.3-.5.3-.7s-.1-.5-.3-.7c-.2-.2-.4-.3-.7-.3h-11c-.3 0-.5.1-.7.3-.2.2-.3.5-.3.7s.1.5.3.7z\"\/><\/svg><\/span><\/span><\/span><\/a><\/span><\/div>\n<nav><ul class='ez-toc-list ez-toc-list-level-1 ' ><li class='ez-toc-page-1 ez-toc-heading-level-2'><a class=\"ez-toc-link ez-toc-heading-1\" href=\"https:\/\/complex-systems-ai.com\/en\/descriptive-analysis\/data-analysis-under-sweetviz\/#Installation-et-lancement-de-Sweetviz\" >Installation et lancement de Sweetviz<\/a><\/li><li class='ez-toc-page-1 ez-toc-heading-level-2'><a class=\"ez-toc-link ez-toc-heading-2\" href=\"https:\/\/complex-systems-ai.com\/en\/descriptive-analysis\/data-analysis-under-sweetviz\/#Resume-global\" >R\u00e9sum\u00e9 global<\/a><\/li><li class='ez-toc-page-1 ez-toc-heading-level-2'><a class=\"ez-toc-link ez-toc-heading-3\" href=\"https:\/\/complex-systems-ai.com\/en\/descriptive-analysis\/data-analysis-under-sweetviz\/#Associations\" >Associations<\/a><\/li><li class='ez-toc-page-1 ez-toc-heading-level-2'><a class=\"ez-toc-link ez-toc-heading-4\" href=\"https:\/\/complex-systems-ai.com\/en\/descriptive-analysis\/data-analysis-under-sweetviz\/#Selection-dune-variable\" >S\u00e9lection d'une variable<\/a><\/li><li class='ez-toc-page-1 ez-toc-heading-level-2'><a class=\"ez-toc-link ez-toc-heading-5\" href=\"https:\/\/complex-systems-ai.com\/en\/descriptive-analysis\/data-analysis-under-sweetviz\/#Donnees-qualitatives-et-booleennes\" >Donn\u00e9es qualitatives et bool\u00e9ennes<\/a><\/li><li class='ez-toc-page-1 ez-toc-heading-level-2'><a class=\"ez-toc-link ez-toc-heading-6\" href=\"https:\/\/complex-systems-ai.com\/en\/descriptive-analysis\/data-analysis-under-sweetviz\/#Donnees-quantitatives\" >Donn\u00e9es quantitatives<\/a><\/li><li class='ez-toc-page-1 ez-toc-heading-level-2'><a class=\"ez-toc-link ez-toc-heading-7\" href=\"https:\/\/complex-systems-ai.com\/en\/descriptive-analysis\/data-analysis-under-sweetviz\/#Details-dune-variable-quantitative\" >D\u00e9tails d'une variable quantitative<\/a><\/li><li class='ez-toc-page-1 ez-toc-heading-level-2'><a class=\"ez-toc-link ez-toc-heading-8\" href=\"https:\/\/complex-systems-ai.com\/en\/descriptive-analysis\/data-analysis-under-sweetviz\/#Donnees-textuelles\" >Donn\u00e9es textuelles<\/a><\/li><li class='ez-toc-page-1 ez-toc-heading-level-2'><a class=\"ez-toc-link ez-toc-heading-9\" href=\"https:\/\/complex-systems-ai.com\/en\/descriptive-analysis\/data-analysis-under-sweetviz\/#Comparaison-de-sous-populations\" >Comparaison de sous-populations<\/a><\/li><li class='ez-toc-page-1 ez-toc-heading-level-2'><a class=\"ez-toc-link ez-toc-heading-10\" href=\"https:\/\/complex-systems-ai.com\/en\/descriptive-analysis\/data-analysis-under-sweetviz\/#Exemple-sur-le-jeu-de-donnees-du-Titanic\" >Exemple sur le jeu de donn\u00e9es du Titanic<\/a><\/li><li class='ez-toc-page-1 ez-toc-heading-level-2'><a class=\"ez-toc-link ez-toc-heading-11\" href=\"https:\/\/complex-systems-ai.com\/en\/descriptive-analysis\/data-analysis-under-sweetviz\/#Analyse-generale\" >Analyse g\u00e9n\u00e9rale<\/a><\/li><\/ul><\/nav><\/div>\n<h2 class=\"elementor-heading-title elementor-size-default\"><span class=\"ez-toc-section\" id=\"Installation-et-lancement-de-Sweetviz\"><\/span>Installation et lancement de Sweetviz<span class=\"ez-toc-section-end\"><\/span><\/h2>\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t\t<\/div>\n\t\t<\/div>\n\t\t\t\t\t<\/div>\n\t\t<\/section>\n\t\t\t\t<section class=\"elementor-section elementor-top-section elementor-element elementor-element-c89b1f6 elementor-section-boxed elementor-section-height-default elementor-section-height-default\" data-id=\"c89b1f6\" data-element_type=\"section\" data-e-type=\"section\">\n\t\t\t\t\t\t<div class=\"elementor-container elementor-column-gap-default\">\n\t\t\t\t\t<div class=\"elementor-column elementor-col-100 elementor-top-column elementor-element elementor-element-416018d\" data-id=\"416018d\" data-element_type=\"column\" data-e-type=\"column\">\n\t\t\t<div class=\"elementor-widget-wrap elementor-element-populated\">\n\t\t\t\t\t\t<div class=\"elementor-element elementor-element-4183c79 elementor-widget elementor-widget-text-editor\" data-id=\"4183c79\" data-element_type=\"widget\" data-e-type=\"widget\" data-widget_type=\"text-editor.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t\t\t\t\t<p>Apr\u00e8s l&rsquo;installation de Sweetviz (en utilisant pip install sweetviz), chargez simplement les dataframes pandas comme vous le feriez normalement, puis appelez analyze(), compare() ou compare_intra().<\/p><pre class=\"kt ku kv kw gz wx bt wy\"><span id=\"8014\" class=\"gc wz wc jd ww b do xa xb l xc\" data-selectable-paragraph=\"\">import sweetviz<br \/>import pandas as pd<br \/>train = pd.read_csv(\"train.csv\")<br \/>test = pd.read_csv(\"test.csv\")<\/span><\/pre><p id=\"a70c\" class=\"pw-post-body-paragraph lh li jd lj b lk ll ke lm ln lo kh lp lq lr ls lt lu lv lw lx ly lz ma mb mc iw gc\" data-selectable-paragraph=\"\">Nous avons maintenant 2 dataframes (train et test), et nous aimerions analyser la valeur cible \u00ab\u00a0Survived\u00a0\u00bb. Je tiens \u00e0 souligner que dans ce cas, nous connaissons \u00e0 l&rsquo;avance le nom de la colonne cible, mais il est toujours facultatif de sp\u00e9cifier une colonne cible. Nous pouvons g\u00e9n\u00e9rer un rapport avec cette ligne de code\u00a0:<\/p><pre class=\"kt ku kv kw gz wx bt wy\"><span id=\"c004\" class=\"gc wz wc jd ww b do xa xb l xc\" data-selectable-paragraph=\"\">my_report = sweetviz.compare([train, \"Train\"], [test, \"Test\"], \"Survived\")<\/span><\/pre><p id=\"cc82\" class=\"pw-post-body-paragraph lh li jd lj b lk ll ke lm ln lo kh lp lq lr ls lt lu lv lw lx ly lz ma mb mc iw gc\" data-selectable-paragraph=\"\">L&rsquo;ex\u00e9cution de cette commande effectuera l&rsquo;analyse et cr\u00e9era l&rsquo;objet de rapport. Pour obtenir le r\u00e9sultat, utilisez simplement la commande\u00a0show_html()\u00a0:<\/p><pre class=\"kt ku kv kw gz wx bt wy\"><span id=\"8652\" class=\"gc wz wc jd ww b do xa xb l xc\" data-selectable-paragraph=\"\">my_report.show_html(\"Report.html\") # Not providing a filename will default to SWEETVIZ_REPORT.html<\/span><\/pre>\t\t\t\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t\t<\/div>\n\t\t<\/div>\n\t\t\t\t\t<\/div>\n\t\t<\/section>\n\t\t\t\t<section class=\"elementor-section elementor-top-section elementor-element elementor-element-8f55724 elementor-section-boxed elementor-section-height-default elementor-section-height-default\" data-id=\"8f55724\" data-element_type=\"section\" data-e-type=\"section\">\n\t\t\t\t\t\t<div class=\"elementor-container elementor-column-gap-default\">\n\t\t\t\t\t<div class=\"elementor-column elementor-col-100 elementor-top-column elementor-element elementor-element-12a9901\" data-id=\"12a9901\" data-element_type=\"column\" data-e-type=\"column\">\n\t\t\t<div class=\"elementor-widget-wrap elementor-element-populated\">\n\t\t\t\t\t\t<div class=\"elementor-element elementor-element-6b2d47e elementor-widget elementor-widget-heading\" data-id=\"6b2d47e\" data-element_type=\"widget\" data-e-type=\"widget\" data-widget_type=\"heading.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t<h2 class=\"elementor-heading-title elementor-size-default\"><span class=\"ez-toc-section\" id=\"Resume-global\"><\/span>R\u00e9sum\u00e9 global<span class=\"ez-toc-section-end\"><\/span><\/h2>\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t\t<\/div>\n\t\t<\/div>\n\t\t\t\t\t<\/div>\n\t\t<\/section>\n\t\t\t\t<section class=\"elementor-section elementor-top-section elementor-element elementor-element-8ab5853 elementor-section-boxed elementor-section-height-default elementor-section-height-default\" data-id=\"8ab5853\" data-element_type=\"section\" data-e-type=\"section\">\n\t\t\t\t\t\t<div class=\"elementor-container elementor-column-gap-default\">\n\t\t\t\t\t<div class=\"elementor-column elementor-col-100 elementor-top-column elementor-element elementor-element-1d7d29d\" data-id=\"1d7d29d\" data-element_type=\"column\" data-e-type=\"column\">\n\t\t\t<div class=\"elementor-widget-wrap elementor-element-populated\">\n\t\t\t\t\t\t<div class=\"elementor-element elementor-element-1f405f1 elementor-widget elementor-widget-text-editor\" data-id=\"1f405f1\" data-element_type=\"widget\" data-e-type=\"widget\" data-widget_type=\"text-editor.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t\t\t\t\t<p>Le r\u00e9sum\u00e9 nous montre les caract\u00e9ristiques des deux dataframes c\u00f4te \u00e0 c\u00f4te. Nous pouvons imm\u00e9diatement identifier que l&rsquo;ensemble de test est \u00e0 peu pr\u00e8s la moiti\u00e9 de la taille de l&rsquo;ensemble d&rsquo;apprentissage, mais qu&rsquo;il contient les m\u00eames fonctionnalit\u00e9s. Cette l\u00e9gende en bas nous montre que l&rsquo;ensemble d&rsquo;apprentissage contient la variable cible \u00ab\u00a0Survived\u00a0\u00bb, mais que l&rsquo;ensemble de test ne le fait pas.<\/p><p>Notez que Sweetviz fera une meilleure estimation pour d\u00e9terminer le type de donn\u00e9es de chaque colonne, entre num\u00e9rique, cat\u00e9gorie\/bool\u00e9en et texte.\u00a0<\/p><p><img fetchpriority=\"high\" decoding=\"async\" class=\"aligncenter wp-image-15758 size-full\" src=\"https:\/\/complex-systems-ai.com\/wp-content\/uploads\/2022\/04\/1_KY0T3_vqAI3sClqY0W9RQg.png\" alt=\"\" width=\"583\" height=\"191\" title=\"\" srcset=\"https:\/\/complex-systems-ai.com\/wp-content\/uploads\/2022\/04\/1_KY0T3_vqAI3sClqY0W9RQg.png 583w, https:\/\/complex-systems-ai.com\/wp-content\/uploads\/2022\/04\/1_KY0T3_vqAI3sClqY0W9RQg-300x98.png 300w, https:\/\/complex-systems-ai.com\/wp-content\/uploads\/2022\/04\/1_KY0T3_vqAI3sClqY0W9RQg-18x6.png 18w\" sizes=\"(max-width: 583px) 100vw, 583px\" \/><\/p>\t\t\t\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t\t<\/div>\n\t\t<\/div>\n\t\t\t\t\t<\/div>\n\t\t<\/section>\n\t\t\t\t<section class=\"elementor-section elementor-top-section elementor-element elementor-element-d4ce914 elementor-section-boxed elementor-section-height-default elementor-section-height-default\" data-id=\"d4ce914\" data-element_type=\"section\" data-e-type=\"section\">\n\t\t\t\t\t\t<div class=\"elementor-container elementor-column-gap-default\">\n\t\t\t\t\t<div class=\"elementor-column elementor-col-100 elementor-top-column elementor-element elementor-element-97b9743\" data-id=\"97b9743\" data-element_type=\"column\" data-e-type=\"column\">\n\t\t\t<div class=\"elementor-widget-wrap elementor-element-populated\">\n\t\t\t\t\t\t<div class=\"elementor-element elementor-element-e60a834 elementor-widget elementor-widget-heading\" data-id=\"e60a834\" data-element_type=\"widget\" data-e-type=\"widget\" data-widget_type=\"heading.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t<h2 class=\"elementor-heading-title elementor-size-default\"><span class=\"ez-toc-section\" id=\"Associations\"><\/span>Associations<span class=\"ez-toc-section-end\"><\/span><\/h2>\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t\t<\/div>\n\t\t<\/div>\n\t\t\t\t\t<\/div>\n\t\t<\/section>\n\t\t\t\t<section class=\"elementor-section elementor-top-section elementor-element elementor-element-9161cd2 elementor-section-boxed elementor-section-height-default elementor-section-height-default\" data-id=\"9161cd2\" data-element_type=\"section\" data-e-type=\"section\">\n\t\t\t\t\t\t<div class=\"elementor-container elementor-column-gap-default\">\n\t\t\t\t\t<div class=\"elementor-column elementor-col-100 elementor-top-column elementor-element elementor-element-e47c6f2\" data-id=\"e47c6f2\" data-element_type=\"column\" data-e-type=\"column\">\n\t\t\t<div class=\"elementor-widget-wrap elementor-element-populated\">\n\t\t\t\t\t\t<div class=\"elementor-element elementor-element-5f69ee5 elementor-widget elementor-widget-text-editor\" data-id=\"5f69ee5\" data-element_type=\"widget\" data-e-type=\"widget\" data-widget_type=\"text-editor.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t\t\t\t\t<p>Ce graphique est un composite des \u00e9l\u00e9ments visuels de Drazen Zaric\u00a0: Better Heatmaps and Correlation Matrix Plots in Python et des concepts de Shaked Zychlinski\u00a0: The Search for Categorical Correlation.<\/p><p>Fondamentalement, en plus de montrer les corr\u00e9lations num\u00e9riques traditionnelles, il unifie dans un seul graphique \u00e0 la fois la <a href=\"https:\/\/complex-systems-ai.com\/en\/correlation-and-regressions\/\">corr\u00e9lation<\/a> num\u00e9rique mais aussi le coefficient d&rsquo;incertitude (pour cat\u00e9goriel-cat\u00e9goriel) et le rapport de corr\u00e9lation (pour cat\u00e9goriel-num\u00e9rique). Les carr\u00e9s repr\u00e9sentent les variables li\u00e9es aux caract\u00e9ristiques cat\u00e9gorielles et les cercles repr\u00e9sentent les corr\u00e9lations num\u00e9riques-num\u00e9riques. Notez que la diagonale triviale est laiss\u00e9e vide, pour plus de clart\u00e9.<\/p><p>Les associations cat\u00e9gorielles-cat\u00e9gorielles (fournies par le coefficient d&rsquo;incertitude) sont ASSYMETRIQUES, ce qui signifie que chaque ligne repr\u00e9sente \u00e0 quel point le titre de la ligne (\u00e0 gauche) donne des informations sur chaque colonne. Par exemple, \u00ab Sex \u00bb, \u00ab Pclass \u00bb et \u00ab Fare \u00bb sont les \u00e9l\u00e9ments qui donnent le plus d&rsquo;informations sur \u00ab Survived \u00bb. Pour le jeu de donn\u00e9es Titanic, cette information est plut\u00f4t sym\u00e9trique mais ce n&rsquo;est pas toujours le cas.<\/p><p>Enfin, il convient de noter ces m\u00e9thodes de corr\u00e9lation\/association<br \/>ne doivent pas \u00eatre pris comme un \u00e9vangile car ils font des hypoth\u00e8ses sur la distribution sous-jacente des donn\u00e9es et des relations. Cependant, ils peuvent \u00eatre un point de d\u00e9part tr\u00e8s utile.<\/p><p><img decoding=\"async\" class=\"aligncenter wp-image-15760 size-full\" src=\"https:\/\/complex-systems-ai.com\/wp-content\/uploads\/2022\/04\/1_RUFIJ9SGBjrEVzFuGyXFJA.png\" alt=\"\" width=\"700\" height=\"730\" title=\"\" srcset=\"https:\/\/complex-systems-ai.com\/wp-content\/uploads\/2022\/04\/1_RUFIJ9SGBjrEVzFuGyXFJA.png 700w, https:\/\/complex-systems-ai.com\/wp-content\/uploads\/2022\/04\/1_RUFIJ9SGBjrEVzFuGyXFJA-288x300.png 288w, https:\/\/complex-systems-ai.com\/wp-content\/uploads\/2022\/04\/1_RUFIJ9SGBjrEVzFuGyXFJA-12x12.png 12w, https:\/\/complex-systems-ai.com\/wp-content\/uploads\/2022\/04\/1_RUFIJ9SGBjrEVzFuGyXFJA-600x626.png 600w\" sizes=\"(max-width: 700px) 100vw, 700px\" \/><\/p>\t\t\t\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t\t<\/div>\n\t\t<\/div>\n\t\t\t\t\t<\/div>\n\t\t<\/section>\n\t\t\t\t<section class=\"elementor-section elementor-top-section elementor-element elementor-element-ccccdad elementor-section-boxed elementor-section-height-default elementor-section-height-default\" data-id=\"ccccdad\" data-element_type=\"section\" data-e-type=\"section\">\n\t\t\t\t\t\t<div class=\"elementor-container elementor-column-gap-default\">\n\t\t\t\t\t<div class=\"elementor-column elementor-col-100 elementor-top-column elementor-element elementor-element-ec14fd2\" data-id=\"ec14fd2\" data-element_type=\"column\" data-e-type=\"column\">\n\t\t\t<div class=\"elementor-widget-wrap elementor-element-populated\">\n\t\t\t\t\t\t<div class=\"elementor-element elementor-element-d09c71d elementor-widget elementor-widget-heading\" data-id=\"d09c71d\" data-element_type=\"widget\" data-e-type=\"widget\" data-widget_type=\"heading.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t<h2 class=\"elementor-heading-title elementor-size-default\"><span class=\"ez-toc-section\" id=\"Selection-dune-variable\"><\/span>S\u00e9lection d'une variable<span class=\"ez-toc-section-end\"><\/span><\/h2>\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t\t<\/div>\n\t\t<\/div>\n\t\t\t\t\t<\/div>\n\t\t<\/section>\n\t\t\t\t<section class=\"elementor-section elementor-top-section elementor-element elementor-element-c3b00cd elementor-section-boxed elementor-section-height-default elementor-section-height-default\" data-id=\"c3b00cd\" data-element_type=\"section\" data-e-type=\"section\">\n\t\t\t\t\t\t<div class=\"elementor-container elementor-column-gap-default\">\n\t\t\t\t\t<div class=\"elementor-column elementor-col-100 elementor-top-column elementor-element elementor-element-82a8587\" data-id=\"82a8587\" data-element_type=\"column\" data-e-type=\"column\">\n\t\t\t<div class=\"elementor-widget-wrap elementor-element-populated\">\n\t\t\t\t\t\t<div class=\"elementor-element elementor-element-d57d9d2 elementor-widget elementor-widget-text-editor\" data-id=\"d57d9d2\" data-element_type=\"widget\" data-e-type=\"widget\" data-widget_type=\"text-editor.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t\t\t\t\t<p>Lorsqu&rsquo;une variable cible est sp\u00e9cifi\u00e9e, elle appara\u00eetra en premier, dans une bo\u00eete noire sp\u00e9ciale. Seules les entit\u00e9s num\u00e9riques et bool\u00e9ennes peuvent \u00eatre des cibles actuellement.<\/p><p>Nous pouvons d\u00e9duire de ce r\u00e9sum\u00e9 que \u00ab\u00a0Survived\u00a0\u00bb n&rsquo;a pas de donn\u00e9es manquantes dans l&rsquo;ensemble d&rsquo;apprentissage (891, 100%), qu&rsquo;il existe 2 valeurs possibles distinctes (repr\u00e9sentant moins de 1% de toutes les valeurs), et \u00e0 partir du graphique, il peut On estime qu&rsquo;environ 60 % n&rsquo;ont pas surv\u00e9cu.<\/p>\t\t\t\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t\t<\/div>\n\t\t<\/div>\n\t\t\t\t\t<\/div>\n\t\t<\/section>\n\t\t\t\t<section class=\"elementor-section elementor-top-section elementor-element elementor-element-a09d4e5 elementor-section-boxed elementor-section-height-default elementor-section-height-default\" data-id=\"a09d4e5\" data-element_type=\"section\" data-e-type=\"section\">\n\t\t\t\t\t\t<div class=\"elementor-container elementor-column-gap-default\">\n\t\t\t\t\t<div class=\"elementor-column elementor-col-100 elementor-top-column elementor-element elementor-element-ba02d48\" data-id=\"ba02d48\" data-element_type=\"column\" data-e-type=\"column\">\n\t\t\t<div class=\"elementor-widget-wrap elementor-element-populated\">\n\t\t\t\t\t\t<div class=\"elementor-element elementor-element-cbd0d1a elementor-widget elementor-widget-heading\" data-id=\"cbd0d1a\" data-element_type=\"widget\" data-e-type=\"widget\" data-widget_type=\"heading.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t<h2 class=\"elementor-heading-title elementor-size-default\"><span class=\"ez-toc-section\" id=\"Donnees-qualitatives-et-booleennes\"><\/span>Donn\u00e9es qualitatives et bool\u00e9ennes<span class=\"ez-toc-section-end\"><\/span><\/h2>\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t\t<\/div>\n\t\t<\/div>\n\t\t\t\t\t<\/div>\n\t\t<\/section>\n\t\t\t\t<section class=\"elementor-section elementor-top-section elementor-element elementor-element-8733361 elementor-section-boxed elementor-section-height-default elementor-section-height-default\" data-id=\"8733361\" data-element_type=\"section\" data-e-type=\"section\">\n\t\t\t\t\t\t<div class=\"elementor-container elementor-column-gap-default\">\n\t\t\t\t\t<div class=\"elementor-column elementor-col-100 elementor-top-column elementor-element elementor-element-2e0995d\" data-id=\"2e0995d\" data-element_type=\"column\" data-e-type=\"column\">\n\t\t\t<div class=\"elementor-widget-wrap elementor-element-populated\">\n\t\t\t\t\t\t<div class=\"elementor-element elementor-element-ea75308 elementor-widget elementor-widget-text-editor\" data-id=\"ea75308\" data-element_type=\"widget\" data-e-type=\"widget\" data-widget_type=\"text-editor.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t\t\t\t\t<p>Lorsque vous d\u00e9placez la souris pour survoler l&rsquo;une des variables, une zone \u00e0 droite affichera les d\u00e9tails. Le contenu des d\u00e9tails d\u00e9pend du type de variable analys\u00e9e. Dans le cas d&rsquo;une variable cat\u00e9gorielle (ou bool\u00e9enne), comme c&rsquo;est le cas avec la cible, l&rsquo;analyse est la suivante :<\/p><p><img loading=\"lazy\" decoding=\"async\" class=\"aligncenter wp-image-15761 size-full\" src=\"https:\/\/complex-systems-ai.com\/wp-content\/uploads\/2022\/04\/1_rE0a3EKtLkkQa5UXdzNTOw.png\" alt=\"\" width=\"700\" height=\"383\" title=\"\" srcset=\"https:\/\/complex-systems-ai.com\/wp-content\/uploads\/2022\/04\/1_rE0a3EKtLkkQa5UXdzNTOw.png 700w, https:\/\/complex-systems-ai.com\/wp-content\/uploads\/2022\/04\/1_rE0a3EKtLkkQa5UXdzNTOw-300x164.png 300w, https:\/\/complex-systems-ai.com\/wp-content\/uploads\/2022\/04\/1_rE0a3EKtLkkQa5UXdzNTOw-18x10.png 18w, https:\/\/complex-systems-ai.com\/wp-content\/uploads\/2022\/04\/1_rE0a3EKtLkkQa5UXdzNTOw-600x328.png 600w\" sizes=\"(max-width: 700px) 100vw, 700px\" \/><\/p><p>Ici, nous pouvons voir les statistiques exactes pour chaque classe, o\u00f9 62% n&rsquo;ont pas surv\u00e9cu et 38% ont surv\u00e9cu. Vous obtenez \u00e9galement le d\u00e9tail des associations pour chacune des autres fonctionnalit\u00e9s.<\/p>\t\t\t\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t\t<\/div>\n\t\t<\/div>\n\t\t\t\t\t<\/div>\n\t\t<\/section>\n\t\t\t\t<section class=\"elementor-section elementor-top-section elementor-element elementor-element-539599e elementor-section-boxed elementor-section-height-default elementor-section-height-default\" data-id=\"539599e\" data-element_type=\"section\" data-e-type=\"section\">\n\t\t\t\t\t\t<div class=\"elementor-container elementor-column-gap-default\">\n\t\t\t\t\t<div class=\"elementor-column elementor-col-100 elementor-top-column elementor-element elementor-element-956011f\" data-id=\"956011f\" data-element_type=\"column\" data-e-type=\"column\">\n\t\t\t<div class=\"elementor-widget-wrap elementor-element-populated\">\n\t\t\t\t\t\t<div class=\"elementor-element elementor-element-58748d3 elementor-widget elementor-widget-heading\" data-id=\"58748d3\" data-element_type=\"widget\" data-e-type=\"widget\" data-widget_type=\"heading.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t<h2 class=\"elementor-heading-title elementor-size-default\"><span class=\"ez-toc-section\" id=\"Donnees-quantitatives\"><\/span>Donn\u00e9es quantitatives<span class=\"ez-toc-section-end\"><\/span><\/h2>\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t\t<\/div>\n\t\t<\/div>\n\t\t\t\t\t<\/div>\n\t\t<\/section>\n\t\t\t\t<section class=\"elementor-section elementor-top-section elementor-element elementor-element-aafb96c elementor-section-boxed elementor-section-height-default elementor-section-height-default\" data-id=\"aafb96c\" data-element_type=\"section\" data-e-type=\"section\">\n\t\t\t\t\t\t<div class=\"elementor-container elementor-column-gap-default\">\n\t\t\t\t\t<div class=\"elementor-column elementor-col-100 elementor-top-column elementor-element elementor-element-36a507d\" data-id=\"36a507d\" data-element_type=\"column\" data-e-type=\"column\">\n\t\t\t<div class=\"elementor-widget-wrap elementor-element-populated\">\n\t\t\t\t\t\t<div class=\"elementor-element elementor-element-602c69b elementor-widget elementor-widget-text-editor\" data-id=\"602c69b\" data-element_type=\"widget\" data-e-type=\"widget\" data-widget_type=\"text-editor.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t\t\t\t\t<p>Les donn\u00e9es num\u00e9riques montrent plus d&rsquo;informations sur son r\u00e9sum\u00e9. Ici, nous pouvons voir que dans ce cas, environ 20\u00a0% des donn\u00e9es manquent (21\u00a0% dans les donn\u00e9es de test, ce qui est tr\u00e8s coh\u00e9rent).<\/p><p><img loading=\"lazy\" decoding=\"async\" class=\"aligncenter wp-image-15762 size-full\" src=\"https:\/\/complex-systems-ai.com\/wp-content\/uploads\/2022\/04\/1_TwLCAor_2ntLXVVNuQxvxg.png\" alt=\"\" width=\"700\" height=\"131\" title=\"\" srcset=\"https:\/\/complex-systems-ai.com\/wp-content\/uploads\/2022\/04\/1_TwLCAor_2ntLXVVNuQxvxg.png 700w, https:\/\/complex-systems-ai.com\/wp-content\/uploads\/2022\/04\/1_TwLCAor_2ntLXVVNuQxvxg-300x56.png 300w, https:\/\/complex-systems-ai.com\/wp-content\/uploads\/2022\/04\/1_TwLCAor_2ntLXVVNuQxvxg-18x3.png 18w, https:\/\/complex-systems-ai.com\/wp-content\/uploads\/2022\/04\/1_TwLCAor_2ntLXVVNuQxvxg-600x112.png 600w\" sizes=\"(max-width: 700px) 100vw, 700px\" \/><\/p><p>Notez que la valeur cible (\u00ab\u00a0Survived\u00a0\u00bb dans ce cas) est trac\u00e9e sous la forme d&rsquo;une ligne, juste au-dessus du graphique de distribution. Cela permet une analyse instantan\u00e9e de la distribution cible par rapport aux autres variables.<\/p><p>Fait int\u00e9ressant, nous pouvons voir sur le graphique de droite que le taux de survie est assez constant \u00e0 tous les \u00e2ges, sauf pour les plus jeunes qui ont un taux de survie plus \u00e9lev\u00e9. Il semblerait que \u00ables femmes et les enfants d&rsquo;abord\u00bb ne soient pas que des paroles.<\/p>\t\t\t\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t\t<\/div>\n\t\t<\/div>\n\t\t\t\t\t<\/div>\n\t\t<\/section>\n\t\t\t\t<section class=\"elementor-section elementor-top-section elementor-element elementor-element-492ae35 elementor-section-boxed elementor-section-height-default elementor-section-height-default\" data-id=\"492ae35\" data-element_type=\"section\" data-e-type=\"section\">\n\t\t\t\t\t\t<div class=\"elementor-container elementor-column-gap-default\">\n\t\t\t\t\t<div class=\"elementor-column elementor-col-100 elementor-top-column elementor-element elementor-element-676479c\" data-id=\"676479c\" data-element_type=\"column\" data-e-type=\"column\">\n\t\t\t<div class=\"elementor-widget-wrap elementor-element-populated\">\n\t\t\t\t\t\t<div class=\"elementor-element elementor-element-a62a731 elementor-widget elementor-widget-heading\" data-id=\"a62a731\" data-element_type=\"widget\" data-e-type=\"widget\" data-widget_type=\"heading.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t<h2 class=\"elementor-heading-title elementor-size-default\"><span class=\"ez-toc-section\" id=\"Details-dune-variable-quantitative\"><\/span>D\u00e9tails d'une variable quantitative<span class=\"ez-toc-section-end\"><\/span><\/h2>\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t\t<\/div>\n\t\t<\/div>\n\t\t\t\t\t<\/div>\n\t\t<\/section>\n\t\t\t\t<section class=\"elementor-section elementor-top-section elementor-element elementor-element-bac9a9b elementor-section-boxed elementor-section-height-default elementor-section-height-default\" data-id=\"bac9a9b\" data-element_type=\"section\" data-e-type=\"section\">\n\t\t\t\t\t\t<div class=\"elementor-container elementor-column-gap-default\">\n\t\t\t\t\t<div class=\"elementor-column elementor-col-100 elementor-top-column elementor-element elementor-element-aec49fa\" data-id=\"aec49fa\" data-element_type=\"column\" data-e-type=\"column\">\n\t\t\t<div class=\"elementor-widget-wrap elementor-element-populated\">\n\t\t\t\t\t\t<div class=\"elementor-element elementor-element-0120422 elementor-widget elementor-widget-text-editor\" data-id=\"0120422\" data-element_type=\"widget\" data-e-type=\"widget\" data-widget_type=\"text-editor.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t\t\t\t\t<p>Comme pour le type de donn\u00e9es cat\u00e9gorielles, le type de donn\u00e9es num\u00e9riques affiche des informations suppl\u00e9mentaires dans sa zone de d\u00e9tail. Il convient de noter ici les boutons en haut du graphique.<\/p><p>Ces boutons modifient le nombre de \u00ab bacs \u00bb affich\u00e9s dans le graphique. Vous pouvez s\u00e9lectionner ce qui suit : Auto, 5, 15, 30.<\/p><p>Pour acc\u00e9der \u00e0 ces boutons, vous devez \u00ab\u00a0verrouiller en place\u00a0\u00bb la fonctionnalit\u00e9 actuelle en cliquant dessus. La fonction a alors un CONTOUR ROUGE pour montrer qu&rsquo;elle est verrouill\u00e9e en place et vous pouvez acc\u00e9der \u00e0 la zone de d\u00e9tail.<\/p><p><img loading=\"lazy\" decoding=\"async\" class=\"aligncenter wp-image-15763 size-full\" src=\"https:\/\/complex-systems-ai.com\/wp-content\/uploads\/2022\/04\/1_QuHrFPt50DLlmhpG6IMgJw.png\" alt=\"\" width=\"700\" height=\"620\" title=\"\" srcset=\"https:\/\/complex-systems-ai.com\/wp-content\/uploads\/2022\/04\/1_QuHrFPt50DLlmhpG6IMgJw.png 700w, https:\/\/complex-systems-ai.com\/wp-content\/uploads\/2022\/04\/1_QuHrFPt50DLlmhpG6IMgJw-300x266.png 300w, https:\/\/complex-systems-ai.com\/wp-content\/uploads\/2022\/04\/1_QuHrFPt50DLlmhpG6IMgJw-14x12.png 14w, https:\/\/complex-systems-ai.com\/wp-content\/uploads\/2022\/04\/1_QuHrFPt50DLlmhpG6IMgJw-600x531.png 600w\" sizes=\"(max-width: 700px) 100vw, 700px\" \/><\/p>\t\t\t\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t\t<\/div>\n\t\t<\/div>\n\t\t\t\t\t<\/div>\n\t\t<\/section>\n\t\t\t\t<section class=\"elementor-section elementor-top-section elementor-element elementor-element-789adcb elementor-section-boxed elementor-section-height-default elementor-section-height-default\" data-id=\"789adcb\" data-element_type=\"section\" data-e-type=\"section\">\n\t\t\t\t\t\t<div class=\"elementor-container elementor-column-gap-default\">\n\t\t\t\t\t<div class=\"elementor-column elementor-col-100 elementor-top-column elementor-element elementor-element-c2591ee\" data-id=\"c2591ee\" data-element_type=\"column\" data-e-type=\"column\">\n\t\t\t<div class=\"elementor-widget-wrap elementor-element-populated\">\n\t\t\t\t\t\t<div class=\"elementor-element elementor-element-8d63686 elementor-widget elementor-widget-heading\" data-id=\"8d63686\" data-element_type=\"widget\" data-e-type=\"widget\" data-widget_type=\"heading.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t<h2 class=\"elementor-heading-title elementor-size-default\"><span class=\"ez-toc-section\" id=\"Donnees-textuelles\"><\/span>Donn\u00e9es textuelles<span class=\"ez-toc-section-end\"><\/span><\/h2>\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t\t<\/div>\n\t\t<\/div>\n\t\t\t\t\t<\/div>\n\t\t<\/section>\n\t\t\t\t<section class=\"elementor-section elementor-top-section elementor-element elementor-element-1b9de97 elementor-section-boxed elementor-section-height-default elementor-section-height-default\" data-id=\"1b9de97\" data-element_type=\"section\" data-e-type=\"section\">\n\t\t\t\t\t\t<div class=\"elementor-container elementor-column-gap-default\">\n\t\t\t\t\t<div class=\"elementor-column elementor-col-100 elementor-top-column elementor-element elementor-element-e93a2db\" data-id=\"e93a2db\" data-element_type=\"column\" data-e-type=\"column\">\n\t\t\t<div class=\"elementor-widget-wrap elementor-element-populated\">\n\t\t\t\t\t\t<div class=\"elementor-element elementor-element-8ce81ef elementor-widget elementor-widget-text-editor\" data-id=\"8ce81ef\" data-element_type=\"widget\" data-e-type=\"widget\" data-widget_type=\"text-editor.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t\t\t\t\t<p>Pour l&rsquo;instant, tout ce que le syst\u00e8me ne consid\u00e8re pas comme num\u00e9rique ou cat\u00e9gorique sera consid\u00e9r\u00e9 comme du \u00ab\u00a0texte\u00a0\u00bb. Les fonctionnalit\u00e9s textuelles n&rsquo;affichent actuellement que le nombre (pourcentage) sous forme de statistiques.<\/p><p><img loading=\"lazy\" decoding=\"async\" class=\"aligncenter wp-image-15764 size-full\" src=\"https:\/\/complex-systems-ai.com\/wp-content\/uploads\/2022\/04\/1_r7NFyLWyNz5LrFuzBlp51Q.png\" alt=\"\" width=\"678\" height=\"166\" title=\"\" srcset=\"https:\/\/complex-systems-ai.com\/wp-content\/uploads\/2022\/04\/1_r7NFyLWyNz5LrFuzBlp51Q.png 678w, https:\/\/complex-systems-ai.com\/wp-content\/uploads\/2022\/04\/1_r7NFyLWyNz5LrFuzBlp51Q-300x73.png 300w, https:\/\/complex-systems-ai.com\/wp-content\/uploads\/2022\/04\/1_r7NFyLWyNz5LrFuzBlp51Q-18x4.png 18w, https:\/\/complex-systems-ai.com\/wp-content\/uploads\/2022\/04\/1_r7NFyLWyNz5LrFuzBlp51Q-600x147.png 600w\" sizes=\"(max-width: 678px) 100vw, 678px\" \/><\/p>\t\t\t\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t\t<\/div>\n\t\t<\/div>\n\t\t\t\t\t<\/div>\n\t\t<\/section>\n\t\t\t\t<section class=\"elementor-section elementor-top-section elementor-element elementor-element-3824bec elementor-section-boxed elementor-section-height-default elementor-section-height-default\" data-id=\"3824bec\" data-element_type=\"section\" data-e-type=\"section\">\n\t\t\t\t\t\t<div class=\"elementor-container elementor-column-gap-default\">\n\t\t\t\t\t<div class=\"elementor-column elementor-col-100 elementor-top-column elementor-element elementor-element-e11000b\" data-id=\"e11000b\" data-element_type=\"column\" data-e-type=\"column\">\n\t\t\t<div class=\"elementor-widget-wrap elementor-element-populated\">\n\t\t\t\t\t\t<div class=\"elementor-element elementor-element-e666f07 elementor-widget elementor-widget-heading\" data-id=\"e666f07\" data-element_type=\"widget\" data-e-type=\"widget\" data-widget_type=\"heading.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t<h2 class=\"elementor-heading-title elementor-size-default\"><span class=\"ez-toc-section\" id=\"Comparaison-de-sous-populations\"><\/span>Comparaison de sous-populations<span class=\"ez-toc-section-end\"><\/span><\/h2>\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t\t<\/div>\n\t\t<\/div>\n\t\t\t\t\t<\/div>\n\t\t<\/section>\n\t\t\t\t<section class=\"elementor-section elementor-top-section elementor-element elementor-element-538f607 elementor-section-boxed elementor-section-height-default elementor-section-height-default\" data-id=\"538f607\" data-element_type=\"section\" data-e-type=\"section\">\n\t\t\t\t\t\t<div class=\"elementor-container elementor-column-gap-default\">\n\t\t\t\t\t<div class=\"elementor-column elementor-col-100 elementor-top-column elementor-element elementor-element-7f17234\" data-id=\"7f17234\" data-element_type=\"column\" data-e-type=\"column\">\n\t\t\t<div class=\"elementor-widget-wrap elementor-element-populated\">\n\t\t\t\t\t\t<div class=\"elementor-element elementor-element-7138e0b elementor-widget elementor-widget-text-editor\" data-id=\"7138e0b\" data-element_type=\"widget\" data-e-type=\"widget\" data-widget_type=\"text-editor.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t\t\t\t\t<p id=\"9c28\" class=\"pw-post-body-paragraph lh li jd lj b lk wo ke lm ln wp kh lp lq wq ls lt lu wr lw lx ly ws ma mb mc iw gc\" data-selectable-paragraph=\"\">M\u00eame si vous ne regardez qu&rsquo;un seul ensemble de donn\u00e9es, il peut \u00eatre tr\u00e8s utile d&rsquo;\u00e9tudier les caract\u00e9ristiques de diff\u00e9rentes sous-populations au sein de cet ensemble de donn\u00e9es.<\/p><p class=\"pw-post-body-paragraph lh li jd lj b lk wo ke lm ln wp kh lp lq wq ls lt lu wr lw lx ly ws ma mb mc iw gc\" data-selectable-paragraph=\"\">Pour cela, Sweetviz propose la fonction compare_intra(). Pour l&rsquo;utiliser, vous fournissez un test bool\u00e9en qui divise la population (ici, nous essayons train[\u00ab\u00a0Sex\u00a0\u00bb] == &lsquo;male&rsquo;, pour avoir une id\u00e9e des diff\u00e9rentes populations de genre), et donnez un nom \u00e0 chaque sous-population. Par example:<\/p><pre class=\"kt ku kv kw gz wx bt wy\"><span id=\"c090\" class=\"gc wz wc jd ww b do xa xb l xc\" data-selectable-paragraph=\"\">my_report = sweetviz.compare_intra(train, train[\"Sex\"] == 'male', [\"Male\", \"Female\"], 'Survived')<\/span><span id=\"9dd8\" class=\"gc wz wc jd ww b do xm xn xo xp xq xb l xc\" data-selectable-paragraph=\"\">my_report.show_html() # Not providing a filename will default to SWEETVIZ_REPORT.html<\/span><\/pre><p id=\"8377\" class=\"pw-post-body-paragraph lh li jd lj b lk ll ke lm ln lo kh lp lq lr ls lt lu lv lw lx ly lz ma mb mc iw gc\" data-selectable-paragraph=\"\">Ce qui donne l&rsquo;analyse suivante :<\/p><p data-selectable-paragraph=\"\"><img loading=\"lazy\" decoding=\"async\" class=\"aligncenter wp-image-15757 size-full\" src=\"https:\/\/complex-systems-ai.com\/wp-content\/uploads\/2022\/04\/1_7sw4PM1hXkt0QgWBJbGrkg.png\" alt=\"\" width=\"700\" height=\"487\" title=\"\" srcset=\"https:\/\/complex-systems-ai.com\/wp-content\/uploads\/2022\/04\/1_7sw4PM1hXkt0QgWBJbGrkg.png 700w, https:\/\/complex-systems-ai.com\/wp-content\/uploads\/2022\/04\/1_7sw4PM1hXkt0QgWBJbGrkg-300x209.png 300w, https:\/\/complex-systems-ai.com\/wp-content\/uploads\/2022\/04\/1_7sw4PM1hXkt0QgWBJbGrkg-18x12.png 18w, https:\/\/complex-systems-ai.com\/wp-content\/uploads\/2022\/04\/1_7sw4PM1hXkt0QgWBJbGrkg-600x417.png 600w\" sizes=\"(max-width: 700px) 100vw, 700px\" \/><\/p><p data-selectable-paragraph=\"\">Notez que la valeur cible (\u00ab\u00a0Survived\u00a0\u00bb dans ce cas) est maintenant trac\u00e9e sous forme de lignes s\u00e9par\u00e9es, une pour chaque ensemble de donn\u00e9es compar\u00e9es (par exemple, homme en bleu, femme en orange).<\/p>\t\t\t\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t\t<\/div>\n\t\t<\/div>\n\t\t\t\t\t<\/div>\n\t\t<\/section>\n\t\t\t\t<section class=\"elementor-section elementor-top-section elementor-element elementor-element-8a9e523 elementor-section-boxed elementor-section-height-default elementor-section-height-default\" data-id=\"8a9e523\" data-element_type=\"section\" data-e-type=\"section\">\n\t\t\t\t\t\t<div class=\"elementor-container elementor-column-gap-default\">\n\t\t\t\t\t<div class=\"elementor-column elementor-col-100 elementor-top-column elementor-element elementor-element-2a75cfb\" data-id=\"2a75cfb\" data-element_type=\"column\" data-e-type=\"column\">\n\t\t\t<div class=\"elementor-widget-wrap elementor-element-populated\">\n\t\t\t\t\t\t<div class=\"elementor-element elementor-element-0c767f5 elementor-widget elementor-widget-heading\" data-id=\"0c767f5\" data-element_type=\"widget\" data-e-type=\"widget\" data-widget_type=\"heading.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t<h2 class=\"elementor-heading-title elementor-size-default\"><span class=\"ez-toc-section\" id=\"Exemple-sur-le-jeu-de-donnees-du-Titanic\"><\/span>Exemple sur le jeu de donn\u00e9es du Titanic<span class=\"ez-toc-section-end\"><\/span><\/h2>\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t\t<\/div>\n\t\t<\/div>\n\t\t\t\t\t<\/div>\n\t\t<\/section>\n\t\t\t\t<section class=\"elementor-section elementor-top-section elementor-element elementor-element-d940539 elementor-section-boxed elementor-section-height-default elementor-section-height-default\" data-id=\"d940539\" data-element_type=\"section\" data-e-type=\"section\">\n\t\t\t\t\t\t<div class=\"elementor-container elementor-column-gap-default\">\n\t\t\t\t\t<div class=\"elementor-column elementor-col-100 elementor-top-column elementor-element elementor-element-90e44ca\" data-id=\"90e44ca\" data-element_type=\"column\" data-e-type=\"column\">\n\t\t\t<div class=\"elementor-widget-wrap elementor-element-populated\">\n\t\t\t\t\t\t<div class=\"elementor-element elementor-element-2cbd98c elementor-widget elementor-widget-text-editor\" data-id=\"2cbd98c\" data-element_type=\"widget\" data-e-type=\"widget\" data-widget_type=\"text-editor.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t\t\t\t\t<p id=\"7b53\" class=\"pw-post-body-paragraph lh li jd lj b lk wo ke lm ln wp kh lp lq wq ls lt lu wr lw lx ly ws ma mb mc iw gc\" data-selectable-paragraph=\"\"><strong class=\"lj je\">PassengerId<\/strong><\/p><figure class=\"kt ku kv kw gz kx gn go paragraph-image\"><div class=\"ky kz dq la cf lb\" tabindex=\"0\" role=\"button\"><div class=\"gn go xx\"><img loading=\"lazy\" decoding=\"async\" class=\"aligncenter wp-image-15765 size-full\" src=\"https:\/\/complex-systems-ai.com\/wp-content\/uploads\/2022\/04\/1_s3pAV47OATMMH_4sL9KKzw.png\" alt=\"\" width=\"700\" height=\"94\" title=\"\" srcset=\"https:\/\/complex-systems-ai.com\/wp-content\/uploads\/2022\/04\/1_s3pAV47OATMMH_4sL9KKzw.png 700w, https:\/\/complex-systems-ai.com\/wp-content\/uploads\/2022\/04\/1_s3pAV47OATMMH_4sL9KKzw-300x40.png 300w, https:\/\/complex-systems-ai.com\/wp-content\/uploads\/2022\/04\/1_s3pAV47OATMMH_4sL9KKzw-18x2.png 18w, https:\/\/complex-systems-ai.com\/wp-content\/uploads\/2022\/04\/1_s3pAV47OATMMH_4sL9KKzw-600x81.png 600w\" sizes=\"(max-width: 700px) 100vw, 700px\" \/><\/div><\/div><\/figure><ul class=\"\"><li id=\"07a7\" class=\"vn vo jd lj b lk ll ln lo lq vp lu vq ly vr mc vs vt vu vv gc\" data-selectable-paragraph=\"\">La distribution des pi\u00e8ces d&rsquo;identit\u00e9 et de la capacit\u00e9 de survie est uniforme et ordonn\u00e9e.<\/li><li id=\"af67\" class=\"vn vo jd lj b lk vw ln vx lq vy lu vz ly wa mc vs vt vu vv gc\" data-selectable-paragraph=\"\">Pas de donn\u00e9es manquantes<\/li><\/ul><p id=\"d48f\" class=\"pw-post-body-paragraph lh li jd lj b lk ll ke lm ln lo kh lp lq lr ls lt lu lv lw lx ly lz ma mb mc iw gc\" data-selectable-paragraph=\"\"><strong class=\"lj je\">Sex<\/strong><\/p><figure class=\"kt ku kv kw gz kx gn go paragraph-image\"><div class=\"ky kz dq la cf lb\" tabindex=\"0\" role=\"button\"><div class=\"gn go xy\"><img loading=\"lazy\" decoding=\"async\" class=\"aligncenter size-medium wp-image-15768\" src=\"https:\/\/complex-systems-ai.com\/wp-content\/uploads\/2022\/04\/1_RrcnMPiEuYF2RD0bSkZ80A-300x37.png\" alt=\"\" width=\"300\" height=\"37\" title=\"\" srcset=\"https:\/\/complex-systems-ai.com\/wp-content\/uploads\/2022\/04\/1_RrcnMPiEuYF2RD0bSkZ80A-300x37.png 300w, https:\/\/complex-systems-ai.com\/wp-content\/uploads\/2022\/04\/1_RrcnMPiEuYF2RD0bSkZ80A-18x2.png 18w, https:\/\/complex-systems-ai.com\/wp-content\/uploads\/2022\/04\/1_RrcnMPiEuYF2RD0bSkZ80A-600x75.png 600w, https:\/\/complex-systems-ai.com\/wp-content\/uploads\/2022\/04\/1_RrcnMPiEuYF2RD0bSkZ80A.png 700w\" sizes=\"(max-width: 300px) 100vw, 300px\" \/><\/div><\/div><\/figure><ul class=\"\"><li id=\"365c\" class=\"vn vo jd lj b lk ll ln lo lq vp lu vq ly vr mc vs vt vu vv gc\" data-selectable-paragraph=\"\">Deux fois plus d&rsquo;hommes que de femmes<\/li><li id=\"927f\" class=\"vn vo jd lj b lk vw ln vx lq vy lu vz ly wa mc vs vt vu vv gc\" data-selectable-paragraph=\"\">Les femmes survivent 30% mieux que les hommes<\/li><li data-selectable-paragraph=\"\">M\u00eames distributions dans les jeux d&rsquo;entrainement et de test<\/li><li id=\"13bf\" class=\"vn vo jd lj b lk vw ln vx lq vy lu vz ly wa mc vs vt vu vv gc\" data-selectable-paragraph=\"\">Pas de donn\u00e9es manquantes<\/li><\/ul><p id=\"994c\" class=\"pw-post-body-paragraph lh li jd lj b lk ll ke lm ln lo kh lp lq lr ls lt lu lv lw lx ly lz ma mb mc iw gc\" data-selectable-paragraph=\"\"><strong class=\"lj je\">Age<\/strong><\/p><figure class=\"kt ku kv kw gz kx gn go paragraph-image\"><div class=\"ky kz dq la cf lb\" tabindex=\"0\" role=\"button\"><div class=\"gn go xx\"><img loading=\"lazy\" decoding=\"async\" class=\"aligncenter wp-image-15769 size-full\" src=\"https:\/\/complex-systems-ai.com\/wp-content\/uploads\/2022\/04\/1_1enTNvbCTMCv8Ahhzu5HPQ.png\" alt=\"\" width=\"700\" height=\"90\" title=\"\" srcset=\"https:\/\/complex-systems-ai.com\/wp-content\/uploads\/2022\/04\/1_1enTNvbCTMCv8Ahhzu5HPQ.png 700w, https:\/\/complex-systems-ai.com\/wp-content\/uploads\/2022\/04\/1_1enTNvbCTMCv8Ahhzu5HPQ-300x39.png 300w, https:\/\/complex-systems-ai.com\/wp-content\/uploads\/2022\/04\/1_1enTNvbCTMCv8Ahhzu5HPQ-18x2.png 18w, https:\/\/complex-systems-ai.com\/wp-content\/uploads\/2022\/04\/1_1enTNvbCTMCv8Ahhzu5HPQ-600x77.png 600w\" sizes=\"(max-width: 700px) 100vw, 700px\" \/><\/div><\/div><\/figure><ul class=\"\"><li id=\"5fc3\" class=\"vn vo jd lj b lk ll ln lo lq vp lu vq ly vr mc vs vt vu vv gc\" data-selectable-paragraph=\"\">20\u00a0% de donn\u00e9es manquantes, donn\u00e9es manquantes coh\u00e9rentes et r\u00e9partition entre Train et Test<\/li><li id=\"5c4e\" class=\"vn vo jd lj b lk vw ln vx lq vy lu vz ly wa mc vs vt vu vv gc\" data-selectable-paragraph=\"\">Population centr\u00e9e sur les jeunes adultes, mais les \u00e2ges de 0 \u00e0 70 ans sont bien repr\u00e9sent\u00e9s<\/li><li id=\"89a7\" class=\"vn vo jd lj b lk vw ln vx lq vy lu vz ly wa mc vs vt vu vv gc\" data-selectable-paragraph=\"\">Survivabilit\u00e9 \u00e9tonnamment uniform\u00e9ment r\u00e9partie, \u00e0 l&rsquo;exception d&rsquo;un pic au plus jeune \u00e2ge<\/li><li id=\"b202\" class=\"vn vo jd lj b lk vw ln vx lq vy lu vz ly wa mc vs vt vu vv gc\" data-selectable-paragraph=\"\">L&rsquo;\u00e2ge semble li\u00e9 \u00e0 Siblings, Pclass et Fare, et un peu plus surprenant \u00e0 Embarked<\/li><\/ul><p id=\"5378\" class=\"pw-post-body-paragraph lh li jd lj b lk ll ke lm ln lo kh lp lq lr ls lt lu lv lw lx ly lz ma mb mc iw gc\" data-selectable-paragraph=\"\"><strong class=\"lj je\">Name<\/strong><\/p><figure class=\"kt ku kv kw gz kx gn go paragraph-image\"><div class=\"ky kz dq la cf lb\" tabindex=\"0\" role=\"button\"><div class=\"gn go xz\"><img loading=\"lazy\" decoding=\"async\" class=\"aligncenter wp-image-15770 size-full\" src=\"https:\/\/complex-systems-ai.com\/wp-content\/uploads\/2022\/04\/1_wE5tp_v5tiGktWMa8zOQxQ.png\" alt=\"\" width=\"700\" height=\"88\" title=\"\" srcset=\"https:\/\/complex-systems-ai.com\/wp-content\/uploads\/2022\/04\/1_wE5tp_v5tiGktWMa8zOQxQ.png 700w, https:\/\/complex-systems-ai.com\/wp-content\/uploads\/2022\/04\/1_wE5tp_v5tiGktWMa8zOQxQ-300x38.png 300w, https:\/\/complex-systems-ai.com\/wp-content\/uploads\/2022\/04\/1_wE5tp_v5tiGktWMa8zOQxQ-18x2.png 18w, https:\/\/complex-systems-ai.com\/wp-content\/uploads\/2022\/04\/1_wE5tp_v5tiGktWMa8zOQxQ-600x75.png 600w\" sizes=\"(max-width: 700px) 100vw, 700px\" \/><\/div><\/div><\/figure><ul class=\"\"><li id=\"dbe9\" class=\"vn vo jd lj b lk ll ln lo lq vp lu vq ly vr mc vs vt vu vv gc\" data-selectable-paragraph=\"\">Pas de donn\u00e9es manquantes<\/li><li id=\"9f2e\" class=\"vn vo jd lj b lk vw ln vx lq vy lu vz ly wa mc vs vt vu vv gc\" data-selectable-paragraph=\"\">Tous les noms sont distincts<\/li><\/ul><p id=\"2086\" class=\"pw-post-body-paragraph lh li jd lj b lk ll ke lm ln lo kh lp lq lr ls lt lu lv lw lx ly lz ma mb mc iw gc\" data-selectable-paragraph=\"\"><strong class=\"lj je\">Pclass<\/strong><\/p><figure class=\"kt ku kv kw gz kx gn go paragraph-image\"><div class=\"ky kz dq la cf lb\" tabindex=\"0\" role=\"button\"><div class=\"gn go ya\"><img loading=\"lazy\" decoding=\"async\" class=\"aligncenter size-medium wp-image-15771\" src=\"https:\/\/complex-systems-ai.com\/wp-content\/uploads\/2022\/04\/1_d5WgH6RL3fCO_utnDigvTg-300x37.png\" alt=\"\" width=\"300\" height=\"37\" title=\"\" srcset=\"https:\/\/complex-systems-ai.com\/wp-content\/uploads\/2022\/04\/1_d5WgH6RL3fCO_utnDigvTg-300x37.png 300w, https:\/\/complex-systems-ai.com\/wp-content\/uploads\/2022\/04\/1_d5WgH6RL3fCO_utnDigvTg-18x2.png 18w, https:\/\/complex-systems-ai.com\/wp-content\/uploads\/2022\/04\/1_d5WgH6RL3fCO_utnDigvTg-600x75.png 600w, https:\/\/complex-systems-ai.com\/wp-content\/uploads\/2022\/04\/1_d5WgH6RL3fCO_utnDigvTg.png 700w\" sizes=\"(max-width: 300px) 100vw, 300px\" \/><\/div><\/div><\/figure><ul class=\"\"><li id=\"5ad4\" class=\"vn vo jd lj b lk ll ln lo lq vp lu vq ly vr mc vs vt vu vv gc\" data-selectable-paragraph=\"\">La capacit\u00e9 de survie suit de pr\u00e8s la classe (la premi\u00e8re classe est la plus susceptible de survivre, la troisi\u00e8me classe la moins susceptible)<\/li><li id=\"4667\" class=\"vn vo jd lj b lk vw ln vx lq vy lu vz ly wa mc vs vt vu vv gc\" data-selectable-paragraph=\"\">R\u00e9partition similaire entre Train et Test<\/li><li id=\"4617\" class=\"vn vo jd lj b lk vw ln vx lq vy lu vz ly wa mc vs vt vu vv gc\" data-selectable-paragraph=\"\">Aucune donn\u00e9e manquante<\/li><\/ul><p id=\"ca16\" class=\"pw-post-body-paragraph lh li jd lj b lk ll ke lm ln lo kh lp lq lr ls lt lu lv lw lx ly lz ma mb mc iw gc\" data-selectable-paragraph=\"\"><strong class=\"lj je\">SibSp<\/strong><\/p><figure class=\"kt ku kv kw gz kx gn go paragraph-image\"><div class=\"ky kz dq la cf lb\" tabindex=\"0\" role=\"button\"><div class=\"gn go yb\"><img loading=\"lazy\" decoding=\"async\" class=\"aligncenter wp-image-15772 size-full\" src=\"https:\/\/complex-systems-ai.com\/wp-content\/uploads\/2022\/04\/1_tXOvSlsPsBssxZiP_coMdw.png\" alt=\"\" width=\"700\" height=\"89\" title=\"\" srcset=\"https:\/\/complex-systems-ai.com\/wp-content\/uploads\/2022\/04\/1_tXOvSlsPsBssxZiP_coMdw.png 700w, https:\/\/complex-systems-ai.com\/wp-content\/uploads\/2022\/04\/1_tXOvSlsPsBssxZiP_coMdw-300x38.png 300w, https:\/\/complex-systems-ai.com\/wp-content\/uploads\/2022\/04\/1_tXOvSlsPsBssxZiP_coMdw-18x2.png 18w, https:\/\/complex-systems-ai.com\/wp-content\/uploads\/2022\/04\/1_tXOvSlsPsBssxZiP_coMdw-600x76.png 600w\" sizes=\"(max-width: 700px) 100vw, 700px\" \/><\/div><\/div><\/figure><ul class=\"\"><li id=\"f979\" class=\"vn vo jd lj b lk ll ln lo lq vp lu vq ly vr mc vs vt vu vv gc\" data-selectable-paragraph=\"\">Il semble y avoir un pic de survie \u00e0 1 et dans une certaine mesure \u00e0 2, mais (en regardant le volet de d\u00e9tails non illustr\u00e9 ici), il y a une forte baisse \u00e0 3 et plus. Les familles nombreuses ne pouvaient pas le faire ou \u00e9taient peut-\u00eatre plus pauvres ?<\/li><\/ul><p id=\"7615\" class=\"pw-post-body-paragraph lh li jd lj b lk ll ke lm ln lo kh lp lq lr ls lt lu lv lw lx ly lz ma mb mc iw gc\" data-selectable-paragraph=\"\"><strong class=\"lj je\">Parch<\/strong><\/p><figure class=\"kt ku kv kw gz kx gn go paragraph-image\"><div class=\"ky kz dq la cf lb\" tabindex=\"0\" role=\"button\"><div class=\"gn go xz\"><img loading=\"lazy\" decoding=\"async\" class=\"aligncenter wp-image-15773 size-full\" src=\"https:\/\/complex-systems-ai.com\/wp-content\/uploads\/2022\/04\/1_jEG2f2TAqONtRRGqyXKCVQ.png\" alt=\"\" width=\"700\" height=\"90\" title=\"\" srcset=\"https:\/\/complex-systems-ai.com\/wp-content\/uploads\/2022\/04\/1_jEG2f2TAqONtRRGqyXKCVQ.png 700w, https:\/\/complex-systems-ai.com\/wp-content\/uploads\/2022\/04\/1_jEG2f2TAqONtRRGqyXKCVQ-300x39.png 300w, https:\/\/complex-systems-ai.com\/wp-content\/uploads\/2022\/04\/1_jEG2f2TAqONtRRGqyXKCVQ-18x2.png 18w, https:\/\/complex-systems-ai.com\/wp-content\/uploads\/2022\/04\/1_jEG2f2TAqONtRRGqyXKCVQ-600x77.png 600w\" sizes=\"(max-width: 700px) 100vw, 700px\" \/><\/div><\/div><\/figure><ul class=\"\"><li id=\"cd76\" class=\"vn vo jd lj b lk ll ln lo lq vp lu vq ly vr mc vs vt vu vv gc\" data-selectable-paragraph=\"\">Distribution similaire<\/li><li id=\"aa50\" class=\"vn vo jd lj b lk vw ln vx lq vy lu vz ly wa mc vs vt vu vv gc\" data-selectable-paragraph=\"\">Aucune donn\u00e9e manquante<\/li><\/ul><p id=\"3515\" class=\"pw-post-body-paragraph lh li jd lj b lk ll ke lm ln lo kh lp lq lr ls lt lu lv lw lx ly lz ma mb mc iw gc\" data-selectable-paragraph=\"\"><strong class=\"lj je\">Ticket<\/strong><\/p><ul class=\"\"><li id=\"2c60\" class=\"vn vo jd lj b lk ll ln lo lq vp lu vq ly vr mc vs vt vu vv gc\" data-selectable-paragraph=\"\">~80\u00a0% de valeurs distinctes, soit environ 1\u00a0ticket partag\u00e9 sur 5 en moyenne<\/li><li id=\"e929\" class=\"vn vo jd lj b lk vw ln vx lq vy lu vz ly wa mc vs vt vu vv gc\" data-selectable-paragraph=\"\">Le ticket de fr\u00e9quence le plus \u00e9lev\u00e9 \u00e9tait de 7, ce qui est g\u00e9n\u00e9ralement coh\u00e9rent avec le nombre maximum de fr\u00e8res et s\u0153urs (8)<\/li><li id=\"ebeb\" class=\"vn vo jd lj b lk vw ln vx lq vy lu vz ly wa mc vs vt vu vv gc\" data-selectable-paragraph=\"\">Aucune donn\u00e9e manquante, les donn\u00e9es semblent assez propres<\/li><\/ul><p id=\"90f2\" class=\"pw-post-body-paragraph lh li jd lj b lk ll ke lm ln lo kh lp lq lr ls lt lu lv lw lx ly lz ma mb mc iw gc\" data-selectable-paragraph=\"\"><strong class=\"lj je\">Fare<\/strong><\/p><figure class=\"kt ku kv kw gz kx gn go paragraph-image\"><div class=\"ky kz dq la cf lb\" tabindex=\"0\" role=\"button\"><div class=\"gn go yc\"><img loading=\"lazy\" decoding=\"async\" class=\"aligncenter wp-image-15774 size-full\" src=\"https:\/\/complex-systems-ai.com\/wp-content\/uploads\/2022\/04\/1_RouVzHB1cez22l_pzcp-vg.png\" alt=\"\" width=\"700\" height=\"92\" title=\"\" srcset=\"https:\/\/complex-systems-ai.com\/wp-content\/uploads\/2022\/04\/1_RouVzHB1cez22l_pzcp-vg.png 700w, https:\/\/complex-systems-ai.com\/wp-content\/uploads\/2022\/04\/1_RouVzHB1cez22l_pzcp-vg-300x39.png 300w, https:\/\/complex-systems-ai.com\/wp-content\/uploads\/2022\/04\/1_RouVzHB1cez22l_pzcp-vg-18x2.png 18w, https:\/\/complex-systems-ai.com\/wp-content\/uploads\/2022\/04\/1_RouVzHB1cez22l_pzcp-vg-600x79.png 600w\" sizes=\"(max-width: 700px) 100vw, 700px\" \/><\/div><\/div><\/figure><ul class=\"\"><li id=\"5b60\" class=\"vn vo jd lj b lk ll ln lo lq vp lu vq ly vr mc vs vt vu vv gc\" data-selectable-paragraph=\"\">Comme pr\u00e9vu, et comme pour Pclass, les tarifs les plus \u00e9lev\u00e9s ont mieux surv\u00e9cu (bien que la taille de l&rsquo;\u00e9chantillon devienne assez mince \u00e0 des niveaux plus \u00e9lev\u00e9s)<\/li><li id=\"45cf\" class=\"vn vo jd lj b lk vw ln vx lq vy lu vz ly wa mc vs vt vu vv gc\" data-selectable-paragraph=\"\">Un rapport de corr\u00e9lation de 0,26 pour \u00ab Survived \u00bb est relativement \u00e9lev\u00e9, il aurait donc tendance \u00e0 soutenir cette th\u00e9orie<\/li><li id=\"e5fc\" class=\"vn vo jd lj b lk vw ln vx lq vy lu vz ly wa mc vs vt vu vv gc\" data-selectable-paragraph=\"\">Environ 30\u00a0% de valeurs distinctes semblent un peu \u00e9lev\u00e9es car vous vous attendriez \u00e0 moins de prix fixes, mais il semble qu&rsquo;il y ait beaucoup de granularit\u00e9, donc \u00e7a va<\/li><li id=\"c112\" class=\"vn vo jd lj b lk vw ln vx lq vy lu vz ly wa mc vs vt vu vv gc\" data-selectable-paragraph=\"\">Seulement 1 enregistrement manquant dans le jeu de test, donn\u00e9es assez coh\u00e9rentes entre Train et Test<\/li><\/ul><p id=\"af3e\" class=\"pw-post-body-paragraph lh li jd lj b lk ll ke lm ln lo kh lp lq lr ls lt lu lv lw lx ly lz ma mb mc iw gc\" data-selectable-paragraph=\"\"><strong class=\"lj je\">Cabin<\/strong><\/p><figure class=\"kt ku kv kw gz kx gn go paragraph-image\"><div class=\"ky kz dq la cf lb\" tabindex=\"0\" role=\"button\"><div class=\"gn go yb\"><img loading=\"lazy\" decoding=\"async\" class=\"aligncenter wp-image-15775 size-full\" src=\"https:\/\/complex-systems-ai.com\/wp-content\/uploads\/2022\/04\/1_-7Ndqe4_rKv4QwwAsenHrA.png\" alt=\"\" width=\"700\" height=\"96\" title=\"\" srcset=\"https:\/\/complex-systems-ai.com\/wp-content\/uploads\/2022\/04\/1_-7Ndqe4_rKv4QwwAsenHrA.png 700w, https:\/\/complex-systems-ai.com\/wp-content\/uploads\/2022\/04\/1_-7Ndqe4_rKv4QwwAsenHrA-300x41.png 300w, https:\/\/complex-systems-ai.com\/wp-content\/uploads\/2022\/04\/1_-7Ndqe4_rKv4QwwAsenHrA-18x2.png 18w, https:\/\/complex-systems-ai.com\/wp-content\/uploads\/2022\/04\/1_-7Ndqe4_rKv4QwwAsenHrA-600x82.png 600w\" sizes=\"(max-width: 700px) 100vw, 700px\" \/><\/div><\/div><\/figure><ul class=\"\"><li id=\"c68d\" class=\"vn vo jd lj b lk ll ln lo lq vp lu vq ly vr mc vs vt vu vv gc\" data-selectable-paragraph=\"\">78% de donn\u00e9es manquantes<\/li><li id=\"6818\" class=\"vn vo jd lj b lk vw ln vx lq vy lu vz ly wa mc vs vt vu vv gc\" data-selectable-paragraph=\"\">La fr\u00e9quence maximale est de 4, ce qui serait logique d&rsquo;avoir 4 personnes maximum dans une cabine<\/li><\/ul><p id=\"f07d\" class=\"pw-post-body-paragraph lh li jd lj b lk ll ke lm ln lo kh lp lq lr ls lt lu lv lw lx ly lz ma mb mc iw gc\" data-selectable-paragraph=\"\"><strong class=\"lj je\">Embarked<\/strong><\/p><figure class=\"kt ku kv kw gz kx gn go paragraph-image\"><div class=\"ky kz dq la cf lb\" tabindex=\"0\" role=\"button\"><div class=\"gn go ya\"><img loading=\"lazy\" decoding=\"async\" class=\"aligncenter wp-image-15776 size-full\" src=\"https:\/\/complex-systems-ai.com\/wp-content\/uploads\/2022\/04\/1_RKQGXELxzuRw5_ELrwEHgg.png\" alt=\"\" width=\"700\" height=\"92\" title=\"\" srcset=\"https:\/\/complex-systems-ai.com\/wp-content\/uploads\/2022\/04\/1_RKQGXELxzuRw5_ELrwEHgg.png 700w, https:\/\/complex-systems-ai.com\/wp-content\/uploads\/2022\/04\/1_RKQGXELxzuRw5_ELrwEHgg-300x39.png 300w, https:\/\/complex-systems-ai.com\/wp-content\/uploads\/2022\/04\/1_RKQGXELxzuRw5_ELrwEHgg-18x2.png 18w, https:\/\/complex-systems-ai.com\/wp-content\/uploads\/2022\/04\/1_RKQGXELxzuRw5_ELrwEHgg-600x79.png 600w\" sizes=\"(max-width: 700px) 100vw, 700px\" \/><\/div><\/div><\/figure><ul class=\"\"><li id=\"850c\" class=\"vn vo jd lj b lk ll ln lo lq vp lu vq ly vr mc vs vt vu vv gc\" data-selectable-paragraph=\"\">3 valeurs distinctes (S, C, Q)<\/li><li id=\"1bc8\" class=\"vn vo jd lj b lk vw ln vx lq vy lu vz ly wa mc vs vt vu vv gc\" data-selectable-paragraph=\"\">Capacit\u00e9 de survie un peu plus \u00e9lev\u00e9e \u00e0 C\u00a0; cela pourrait-il \u00eatre un endroit avec des gens plus riches\u00a0?<\/li><li id=\"3b61\" class=\"vn vo jd lj b lk vw ln vx lq vy lu vz ly wa mc vs vt vu vv gc\" data-selectable-paragraph=\"\">Quoi qu&rsquo;il en soit, \u00ab\u00a0Embarqu\u00e9\u00a0\u00bb affiche un coefficient d&rsquo;incertitude de seulement 0,03 pour \u00ab\u00a0Survived\u00a0\u00bb, il peut donc ne pas \u00eatre tr\u00e8s significatif<\/li><\/ul><h2 id=\"fdb5\" class=\"wz wc jd bn wd og xr oh oj ok xs ol on lq xt oo oq lu xu or ot ly xv ou ow xw gc\" data-selectable-paragraph=\"\"><span class=\"ez-toc-section\" id=\"Analyse-generale\"><\/span>Analyse g\u00e9n\u00e9rale<span class=\"ez-toc-section-end\"><\/span><\/h2><ul class=\"\"><li id=\"a4e5\" class=\"vn vo jd lj b lk wo ln wp lq yd lu ye ly yf mc vs vt vu vv gc\" data-selectable-paragraph=\"\">Dans l&rsquo;ensemble, la plupart des donn\u00e9es sont pr\u00e9sentes et semblent coh\u00e9rentes et logiques\u00a0; pas de valeurs aberrantes majeures ou d&rsquo;\u00e9normes surprises<\/li><\/ul><p id=\"cbec\" class=\"pw-post-body-paragraph lh li jd lj b lk ll ke lm ln lo kh lp lq lr ls lt lu lv lw lx ly lz ma mb mc iw gc\" data-selectable-paragraph=\"\"><strong class=\"lj je\">Test versus Training data<\/strong><\/p><p>Le test contient environ 50 % de lignes en moins.<\/p><p>Train et Test sont tr\u00e8s proches dans la distribution des donn\u00e9es manquantes.<\/p><p>Les valeurs des donn\u00e9es d&rsquo;entra\u00eenement et de test sont tr\u00e8s coh\u00e9rentes dans tous les domaines<\/p><p id=\"11ec\" class=\"pw-post-body-paragraph lh li jd lj b lk ll ke lm ln lo kh lp lq lr ls lt lu lv lw lx ly lz ma mb mc iw gc\" data-selectable-paragraph=\"\"><strong class=\"lj je\">Association\/correlation analysis<\/strong><\/p><ul class=\"\"><li id=\"21d5\" class=\"vn vo jd lj b lk ll ln lo lq vp lu vq ly vr mc vs vt vu vv gc\" data-selectable-paragraph=\"\">Le sexe, le tarif et la classe donnent le plus d&rsquo;informations sur les survivants<\/li><li id=\"ca61\" class=\"vn vo jd lj b lk vw ln vx lq vy lu vz ly wa mc vs vt vu vv gc\" data-selectable-paragraph=\"\">Comme pr\u00e9vu, Fare et Pclass sont fortement corr\u00e9l\u00e9s<\/li><li id=\"bca4\" class=\"vn vo jd lj b lk vw ln vx lq vy lu vz ly wa mc vs vt vu vv gc\" data-selectable-paragraph=\"\">L&rsquo;\u00e2ge semble nous en dire beaucoup sur Pclass, les fr\u00e8res et s\u0153urs et dans une certaine mesure Fare, ce qui serait quelque peu attendu. Il semble nous en dire beaucoup sur \u00ab\u00a0Embarqu\u00e9\u00a0\u00bb ce qui est un peu plus surprenant.<\/li><\/ul><p id=\"c654\" class=\"pw-post-body-paragraph lh li jd lj b lk ll ke lm ln lo kh lp lq lr ls lt lu lv lw lx ly lz ma mb mc iw gc\" data-selectable-paragraph=\"\"><strong class=\"lj je\">Missing data<\/strong><\/p><ul class=\"\"><li id=\"c160\" class=\"vn vo jd lj b lk ll ln lo lq vp lu vq ly vr mc vs vt vu vv gc\" data-selectable-paragraph=\"\">Il n&rsquo;y a pas de donn\u00e9es manquantes significatives, sauf pour l&rsquo;\u00e2ge (~ 20 %) et la cabine (~ 77 %)<\/li><\/ul>\t\t\t\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t\t<\/div>\n\t\t<\/div>\n\t\t\t\t\t<\/div>\n\t\t<\/section>\n\t\t\t\t<\/div>\n\t\t","protected":false},"excerpt":{"rendered":"<p>Descriptive Analysis Wiki Homepage Exploratory data analysis (EDA) is an essential first step in most data science projects and \u2026 <\/p>","protected":false},"author":1,"featured_media":0,"parent":15506,"menu_order":0,"comment_status":"closed","ping_status":"closed","template":"","meta":{"footnotes":""},"class_list":["post-15754","page","type-page","status-publish","hentry"],"amp_enabled":true,"_links":{"self":[{"href":"https:\/\/complex-systems-ai.com\/en\/wp-json\/wp\/v2\/pages\/15754","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/complex-systems-ai.com\/en\/wp-json\/wp\/v2\/pages"}],"about":[{"href":"https:\/\/complex-systems-ai.com\/en\/wp-json\/wp\/v2\/types\/page"}],"author":[{"embeddable":true,"href":"https:\/\/complex-systems-ai.com\/en\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/complex-systems-ai.com\/en\/wp-json\/wp\/v2\/comments?post=15754"}],"version-history":[{"count":3,"href":"https:\/\/complex-systems-ai.com\/en\/wp-json\/wp\/v2\/pages\/15754\/revisions"}],"predecessor-version":[{"id":15779,"href":"https:\/\/complex-systems-ai.com\/en\/wp-json\/wp\/v2\/pages\/15754\/revisions\/15779"}],"up":[{"embeddable":true,"href":"https:\/\/complex-systems-ai.com\/en\/wp-json\/wp\/v2\/pages\/15506"}],"wp:attachment":[{"href":"https:\/\/complex-systems-ai.com\/en\/wp-json\/wp\/v2\/media?parent=15754"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}