diff --git a/2023/homework/WangTianlong/homework_credit_scoring.ipynb b/2023/homework/WangTianlong/homework_credit_scoring.ipynb
new file mode 100644
index 00000000..c2f7845d
--- /dev/null
+++ b/2023/homework/WangTianlong/homework_credit_scoring.ipynb
@@ -0,0 +1,1219 @@
+{
+ "cells": [
+  {
+   "cell_type": "markdown",
+   "metadata": {},
+   "source": [
+    "# 一起来打怪之 Credit Scoring 练习"
+   ]
+  },
+  {
+   "cell_type": "markdown",
+   "metadata": {},
+   "source": [
+    "---\n",
+    "## 作业说明\n",
+    "\n",
+    "- 答题步骤：\n",
+    "    - 回答问题**请保留每一步**操作过程，请不要仅仅给出最后答案\n",
+    "    - 请养成代码注释的好习惯\n",
+    "\n",
+    "- 解题思路：\n",
+    "    - 为方便大家准确理解题目，在习题实战中有所收获，本文档提供了解题思路提示\n",
+    "    - 解题思路**仅供参考**，鼓励原创解题方法\n",
+    "    - 为督促同学们自己思考，解题思路内容设置为**白色**，必要时请从冒号后拖动鼠标查看\n",
+    "\n",
+    "- 所用数据\n",
+    "    - 请注意导入数据库后先**查看和了解数据的基本性质**，后面的问题不再一一提醒"
+   ]
+  },
+  {
+   "cell_type": "markdown",
+   "metadata": {},
+   "source": [
+    "## machine learning for credit scoring\n",
+    "\n",
+    "\n",
+    "Banks play a crucial role in market economies. They decide who can get finance and on what terms and can make or break investment decisions. For markets and society to function, individuals and companies need access to credit. \n",
+    "\n",
+    "Credit scoring algorithms, which make a guess at the probability of default, are the method banks use to determine whether or not a loan should be granted. This competition requires participants to improve on the state of the art in credit scoring, by predicting the probability that somebody will experience financial distress in the next two years. [Dataset](https://www.kaggle.com/c/GiveMeSomeCredit)\n",
+    "\n",
+    "Attribute Information:\n",
+    "\n",
+    "|Variable Name\t|\tDescription\t|\tType|\n",
+    "|----|----|----|\n",
+    "|SeriousDlqin2yrs\t|\tPerson experienced 90 days past due delinquency or worse \t|\tY/N|\n",
+    "|RevolvingUtilizationOfUnsecuredLines\t|\tTotal balance on credit divided by the sum of credit limits\t|\tpercentage|\n",
+    "|age\t|\tAge of borrower in years\t|\tinteger|\n",
+    "|NumberOfTime30-59DaysPastDueNotWorse\t|\tNumber of times borrower has been 30-59 days past due |\tinteger|\n",
+    "|DebtRatio\t|\tMonthly debt payments\t|\tpercentage|\n",
+    "|MonthlyIncome\t|\tMonthly income\t|\treal|\n",
+    "|NumberOfOpenCreditLinesAndLoans\t|\tNumber of Open loans |\tinteger|\n",
+    "|NumberOfTimes90DaysLate\t|\tNumber of times borrower has been 90 days or more past due.\t|\tinteger|\n",
+    "|NumberRealEstateLoansOrLines\t|\tNumber of mortgage and real estate loans\t|\tinteger|\n",
+    "|NumberOfTime60-89DaysPastDueNotWorse\t|\tNumber of times borrower has been 60-89 days past due |integer|\n",
+    "|NumberOfDependents\t|\tNumber of dependents in family\t|\tinteger|\n"
+   ]
+  },
+  {
+   "cell_type": "markdown",
+   "metadata": {},
+   "source": [
+    "----------\n",
+    "## Read the data into Pandas "
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": 4,
+   "metadata": {
+    "jupyter": {
+     "outputs_hidden": true
+    }
+   },
+   "outputs": [
+    {
+     "data": {
+      "text/html": [
+       "<div>\n",
+       "<style scoped>\n",
+       "    .dataframe tbody tr th:only-of-type {\n",
+       "        vertical-align: middle;\n",
+       "    }\n",
+       "\n",
+       "    .dataframe tbody tr th {\n",
+       "        vertical-align: top;\n",
+       "    }\n",
+       "\n",
+       "    .dataframe thead th {\n",
+       "        text-align: right;\n",
+       "    }\n",
+       "</style>\n",
+       "<table border=\"1\" class=\"dataframe\">\n",
+       "  <thead>\n",
+       "    <tr style=\"text-align: right;\">\n",
+       "      <th></th>\n",
+       "      <th>SeriousDlqin2yrs</th>\n",
+       "      <th>RevolvingUtilizationOfUnsecuredLines</th>\n",
+       "      <th>age</th>\n",
+       "      <th>NumberOfTime30-59DaysPastDueNotWorse</th>\n",
+       "      <th>DebtRatio</th>\n",
+       "      <th>MonthlyIncome</th>\n",
+       "      <th>NumberOfOpenCreditLinesAndLoans</th>\n",
+       "      <th>NumberOfTimes90DaysLate</th>\n",
+       "      <th>NumberRealEstateLoansOrLines</th>\n",
+       "      <th>NumberOfTime60-89DaysPastDueNotWorse</th>\n",
+       "      <th>NumberOfDependents</th>\n",
+       "    </tr>\n",
+       "  </thead>\n",
+       "  <tbody>\n",
+       "    <tr>\n",
+       "      <th>0</th>\n",
+       "      <td>1</td>\n",
+       "      <td>0.766127</td>\n",
+       "      <td>45.0</td>\n",
+       "      <td>2.0</td>\n",
+       "      <td>0.802982</td>\n",
+       "      <td>9120.0</td>\n",
+       "      <td>13.0</td>\n",
+       "      <td>0.0</td>\n",
+       "      <td>6.0</td>\n",
+       "      <td>0.0</td>\n",
+       "      <td>2.0</td>\n",
+       "    </tr>\n",
+       "    <tr>\n",
+       "      <th>1</th>\n",
+       "      <td>0</td>\n",
+       "      <td>0.957151</td>\n",
+       "      <td>40.0</td>\n",
+       "      <td>0.0</td>\n",
+       "      <td>0.121876</td>\n",
+       "      <td>2600.0</td>\n",
+       "      <td>4.0</td>\n",
+       "      <td>0.0</td>\n",
+       "      <td>0.0</td>\n",
+       "      <td>0.0</td>\n",
+       "      <td>1.0</td>\n",
+       "    </tr>\n",
+       "    <tr>\n",
+       "      <th>2</th>\n",
+       "      <td>0</td>\n",
+       "      <td>0.658180</td>\n",
+       "      <td>38.0</td>\n",
+       "      <td>1.0</td>\n",
+       "      <td>0.085113</td>\n",
+       "      <td>3042.0</td>\n",
+       "      <td>2.0</td>\n",
+       "      <td>1.0</td>\n",
+       "      <td>0.0</td>\n",
+       "      <td>0.0</td>\n",
+       "      <td>0.0</td>\n",
+       "    </tr>\n",
+       "    <tr>\n",
+       "      <th>3</th>\n",
+       "      <td>0</td>\n",
+       "      <td>0.233810</td>\n",
+       "      <td>30.0</td>\n",
+       "      <td>0.0</td>\n",
+       "      <td>0.036050</td>\n",
+       "      <td>3300.0</td>\n",
+       "      <td>5.0</td>\n",
+       "      <td>0.0</td>\n",
+       "      <td>0.0</td>\n",
+       "      <td>0.0</td>\n",
+       "      <td>0.0</td>\n",
+       "    </tr>\n",
+       "    <tr>\n",
+       "      <th>4</th>\n",
+       "      <td>0</td>\n",
+       "      <td>0.907239</td>\n",
+       "      <td>49.0</td>\n",
+       "      <td>1.0</td>\n",
+       "      <td>0.024926</td>\n",
+       "      <td>63588.0</td>\n",
+       "      <td>7.0</td>\n",
+       "      <td>0.0</td>\n",
+       "      <td>1.0</td>\n",
+       "      <td>0.0</td>\n",
+       "      <td>0.0</td>\n",
+       "    </tr>\n",
+       "  </tbody>\n",
+       "</table>\n",
+       "</div>"
+      ],
+      "text/plain": [
+       "   SeriousDlqin2yrs  RevolvingUtilizationOfUnsecuredLines   age  \\\n",
+       "0                 1                              0.766127  45.0   \n",
+       "1                 0                              0.957151  40.0   \n",
+       "2                 0                              0.658180  38.0   \n",
+       "3                 0                              0.233810  30.0   \n",
+       "4                 0                              0.907239  49.0   \n",
+       "\n",
+       "   NumberOfTime30-59DaysPastDueNotWorse  DebtRatio  MonthlyIncome  \\\n",
+       "0                                   2.0   0.802982         9120.0   \n",
+       "1                                   0.0   0.121876         2600.0   \n",
+       "2                                   1.0   0.085113         3042.0   \n",
+       "3                                   0.0   0.036050         3300.0   \n",
+       "4                                   1.0   0.024926        63588.0   \n",
+       "\n",
+       "   NumberOfOpenCreditLinesAndLoans  NumberOfTimes90DaysLate  \\\n",
+       "0                             13.0                      0.0   \n",
+       "1                              4.0                      0.0   \n",
+       "2                              2.0                      1.0   \n",
+       "3                              5.0                      0.0   \n",
+       "4                              7.0                      0.0   \n",
+       "\n",
+       "   NumberRealEstateLoansOrLines  NumberOfTime60-89DaysPastDueNotWorse  \\\n",
+       "0                           6.0                                   0.0   \n",
+       "1                           0.0                                   0.0   \n",
+       "2                           0.0                                   0.0   \n",
+       "3                           0.0                                   0.0   \n",
+       "4                           1.0                                   0.0   \n",
+       "\n",
+       "   NumberOfDependents  \n",
+       "0                 2.0  \n",
+       "1                 1.0  \n",
+       "2                 0.0  \n",
+       "3                 0.0  \n",
+       "4                 0.0  "
+      ]
+     },
+     "execution_count": 4,
+     "metadata": {},
+     "output_type": "execute_result"
+    }
+   ],
+   "source": [
+    "import pandas as pd\n",
+    "pd.set_option('display.max_columns', 500)\n",
+    "import zipfile\n",
+    "with zipfile.ZipFile('KaggleCredit2.csv.zip', 'r') as z:\n",
+    "    f = z.open('KaggleCredit2.csv')\n",
+    "    data = pd.read_csv(f, index_col=0)\n",
+    "data.head()"
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": 5,
+   "metadata": {
+    "jupyter": {
+     "outputs_hidden": true
+    }
+   },
+   "outputs": [
+    {
+     "data": {
+      "text/plain": [
+       "(112915, 11)"
+      ]
+     },
+     "execution_count": 5,
+     "metadata": {},
+     "output_type": "execute_result"
+    }
+   ],
+   "source": [
+    "data.shape"
+   ]
+  },
+  {
+   "cell_type": "markdown",
+   "metadata": {},
+   "source": [
+    "------------\n",
+    "## Drop na"
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": 6,
+   "metadata": {
+    "jupyter": {
+     "outputs_hidden": true
+    }
+   },
+   "outputs": [
+    {
+     "data": {
+      "text/plain": [
+       "SeriousDlqin2yrs                           0\n",
+       "RevolvingUtilizationOfUnsecuredLines       0\n",
+       "age                                     4267\n",
+       "NumberOfTime30-59DaysPastDueNotWorse       0\n",
+       "DebtRatio                                  0\n",
+       "MonthlyIncome                              0\n",
+       "NumberOfOpenCreditLinesAndLoans            0\n",
+       "NumberOfTimes90DaysLate                    0\n",
+       "NumberRealEstateLoansOrLines               0\n",
+       "NumberOfTime60-89DaysPastDueNotWorse       0\n",
+       "NumberOfDependents                      4267\n",
+       "dtype: int64"
+      ]
+     },
+     "execution_count": 6,
+     "metadata": {},
+     "output_type": "execute_result"
+    }
+   ],
+   "source": [
+    "data.isnull().sum(axis=0)"
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": 7,
+   "metadata": {
+    "jupyter": {
+     "outputs_hidden": true
+    }
+   },
+   "outputs": [
+    {
+     "data": {
+      "text/plain": [
+       "(108648, 11)"
+      ]
+     },
+     "execution_count": 7,
+     "metadata": {},
+     "output_type": "execute_result"
+    }
+   ],
+   "source": [
+    "data.dropna(inplace=True)\n",
+    "data.shape"
+   ]
+  },
+  {
+   "cell_type": "markdown",
+   "metadata": {},
+   "source": [
+    "---------\n",
+    "## Create X and y"
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": 8,
+   "metadata": {
+    "jupyter": {
+     "outputs_hidden": true
+    }
+   },
+   "outputs": [],
+   "source": [
+    "y = data['SeriousDlqin2yrs']\n",
+    "X = data.drop('SeriousDlqin2yrs', axis=1)"
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": 9,
+   "metadata": {
+    "jupyter": {
+     "outputs_hidden": true
+    }
+   },
+   "outputs": [
+    {
+     "data": {
+      "text/plain": [
+       "0.06742876076872101"
+      ]
+     },
+     "execution_count": 9,
+     "metadata": {},
+     "output_type": "execute_result"
+    }
+   ],
+   "source": [
+    "y.mean()"
+   ]
+  },
+  {
+   "cell_type": "markdown",
+   "metadata": {},
+   "source": [
+    "---\n",
+    "## 练习1：把数据切分成训练集和测试集\n",
+    "- 提示：<span style='color:white'>from sklearn.model_selection import train_test_split('Gender') </span>"
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": 10,
+   "metadata": {
+    "jupyter": {
+     "outputs_hidden": true
+    }
+   },
+   "outputs": [],
+   "source": [
+    "from sklearn.model_selection import train_test_split\n",
+    "X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.2, random_state=42)"
+   ]
+  },
+  {
+   "cell_type": "markdown",
+   "metadata": {},
+   "source": [
+    "----\n",
+    "## 练习2：使用logistic regression/决策树/SVM/KNN...等sklearn分类算法进行分类\n",
+    "尝试查sklearn API了解模型参数含义，调整不同的参数"
+   ]
+  },
+  {
+   "cell_type": "markdown",
+   "metadata": {},
+   "source": [
+    "### Logistic regression\n",
+    "- 提示：<span style='color:white'>from sklearn import linear_model('Gender') </span>"
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": 11,
+   "metadata": {
+    "jupyter": {
+     "outputs_hidden": true
+    },
+    "scrolled": true
+   },
+   "outputs": [
+    {
+     "name": "stderr",
+     "output_type": "stream",
+     "text": [
+      "D:\\Anaconda\\envs\\taiji\\Lib\\site-packages\\sklearn\\utils\\validation.py:767: FutureWarning: is_sparse is deprecated and will be removed in a future version. Check `isinstance(dtype, pd.SparseDtype)` instead.\n",
+      "  if not hasattr(array, \"sparse\") and array.dtypes.apply(is_sparse).any():\n",
+      "D:\\Anaconda\\envs\\taiji\\Lib\\site-packages\\sklearn\\utils\\validation.py:605: FutureWarning: is_sparse is deprecated and will be removed in a future version. Check `isinstance(dtype, pd.SparseDtype)` instead.\n",
+      "  if is_sparse(pd_dtype):\n",
+      "D:\\Anaconda\\envs\\taiji\\Lib\\site-packages\\sklearn\\utils\\validation.py:614: FutureWarning: is_sparse is deprecated and will be removed in a future version. Check `isinstance(dtype, pd.SparseDtype)` instead.\n",
+      "  if is_sparse(pd_dtype) or not is_extension_array_dtype(pd_dtype):\n",
+      "D:\\Anaconda\\envs\\taiji\\Lib\\site-packages\\sklearn\\utils\\validation.py:605: FutureWarning: is_sparse is deprecated and will be removed in a future version. Check `isinstance(dtype, pd.SparseDtype)` instead.\n",
+      "  if is_sparse(pd_dtype):\n",
+      "D:\\Anaconda\\envs\\taiji\\Lib\\site-packages\\sklearn\\utils\\validation.py:614: FutureWarning: is_sparse is deprecated and will be removed in a future version. Check `isinstance(dtype, pd.SparseDtype)` instead.\n",
+      "  if is_sparse(pd_dtype) or not is_extension_array_dtype(pd_dtype):\n"
+     ]
+    },
+    {
+     "name": "stdout",
+     "output_type": "stream",
+     "text": [
+      "Accuracy: 0.9319374137137598\n"
+     ]
+    },
+    {
+     "name": "stderr",
+     "output_type": "stream",
+     "text": [
+      "D:\\Anaconda\\envs\\taiji\\Lib\\site-packages\\sklearn\\linear_model\\_logistic.py:458: ConvergenceWarning: lbfgs failed to converge (status=1):\n",
+      "STOP: TOTAL NO. of ITERATIONS REACHED LIMIT.\n",
+      "\n",
+      "Increase the number of iterations (max_iter) or scale the data as shown in:\n",
+      "    https://scikit-learn.org/stable/modules/preprocessing.html\n",
+      "Please also refer to the documentation for alternative solver options:\n",
+      "    https://scikit-learn.org/stable/modules/linear_model.html#logistic-regression\n",
+      "  n_iter_i = _check_optimize_result(\n",
+      "D:\\Anaconda\\envs\\taiji\\Lib\\site-packages\\sklearn\\utils\\validation.py:767: FutureWarning: is_sparse is deprecated and will be removed in a future version. Check `isinstance(dtype, pd.SparseDtype)` instead.\n",
+      "  if not hasattr(array, \"sparse\") and array.dtypes.apply(is_sparse).any():\n",
+      "D:\\Anaconda\\envs\\taiji\\Lib\\site-packages\\sklearn\\utils\\validation.py:605: FutureWarning: is_sparse is deprecated and will be removed in a future version. Check `isinstance(dtype, pd.SparseDtype)` instead.\n",
+      "  if is_sparse(pd_dtype):\n",
+      "D:\\Anaconda\\envs\\taiji\\Lib\\site-packages\\sklearn\\utils\\validation.py:614: FutureWarning: is_sparse is deprecated and will be removed in a future version. Check `isinstance(dtype, pd.SparseDtype)` instead.\n",
+      "  if is_sparse(pd_dtype) or not is_extension_array_dtype(pd_dtype):\n",
+      "D:\\Anaconda\\envs\\taiji\\Lib\\site-packages\\sklearn\\utils\\validation.py:605: FutureWarning: is_sparse is deprecated and will be removed in a future version. Check `isinstance(dtype, pd.SparseDtype)` instead.\n",
+      "  if is_sparse(pd_dtype):\n",
+      "D:\\Anaconda\\envs\\taiji\\Lib\\site-packages\\sklearn\\utils\\validation.py:614: FutureWarning: is_sparse is deprecated and will be removed in a future version. Check `isinstance(dtype, pd.SparseDtype)` instead.\n",
+      "  if is_sparse(pd_dtype) or not is_extension_array_dtype(pd_dtype):\n",
+      "D:\\Anaconda\\envs\\taiji\\Lib\\site-packages\\sklearn\\utils\\validation.py:605: FutureWarning: is_sparse is deprecated and will be removed in a future version. Check `isinstance(dtype, pd.SparseDtype)` instead.\n",
+      "  if is_sparse(pd_dtype):\n",
+      "D:\\Anaconda\\envs\\taiji\\Lib\\site-packages\\sklearn\\utils\\validation.py:614: FutureWarning: is_sparse is deprecated and will be removed in a future version. Check `isinstance(dtype, pd.SparseDtype)` instead.\n",
+      "  if is_sparse(pd_dtype) or not is_extension_array_dtype(pd_dtype):\n",
+      "D:\\Anaconda\\envs\\taiji\\Lib\\site-packages\\sklearn\\utils\\validation.py:605: FutureWarning: is_sparse is deprecated and will be removed in a future version. Check `isinstance(dtype, pd.SparseDtype)` instead.\n",
+      "  if is_sparse(pd_dtype):\n",
+      "D:\\Anaconda\\envs\\taiji\\Lib\\site-packages\\sklearn\\utils\\validation.py:614: FutureWarning: is_sparse is deprecated and will be removed in a future version. Check `isinstance(dtype, pd.SparseDtype)` instead.\n",
+      "  if is_sparse(pd_dtype) or not is_extension_array_dtype(pd_dtype):\n"
+     ]
+    }
+   ],
+   "source": [
+    "from sklearn.linear_model import LogisticRegression\n",
+    "from sklearn.metrics import accuracy_score\n",
+    "model = LogisticRegression()\n",
+    "model.fit(X_train, y_train)\n",
+    "y_pred = model.predict(X_test)\n",
+    "accuracy = accuracy_score(y_test, y_pred)\n",
+    "print(\"Accuracy:\", accuracy)"
+   ]
+  },
+  {
+   "cell_type": "markdown",
+   "metadata": {},
+   "source": [
+    "### Decision Tree\n",
+    "- 提示：<span style='color:white'>from sklearn.tree import DecisionTreeClassifier('Gender') </span>"
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": 12,
+   "metadata": {
+    "jupyter": {
+     "outputs_hidden": true
+    }
+   },
+   "outputs": [
+    {
+     "name": "stderr",
+     "output_type": "stream",
+     "text": [
+      "D:\\Anaconda\\envs\\taiji\\Lib\\site-packages\\sklearn\\utils\\validation.py:767: FutureWarning: is_sparse is deprecated and will be removed in a future version. Check `isinstance(dtype, pd.SparseDtype)` instead.\n",
+      "  if not hasattr(array, \"sparse\") and array.dtypes.apply(is_sparse).any():\n",
+      "D:\\Anaconda\\envs\\taiji\\Lib\\site-packages\\sklearn\\utils\\validation.py:605: FutureWarning: is_sparse is deprecated and will be removed in a future version. Check `isinstance(dtype, pd.SparseDtype)` instead.\n",
+      "  if is_sparse(pd_dtype):\n",
+      "D:\\Anaconda\\envs\\taiji\\Lib\\site-packages\\sklearn\\utils\\validation.py:614: FutureWarning: is_sparse is deprecated and will be removed in a future version. Check `isinstance(dtype, pd.SparseDtype)` instead.\n",
+      "  if is_sparse(pd_dtype) or not is_extension_array_dtype(pd_dtype):\n",
+      "D:\\Anaconda\\envs\\taiji\\Lib\\site-packages\\sklearn\\utils\\validation.py:605: FutureWarning: is_sparse is deprecated and will be removed in a future version. Check `isinstance(dtype, pd.SparseDtype)` instead.\n",
+      "  if is_sparse(pd_dtype):\n",
+      "D:\\Anaconda\\envs\\taiji\\Lib\\site-packages\\sklearn\\utils\\validation.py:614: FutureWarning: is_sparse is deprecated and will be removed in a future version. Check `isinstance(dtype, pd.SparseDtype)` instead.\n",
+      "  if is_sparse(pd_dtype) or not is_extension_array_dtype(pd_dtype):\n"
+     ]
+    },
+    {
+     "name": "stdout",
+     "output_type": "stream",
+     "text": [
+      "Accuracy: 0.8938794293603314\n"
+     ]
+    },
+    {
+     "name": "stderr",
+     "output_type": "stream",
+     "text": [
+      "D:\\Anaconda\\envs\\taiji\\Lib\\site-packages\\sklearn\\utils\\validation.py:767: FutureWarning: is_sparse is deprecated and will be removed in a future version. Check `isinstance(dtype, pd.SparseDtype)` instead.\n",
+      "  if not hasattr(array, \"sparse\") and array.dtypes.apply(is_sparse).any():\n",
+      "D:\\Anaconda\\envs\\taiji\\Lib\\site-packages\\sklearn\\utils\\validation.py:605: FutureWarning: is_sparse is deprecated and will be removed in a future version. Check `isinstance(dtype, pd.SparseDtype)` instead.\n",
+      "  if is_sparse(pd_dtype):\n",
+      "D:\\Anaconda\\envs\\taiji\\Lib\\site-packages\\sklearn\\utils\\validation.py:614: FutureWarning: is_sparse is deprecated and will be removed in a future version. Check `isinstance(dtype, pd.SparseDtype)` instead.\n",
+      "  if is_sparse(pd_dtype) or not is_extension_array_dtype(pd_dtype):\n",
+      "D:\\Anaconda\\envs\\taiji\\Lib\\site-packages\\sklearn\\utils\\validation.py:605: FutureWarning: is_sparse is deprecated and will be removed in a future version. Check `isinstance(dtype, pd.SparseDtype)` instead.\n",
+      "  if is_sparse(pd_dtype):\n",
+      "D:\\Anaconda\\envs\\taiji\\Lib\\site-packages\\sklearn\\utils\\validation.py:614: FutureWarning: is_sparse is deprecated and will be removed in a future version. Check `isinstance(dtype, pd.SparseDtype)` instead.\n",
+      "  if is_sparse(pd_dtype) or not is_extension_array_dtype(pd_dtype):\n",
+      "D:\\Anaconda\\envs\\taiji\\Lib\\site-packages\\sklearn\\utils\\validation.py:605: FutureWarning: is_sparse is deprecated and will be removed in a future version. Check `isinstance(dtype, pd.SparseDtype)` instead.\n",
+      "  if is_sparse(pd_dtype):\n",
+      "D:\\Anaconda\\envs\\taiji\\Lib\\site-packages\\sklearn\\utils\\validation.py:614: FutureWarning: is_sparse is deprecated and will be removed in a future version. Check `isinstance(dtype, pd.SparseDtype)` instead.\n",
+      "  if is_sparse(pd_dtype) or not is_extension_array_dtype(pd_dtype):\n",
+      "D:\\Anaconda\\envs\\taiji\\Lib\\site-packages\\sklearn\\utils\\validation.py:605: FutureWarning: is_sparse is deprecated and will be removed in a future version. Check `isinstance(dtype, pd.SparseDtype)` instead.\n",
+      "  if is_sparse(pd_dtype):\n",
+      "D:\\Anaconda\\envs\\taiji\\Lib\\site-packages\\sklearn\\utils\\validation.py:614: FutureWarning: is_sparse is deprecated and will be removed in a future version. Check `isinstance(dtype, pd.SparseDtype)` instead.\n",
+      "  if is_sparse(pd_dtype) or not is_extension_array_dtype(pd_dtype):\n"
+     ]
+    }
+   ],
+   "source": [
+    "from sklearn.tree import DecisionTreeClassifier\n",
+    "model = DecisionTreeClassifier(random_state=42)\n",
+    "model.fit(X_train, y_train)\n",
+    "y_pred = model.predict(X_test)\n",
+    "accuracy = accuracy_score(y_test, y_pred)\n",
+    "print(\"Accuracy:\", accuracy)"
+   ]
+  },
+  {
+   "cell_type": "markdown",
+   "metadata": {},
+   "source": [
+    "### Random Forest\n",
+    "- 提示：<span style='color:white'>from sklearn.ensemble import RandomForestClassifier('Gender') </span>"
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": 13,
+   "metadata": {
+    "jupyter": {
+     "outputs_hidden": true
+    }
+   },
+   "outputs": [
+    {
+     "name": "stderr",
+     "output_type": "stream",
+     "text": [
+      "D:\\Anaconda\\envs\\taiji\\Lib\\site-packages\\sklearn\\utils\\validation.py:767: FutureWarning: is_sparse is deprecated and will be removed in a future version. Check `isinstance(dtype, pd.SparseDtype)` instead.\n",
+      "  if not hasattr(array, \"sparse\") and array.dtypes.apply(is_sparse).any():\n",
+      "D:\\Anaconda\\envs\\taiji\\Lib\\site-packages\\sklearn\\utils\\validation.py:605: FutureWarning: is_sparse is deprecated and will be removed in a future version. Check `isinstance(dtype, pd.SparseDtype)` instead.\n",
+      "  if is_sparse(pd_dtype):\n",
+      "D:\\Anaconda\\envs\\taiji\\Lib\\site-packages\\sklearn\\utils\\validation.py:614: FutureWarning: is_sparse is deprecated and will be removed in a future version. Check `isinstance(dtype, pd.SparseDtype)` instead.\n",
+      "  if is_sparse(pd_dtype) or not is_extension_array_dtype(pd_dtype):\n",
+      "D:\\Anaconda\\envs\\taiji\\Lib\\site-packages\\sklearn\\utils\\validation.py:605: FutureWarning: is_sparse is deprecated and will be removed in a future version. Check `isinstance(dtype, pd.SparseDtype)` instead.\n",
+      "  if is_sparse(pd_dtype):\n",
+      "D:\\Anaconda\\envs\\taiji\\Lib\\site-packages\\sklearn\\utils\\validation.py:614: FutureWarning: is_sparse is deprecated and will be removed in a future version. Check `isinstance(dtype, pd.SparseDtype)` instead.\n",
+      "  if is_sparse(pd_dtype) or not is_extension_array_dtype(pd_dtype):\n",
+      "D:\\Anaconda\\envs\\taiji\\Lib\\site-packages\\sklearn\\utils\\validation.py:767: FutureWarning: is_sparse is deprecated and will be removed in a future version. Check `isinstance(dtype, pd.SparseDtype)` instead.\n",
+      "  if not hasattr(array, \"sparse\") and array.dtypes.apply(is_sparse).any():\n",
+      "D:\\Anaconda\\envs\\taiji\\Lib\\site-packages\\sklearn\\utils\\validation.py:605: FutureWarning: is_sparse is deprecated and will be removed in a future version. Check `isinstance(dtype, pd.SparseDtype)` instead.\n",
+      "  if is_sparse(pd_dtype):\n",
+      "D:\\Anaconda\\envs\\taiji\\Lib\\site-packages\\sklearn\\utils\\validation.py:614: FutureWarning: is_sparse is deprecated and will be removed in a future version. Check `isinstance(dtype, pd.SparseDtype)` instead.\n",
+      "  if is_sparse(pd_dtype) or not is_extension_array_dtype(pd_dtype):\n"
+     ]
+    },
+    {
+     "name": "stdout",
+     "output_type": "stream",
+     "text": [
+      "Accuracy: 0.9340082834790612\n"
+     ]
+    },
+    {
+     "name": "stderr",
+     "output_type": "stream",
+     "text": [
+      "D:\\Anaconda\\envs\\taiji\\Lib\\site-packages\\sklearn\\utils\\validation.py:605: FutureWarning: is_sparse is deprecated and will be removed in a future version. Check `isinstance(dtype, pd.SparseDtype)` instead.\n",
+      "  if is_sparse(pd_dtype):\n",
+      "D:\\Anaconda\\envs\\taiji\\Lib\\site-packages\\sklearn\\utils\\validation.py:614: FutureWarning: is_sparse is deprecated and will be removed in a future version. Check `isinstance(dtype, pd.SparseDtype)` instead.\n",
+      "  if is_sparse(pd_dtype) or not is_extension_array_dtype(pd_dtype):\n",
+      "D:\\Anaconda\\envs\\taiji\\Lib\\site-packages\\sklearn\\utils\\validation.py:605: FutureWarning: is_sparse is deprecated and will be removed in a future version. Check `isinstance(dtype, pd.SparseDtype)` instead.\n",
+      "  if is_sparse(pd_dtype):\n",
+      "D:\\Anaconda\\envs\\taiji\\Lib\\site-packages\\sklearn\\utils\\validation.py:614: FutureWarning: is_sparse is deprecated and will be removed in a future version. Check `isinstance(dtype, pd.SparseDtype)` instead.\n",
+      "  if is_sparse(pd_dtype) or not is_extension_array_dtype(pd_dtype):\n",
+      "D:\\Anaconda\\envs\\taiji\\Lib\\site-packages\\sklearn\\utils\\validation.py:605: FutureWarning: is_sparse is deprecated and will be removed in a future version. Check `isinstance(dtype, pd.SparseDtype)` instead.\n",
+      "  if is_sparse(pd_dtype):\n",
+      "D:\\Anaconda\\envs\\taiji\\Lib\\site-packages\\sklearn\\utils\\validation.py:614: FutureWarning: is_sparse is deprecated and will be removed in a future version. Check `isinstance(dtype, pd.SparseDtype)` instead.\n",
+      "  if is_sparse(pd_dtype) or not is_extension_array_dtype(pd_dtype):\n"
+     ]
+    }
+   ],
+   "source": [
+    "from sklearn.ensemble import RandomForestClassifier\n",
+    "model =RandomForestClassifier(random_state=42)\n",
+    "model.fit(X_train, y_train)\n",
+    "y_pred = model.predict(X_test)\n",
+    "accuracy = accuracy_score(y_test, y_pred)\n",
+    "print(\"Accuracy:\", accuracy)"
+   ]
+  },
+  {
+   "cell_type": "markdown",
+   "metadata": {},
+   "source": [
+    "### SVM\n",
+    "- 提示：<span style='color:white'>from sklearn.svm import SVC('Gender') </span>"
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": 23,
+   "metadata": {
+    "jupyter": {
+     "outputs_hidden": true
+    }
+   },
+   "outputs": [
+    {
+     "name": "stderr",
+     "output_type": "stream",
+     "text": [
+      "D:\\Anaconda\\envs\\taiji\\Lib\\site-packages\\sklearn\\utils\\validation.py:767: FutureWarning: is_sparse is deprecated and will be removed in a future version. Check `isinstance(dtype, pd.SparseDtype)` instead.\n",
+      "  if not hasattr(array, \"sparse\") and array.dtypes.apply(is_sparse).any():\n",
+      "D:\\Anaconda\\envs\\taiji\\Lib\\site-packages\\sklearn\\utils\\validation.py:605: FutureWarning: is_sparse is deprecated and will be removed in a future version. Check `isinstance(dtype, pd.SparseDtype)` instead.\n",
+      "  if is_sparse(pd_dtype):\n",
+      "D:\\Anaconda\\envs\\taiji\\Lib\\site-packages\\sklearn\\utils\\validation.py:614: FutureWarning: is_sparse is deprecated and will be removed in a future version. Check `isinstance(dtype, pd.SparseDtype)` instead.\n",
+      "  if is_sparse(pd_dtype) or not is_extension_array_dtype(pd_dtype):\n",
+      "D:\\Anaconda\\envs\\taiji\\Lib\\site-packages\\sklearn\\utils\\validation.py:605: FutureWarning: is_sparse is deprecated and will be removed in a future version. Check `isinstance(dtype, pd.SparseDtype)` instead.\n",
+      "  if is_sparse(pd_dtype):\n",
+      "D:\\Anaconda\\envs\\taiji\\Lib\\site-packages\\sklearn\\utils\\validation.py:614: FutureWarning: is_sparse is deprecated and will be removed in a future version. Check `isinstance(dtype, pd.SparseDtype)` instead.\n",
+      "  if is_sparse(pd_dtype) or not is_extension_array_dtype(pd_dtype):\n",
+      "D:\\Anaconda\\envs\\taiji\\Lib\\site-packages\\sklearn\\utils\\validation.py:767: FutureWarning: is_sparse is deprecated and will be removed in a future version. Check `isinstance(dtype, pd.SparseDtype)` instead.\n",
+      "  if not hasattr(array, \"sparse\") and array.dtypes.apply(is_sparse).any():\n",
+      "D:\\Anaconda\\envs\\taiji\\Lib\\site-packages\\sklearn\\utils\\validation.py:605: FutureWarning: is_sparse is deprecated and will be removed in a future version. Check `isinstance(dtype, pd.SparseDtype)` instead.\n",
+      "  if is_sparse(pd_dtype):\n",
+      "D:\\Anaconda\\envs\\taiji\\Lib\\site-packages\\sklearn\\utils\\validation.py:614: FutureWarning: is_sparse is deprecated and will be removed in a future version. Check `isinstance(dtype, pd.SparseDtype)` instead.\n",
+      "  if is_sparse(pd_dtype) or not is_extension_array_dtype(pd_dtype):\n"
+     ]
+    },
+    {
+     "name": "stdout",
+     "output_type": "stream",
+     "text": [
+      "Accuracy: 0.9317073170731708\n"
+     ]
+    },
+    {
+     "name": "stderr",
+     "output_type": "stream",
+     "text": [
+      "D:\\Anaconda\\envs\\taiji\\Lib\\site-packages\\sklearn\\utils\\validation.py:605: FutureWarning: is_sparse is deprecated and will be removed in a future version. Check `isinstance(dtype, pd.SparseDtype)` instead.\n",
+      "  if is_sparse(pd_dtype):\n",
+      "D:\\Anaconda\\envs\\taiji\\Lib\\site-packages\\sklearn\\utils\\validation.py:614: FutureWarning: is_sparse is deprecated and will be removed in a future version. Check `isinstance(dtype, pd.SparseDtype)` instead.\n",
+      "  if is_sparse(pd_dtype) or not is_extension_array_dtype(pd_dtype):\n",
+      "D:\\Anaconda\\envs\\taiji\\Lib\\site-packages\\sklearn\\utils\\validation.py:605: FutureWarning: is_sparse is deprecated and will be removed in a future version. Check `isinstance(dtype, pd.SparseDtype)` instead.\n",
+      "  if is_sparse(pd_dtype):\n",
+      "D:\\Anaconda\\envs\\taiji\\Lib\\site-packages\\sklearn\\utils\\validation.py:614: FutureWarning: is_sparse is deprecated and will be removed in a future version. Check `isinstance(dtype, pd.SparseDtype)` instead.\n",
+      "  if is_sparse(pd_dtype) or not is_extension_array_dtype(pd_dtype):\n",
+      "D:\\Anaconda\\envs\\taiji\\Lib\\site-packages\\sklearn\\utils\\validation.py:605: FutureWarning: is_sparse is deprecated and will be removed in a future version. Check `isinstance(dtype, pd.SparseDtype)` instead.\n",
+      "  if is_sparse(pd_dtype):\n",
+      "D:\\Anaconda\\envs\\taiji\\Lib\\site-packages\\sklearn\\utils\\validation.py:614: FutureWarning: is_sparse is deprecated and will be removed in a future version. Check `isinstance(dtype, pd.SparseDtype)` instead.\n",
+      "  if is_sparse(pd_dtype) or not is_extension_array_dtype(pd_dtype):\n"
+     ]
+    }
+   ],
+   "source": [
+    "from sklearn.svm import SVC\n",
+    "model = SVC(random_state=42)\n",
+    "model.fit(X_train, y_train)\n",
+    "y_pred = model.predict(X_test)\n",
+    "accuracy = accuracy_score(y_test, y_pred)\n",
+    "print(\"Accuracy:\", accuracy)"
+   ]
+  },
+  {
+   "cell_type": "markdown",
+   "metadata": {},
+   "source": [
+    "### KNN\n",
+    "- 提示：<span style='color:white'>from sklearn.neighbors import KNeighborsClassifier('Gender') </span>"
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": 15,
+   "metadata": {
+    "jupyter": {
+     "outputs_hidden": true
+    }
+   },
+   "outputs": [
+    {
+     "name": "stderr",
+     "output_type": "stream",
+     "text": [
+      "D:\\Anaconda\\envs\\taiji\\Lib\\site-packages\\sklearn\\utils\\validation.py:767: FutureWarning: is_sparse is deprecated and will be removed in a future version. Check `isinstance(dtype, pd.SparseDtype)` instead.\n",
+      "  if not hasattr(array, \"sparse\") and array.dtypes.apply(is_sparse).any():\n",
+      "D:\\Anaconda\\envs\\taiji\\Lib\\site-packages\\sklearn\\utils\\validation.py:605: FutureWarning: is_sparse is deprecated and will be removed in a future version. Check `isinstance(dtype, pd.SparseDtype)` instead.\n",
+      "  if is_sparse(pd_dtype):\n",
+      "D:\\Anaconda\\envs\\taiji\\Lib\\site-packages\\sklearn\\utils\\validation.py:614: FutureWarning: is_sparse is deprecated and will be removed in a future version. Check `isinstance(dtype, pd.SparseDtype)` instead.\n",
+      "  if is_sparse(pd_dtype) or not is_extension_array_dtype(pd_dtype):\n",
+      "D:\\Anaconda\\envs\\taiji\\Lib\\site-packages\\sklearn\\utils\\validation.py:605: FutureWarning: is_sparse is deprecated and will be removed in a future version. Check `isinstance(dtype, pd.SparseDtype)` instead.\n",
+      "  if is_sparse(pd_dtype):\n",
+      "D:\\Anaconda\\envs\\taiji\\Lib\\site-packages\\sklearn\\utils\\validation.py:614: FutureWarning: is_sparse is deprecated and will be removed in a future version. Check `isinstance(dtype, pd.SparseDtype)` instead.\n",
+      "  if is_sparse(pd_dtype) or not is_extension_array_dtype(pd_dtype):\n",
+      "D:\\Anaconda\\envs\\taiji\\Lib\\site-packages\\sklearn\\utils\\validation.py:767: FutureWarning: is_sparse is deprecated and will be removed in a future version. Check `isinstance(dtype, pd.SparseDtype)` instead.\n",
+      "  if not hasattr(array, \"sparse\") and array.dtypes.apply(is_sparse).any():\n",
+      "D:\\Anaconda\\envs\\taiji\\Lib\\site-packages\\sklearn\\utils\\validation.py:605: FutureWarning: is_sparse is deprecated and will be removed in a future version. Check `isinstance(dtype, pd.SparseDtype)` instead.\n",
+      "  if is_sparse(pd_dtype):\n",
+      "D:\\Anaconda\\envs\\taiji\\Lib\\site-packages\\sklearn\\utils\\validation.py:614: FutureWarning: is_sparse is deprecated and will be removed in a future version. Check `isinstance(dtype, pd.SparseDtype)` instead.\n",
+      "  if is_sparse(pd_dtype) or not is_extension_array_dtype(pd_dtype):\n"
+     ]
+    },
+    {
+     "name": "stdout",
+     "output_type": "stream",
+     "text": [
+      "Accuracy: 0.9267832489645651\n"
+     ]
+    },
+    {
+     "name": "stderr",
+     "output_type": "stream",
+     "text": [
+      "D:\\Anaconda\\envs\\taiji\\Lib\\site-packages\\sklearn\\utils\\validation.py:605: FutureWarning: is_sparse is deprecated and will be removed in a future version. Check `isinstance(dtype, pd.SparseDtype)` instead.\n",
+      "  if is_sparse(pd_dtype):\n",
+      "D:\\Anaconda\\envs\\taiji\\Lib\\site-packages\\sklearn\\utils\\validation.py:614: FutureWarning: is_sparse is deprecated and will be removed in a future version. Check `isinstance(dtype, pd.SparseDtype)` instead.\n",
+      "  if is_sparse(pd_dtype) or not is_extension_array_dtype(pd_dtype):\n",
+      "D:\\Anaconda\\envs\\taiji\\Lib\\site-packages\\sklearn\\utils\\validation.py:605: FutureWarning: is_sparse is deprecated and will be removed in a future version. Check `isinstance(dtype, pd.SparseDtype)` instead.\n",
+      "  if is_sparse(pd_dtype):\n",
+      "D:\\Anaconda\\envs\\taiji\\Lib\\site-packages\\sklearn\\utils\\validation.py:614: FutureWarning: is_sparse is deprecated and will be removed in a future version. Check `isinstance(dtype, pd.SparseDtype)` instead.\n",
+      "  if is_sparse(pd_dtype) or not is_extension_array_dtype(pd_dtype):\n",
+      "D:\\Anaconda\\envs\\taiji\\Lib\\site-packages\\sklearn\\utils\\validation.py:605: FutureWarning: is_sparse is deprecated and will be removed in a future version. Check `isinstance(dtype, pd.SparseDtype)` instead.\n",
+      "  if is_sparse(pd_dtype):\n",
+      "D:\\Anaconda\\envs\\taiji\\Lib\\site-packages\\sklearn\\utils\\validation.py:614: FutureWarning: is_sparse is deprecated and will be removed in a future version. Check `isinstance(dtype, pd.SparseDtype)` instead.\n",
+      "  if is_sparse(pd_dtype) or not is_extension_array_dtype(pd_dtype):\n"
+     ]
+    }
+   ],
+   "source": [
+    "from sklearn.neighbors import KNeighborsClassifier\n",
+    "model =  KNeighborsClassifier(3)\n",
+    "model.fit(X_train, y_train)\n",
+    "y_pred = model.predict(X_test)\n",
+    "accuracy = accuracy_score(y_test, y_pred)\n",
+    "print(\"Accuracy:\", accuracy)"
+   ]
+  },
+  {
+   "cell_type": "markdown",
+   "metadata": {},
+   "source": [
+    "---\n",
+    "\n",
+    "## 练习3：在测试集上进行预测，计算准确度"
+   ]
+  },
+  {
+   "cell_type": "markdown",
+   "metadata": {},
+   "source": [
+    "### Logistic regression\n",
+    "- 提示：<span style='color:white'>y_pred_LR = clf_LR.predict(x_test)('Gender') </span>"
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": 16,
+   "metadata": {
+    "jupyter": {
+     "outputs_hidden": true
+    }
+   },
+   "outputs": [],
+   "source": [
+    "from sklearn.linear_model import LogisticRegression\n",
+    "from sklearn.metrics import accuracy_score\n",
+    "model = LogisticRegression()\n",
+    "model.fit(X_train, y_train)\n",
+    "y_pred = model.predict(X_test)\n",
+    "accuracy = accuracy_score(y_test, y_pred)\n",
+    "print(\"Accuracy:\", accuracy)"
+   ]
+  },
+  {
+   "cell_type": "markdown",
+   "metadata": {},
+   "source": [
+    "### Decision Tree\n",
+    "- 提示：<span style='color:white'>y_pred_tree = tree.predict(x_test)('Gender') </span>"
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": 17,
+   "metadata": {
+    "jupyter": {
+     "outputs_hidden": true
+    }
+   },
+   "outputs": [],
+   "source": [
+    "from sklearn.tree import DecisionTreeClassifier\n",
+    "model = DecisionTreeClassifier(random_state=42)\n",
+    "model.fit(X_train, y_train)\n",
+    "y_pred = model.predict(X_test)\n",
+    "accuracy = accuracy_score(y_test, y_pred)\n",
+    "print(\"Accuracy:\", accuracy)"
+   ]
+  },
+  {
+   "cell_type": "markdown",
+   "metadata": {},
+   "source": [
+    "### Random Forest\n",
+    "- 提示：<span style='color:white'>y_pred_forest = forest.predict(x_test)('Gender') </span>"
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": 18,
+   "metadata": {
+    "jupyter": {
+     "outputs_hidden": true
+    }
+   },
+   "outputs": [],
+   "source": [
+    "from sklearn.ensemble import RandomForestClassifier\n",
+    "model =RandomForestClassifier(random_state=42)\n",
+    "model.fit(X_train, y_train)\n",
+    "y_pred = model.predict(X_test)\n",
+    "accuracy = accuracy_score(y_test, y_pred)\n",
+    "print(\"Accuracy:\", accuracy)"
+   ]
+  },
+  {
+   "cell_type": "markdown",
+   "metadata": {},
+   "source": [
+    "### SVM\n",
+    "- 提示：<span style='color:white'>y_pred_SVC = clf_svc.predict(x_test)('Gender') </span>"
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": 19,
+   "metadata": {
+    "jupyter": {
+     "outputs_hidden": true
+    }
+   },
+   "outputs": [],
+   "source": [
+    "from sklearn.svm import SVC\n",
+    "model = SVC(random_state=42)\n",
+    "model.fit(X_train, y_train)\n",
+    "y_pred = model.predict(X_test)\n",
+    "accuracy = accuracy_score(y_test, y_pred)\n",
+    "print(\"Accuracy:\", accuracy)"
+   ]
+  },
+  {
+   "cell_type": "markdown",
+   "metadata": {},
+   "source": [
+    "### KNN\n",
+    "- 提示：<span style='color:white'>y_pred_KNN = neigh.predict(x_test)('Gender') </span>"
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": 20,
+   "metadata": {
+    "jupyter": {
+     "outputs_hidden": true
+    }
+   },
+   "outputs": [],
+   "source": [
+    "from sklearn.neighbors import KNeighborsClassifier\n",
+    "model =  KNeighborsClassifier(3)\n",
+    "model.fit(X_train, y_train)\n",
+    "y_pred = model.predict(X_test)\n",
+    "accuracy = accuracy_score(y_test, y_pred)\n",
+    "print(\"Accuracy:\", accuracy)"
+   ]
+  },
+  {
+   "cell_type": "markdown",
+   "metadata": {},
+   "source": [
+    "---\n",
+    "## 练习4：查看sklearn的官方说明，了解分类问题的评估标准，并对此例进行评估"
+   ]
+  },
+  {
+   "cell_type": "markdown",
+   "metadata": {},
+   "source": [
+    "**混淆矩阵（Confusion Matrix）相关学习链接**\n",
+    "\n",
+    "- Blog:<br>\n",
+    "http://blog.csdn.net/vesper305/article/details/44927047<br>\n",
+    "- WiKi:<br>\n",
+    "http://en.wikipedia.org/wiki/Confusion_matrix<br>\n",
+    "- sklearn doc:<br>\n",
+    "http://scikit-learn.org/stable/modules/generated/sklearn.metrics.confusion_matrix.html"
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": 25,
+   "metadata": {
+    "jupyter": {
+     "outputs_hidden": true
+    }
+   },
+   "outputs": [
+    {
+     "name": "stderr",
+     "output_type": "stream",
+     "text": [
+      "D:\\Anaconda\\envs\\taiji\\Lib\\site-packages\\sklearn\\utils\\validation.py:767: FutureWarning: is_sparse is deprecated and will be removed in a future version. Check `isinstance(dtype, pd.SparseDtype)` instead.\n",
+      "  if not hasattr(array, \"sparse\") and array.dtypes.apply(is_sparse).any():\n",
+      "D:\\Anaconda\\envs\\taiji\\Lib\\site-packages\\sklearn\\utils\\validation.py:605: FutureWarning: is_sparse is deprecated and will be removed in a future version. Check `isinstance(dtype, pd.SparseDtype)` instead.\n",
+      "  if is_sparse(pd_dtype):\n",
+      "D:\\Anaconda\\envs\\taiji\\Lib\\site-packages\\sklearn\\utils\\validation.py:614: FutureWarning: is_sparse is deprecated and will be removed in a future version. Check `isinstance(dtype, pd.SparseDtype)` instead.\n",
+      "  if is_sparse(pd_dtype) or not is_extension_array_dtype(pd_dtype):\n",
+      "D:\\Anaconda\\envs\\taiji\\Lib\\site-packages\\sklearn\\utils\\validation.py:605: FutureWarning: is_sparse is deprecated and will be removed in a future version. Check `isinstance(dtype, pd.SparseDtype)` instead.\n",
+      "  if is_sparse(pd_dtype):\n",
+      "D:\\Anaconda\\envs\\taiji\\Lib\\site-packages\\sklearn\\utils\\validation.py:614: FutureWarning: is_sparse is deprecated and will be removed in a future version. Check `isinstance(dtype, pd.SparseDtype)` instead.\n",
+      "  if is_sparse(pd_dtype) or not is_extension_array_dtype(pd_dtype):\n",
+      "D:\\Anaconda\\envs\\taiji\\Lib\\site-packages\\sklearn\\utils\\validation.py:767: FutureWarning: is_sparse is deprecated and will be removed in a future version. Check `isinstance(dtype, pd.SparseDtype)` instead.\n",
+      "  if not hasattr(array, \"sparse\") and array.dtypes.apply(is_sparse).any():\n",
+      "D:\\Anaconda\\envs\\taiji\\Lib\\site-packages\\sklearn\\utils\\validation.py:605: FutureWarning: is_sparse is deprecated and will be removed in a future version. Check `isinstance(dtype, pd.SparseDtype)` instead.\n",
+      "  if is_sparse(pd_dtype):\n",
+      "D:\\Anaconda\\envs\\taiji\\Lib\\site-packages\\sklearn\\utils\\validation.py:614: FutureWarning: is_sparse is deprecated and will be removed in a future version. Check `isinstance(dtype, pd.SparseDtype)` instead.\n",
+      "  if is_sparse(pd_dtype) or not is_extension_array_dtype(pd_dtype):\n",
+      "D:\\Anaconda\\envs\\taiji\\Lib\\site-packages\\sklearn\\utils\\validation.py:605: FutureWarning: is_sparse is deprecated and will be removed in a future version. Check `isinstance(dtype, pd.SparseDtype)` instead.\n",
+      "  if is_sparse(pd_dtype):\n",
+      "D:\\Anaconda\\envs\\taiji\\Lib\\site-packages\\sklearn\\utils\\validation.py:614: FutureWarning: is_sparse is deprecated and will be removed in a future version. Check `isinstance(dtype, pd.SparseDtype)` instead.\n",
+      "  if is_sparse(pd_dtype) or not is_extension_array_dtype(pd_dtype):\n",
+      "D:\\Anaconda\\envs\\taiji\\Lib\\site-packages\\sklearn\\utils\\validation.py:605: FutureWarning: is_sparse is deprecated and will be removed in a future version. Check `isinstance(dtype, pd.SparseDtype)` instead.\n",
+      "  if is_sparse(pd_dtype):\n",
+      "D:\\Anaconda\\envs\\taiji\\Lib\\site-packages\\sklearn\\utils\\validation.py:614: FutureWarning: is_sparse is deprecated and will be removed in a future version. Check `isinstance(dtype, pd.SparseDtype)` instead.\n",
+      "  if is_sparse(pd_dtype) or not is_extension_array_dtype(pd_dtype):\n",
+      "D:\\Anaconda\\envs\\taiji\\Lib\\site-packages\\sklearn\\utils\\validation.py:605: FutureWarning: is_sparse is deprecated and will be removed in a future version. Check `isinstance(dtype, pd.SparseDtype)` instead.\n",
+      "  if is_sparse(pd_dtype):\n",
+      "D:\\Anaconda\\envs\\taiji\\Lib\\site-packages\\sklearn\\utils\\validation.py:614: FutureWarning: is_sparse is deprecated and will be removed in a future version. Check `isinstance(dtype, pd.SparseDtype)` instead.\n",
+      "  if is_sparse(pd_dtype) or not is_extension_array_dtype(pd_dtype):\n",
+      "D:\\Anaconda\\envs\\taiji\\Lib\\site-packages\\sklearn\\utils\\validation.py:605: FutureWarning: is_sparse is deprecated and will be removed in a future version. Check `isinstance(dtype, pd.SparseDtype)` instead.\n",
+      "  if is_sparse(pd_dtype):\n",
+      "D:\\Anaconda\\envs\\taiji\\Lib\\site-packages\\sklearn\\utils\\validation.py:614: FutureWarning: is_sparse is deprecated and will be removed in a future version. Check `isinstance(dtype, pd.SparseDtype)` instead.\n",
+      "  if is_sparse(pd_dtype) or not is_extension_array_dtype(pd_dtype):\n",
+      "D:\\Anaconda\\envs\\taiji\\Lib\\site-packages\\sklearn\\utils\\validation.py:605: FutureWarning: is_sparse is deprecated and will be removed in a future version. Check `isinstance(dtype, pd.SparseDtype)` instead.\n",
+      "  if is_sparse(pd_dtype):\n",
+      "D:\\Anaconda\\envs\\taiji\\Lib\\site-packages\\sklearn\\utils\\validation.py:614: FutureWarning: is_sparse is deprecated and will be removed in a future version. Check `isinstance(dtype, pd.SparseDtype)` instead.\n",
+      "  if is_sparse(pd_dtype) or not is_extension_array_dtype(pd_dtype):\n",
+      "D:\\Anaconda\\envs\\taiji\\Lib\\site-packages\\sklearn\\utils\\validation.py:605: FutureWarning: is_sparse is deprecated and will be removed in a future version. Check `isinstance(dtype, pd.SparseDtype)` instead.\n",
+      "  if is_sparse(pd_dtype):\n",
+      "D:\\Anaconda\\envs\\taiji\\Lib\\site-packages\\sklearn\\utils\\validation.py:614: FutureWarning: is_sparse is deprecated and will be removed in a future version. Check `isinstance(dtype, pd.SparseDtype)` instead.\n",
+      "  if is_sparse(pd_dtype) or not is_extension_array_dtype(pd_dtype):\n"
+     ]
+    },
+    {
+     "name": "stdout",
+     "output_type": "stream",
+     "text": [
+      "Accuracy: 0.9267832489645651\n"
+     ]
+    },
+    {
+     "data": {
+      "image/png": "iVBORw0KGgoAAAANSUhEUgAAAi8AAAHFCAYAAAA64xk9AAAAOXRFWHRTb2Z0d2FyZQBNYXRwbG90bGliIHZlcnNpb24zLjguMCwgaHR0cHM6Ly9tYXRwbG90bGliLm9yZy81sbWrAAAACXBIWXMAAA9hAAAPYQGoP6dpAAArHklEQVR4nO3deViVdf7/8dcRWUVQwA0C1zS0UsQ0LXO3yEzGJi0tRRFzqbSMHHMUzUrl24yWua9pjss3l1FLv1naZpi7pZEz5oKWjFtqIhLL/fvDn2c8AnqQg/Ch5+O6uGbOfd/nvt83V+az+9znHJtlWZYAAAAMUaa4BwAAACgI4gUAABiFeAEAAEYhXgAAgFGIFwAAYBTiBQAAGIV4AQAARiFeAACAUYgXAABgFOIFMMB3332nPn36qGbNmvLy8pKvr68aN26sxMREnT17tkiPvXv3brVq1Ur+/v6y2WyaPHmyy49hs9k0ZswYl+/3ZhYsWCCbzSabzabPP/8813rLslSnTh3ZbDa1bt36lo4xbdo0LViwoEDP+fzzz/OdCYBUtrgHAHBjs2fP1qBBg1SvXj3Fx8erfv36yszM1I4dOzRjxgwlJSVp1apVRXb8vn37Ki0tTUuXLlXFihVVo0YNlx8jKSlJd9xxh8v366zy5ctr7ty5uQLliy++0E8//aTy5cvf8r6nTZumoKAgxcTEOP2cxo0bKykpSfXr17/l4wKlGfEClGBJSUkaOHCgOnTooNWrV8vT09O+rkOHDho2bJg2bNhQpDPs27dPcXFxioqKKrJj3H///UW2b2d0795dixcv1tSpU+Xn52dfPnfuXDVv3lwXLly4LXNkZmbKZrPJz8+v2H8nQEnGy0ZACfbWW2/JZrNp1qxZDuFylYeHhx5//HH745ycHCUmJuquu+6Sp6enKleurF69eun48eMOz2vdurXuvvtubd++XS1btpSPj49q1aqlCRMmKCcnR9J/X1LJysrS9OnT7S+vSNKYMWPs//9aV59z5MgR+7JNmzapdevWCgwMlLe3t8LCwvTEE0/o0qVL9m3yetlo37596tKliypWrCgvLy81atRI77//vsM2V19eWbJkiUaOHKng4GD5+fmpffv2OnDggHO/ZElPP/20JGnJkiX2ZefPn9eKFSvUt2/fPJ8zduxYNWvWTAEBAfLz81Pjxo01d+5cXftdtzVq1ND+/fv1xRdf2H9/V69cXZ190aJFGjZsmEJCQuTp6amDBw/metno9OnTCg0NVYsWLZSZmWnf/w8//KBy5crp2WefdfpcgdKAeAFKqOzsbG3atEmRkZEKDQ116jkDBw7U8OHD1aFDB61Zs0bjxo3Thg0b1KJFC50+fdph29TUVPXs2VPPPPOM1qxZo6ioKI0YMUIffPCBJKlTp05KSkqSJP35z39WUlKS/bGzjhw5ok6dOsnDw0Pz5s3Thg0bNGHCBJUrV06///57vs87cOCAWrRoof379+vdd9/VypUrVb9+fcXExCgxMTHX9q+99pqOHj2qOXPmaNasWfr3v/+tzp07Kzs726k5/fz89Oc//1nz5s2zL1uyZInKlCmj7t2753tuzz33nJYvX66VK1eqa9eueuGFFzRu3Dj7NqtWrVKtWrUUERFh//1d/xLfiBEjlJKSohkzZmjt2rWqXLlyrmMFBQVp6dKl2r59u4YPHy5JunTpkp588kmFhYVpxowZTp0nUGpYAEqk1NRUS5L11FNPObV9cnKyJckaNGiQw/Jvv/3WkmS99tpr9mWtWrWyJFnffvutw7b169e3Hn74YYdlkqzBgwc7LEtISLDy+tfH/PnzLUnW4cOHLcuyrA8//NCSZO3Zs+eGs0uyEhIS7I+feuopy9PT00pJSXHYLioqyvLx8bHOnTtnWZZlbd682ZJkPfroow7bLV++3JJkJSUl3fC4V+fdvn27fV/79u2zLMuy7rvvPismJsayLMtq0KCB1apVq3z3k52dbWVmZlqvv/66FRgYaOXk5NjX5ffcq8d76KGH8l23efNmh+UTJ060JFmrVq2yevfubXl7e1vffffdDc8RKI248gKUEps3b5akXDeGNm3aVOHh4frss88clletWlVNmzZ1WHbvvffq6NGjLpupUaNG8vDwUP/+/fX+++/r0KFDTj1v06ZNateuXa4rTjExMbp06VKuK0DXvnQmXTkPSQU6l1atWql27dqaN2+evv/+e23fvj3fl4yuzti+fXv5+/vLzc1N7u7uGj16tM6cOaOTJ086fdwnnnjC6W3j4+PVqVMnPf3003r//fc1ZcoU3XPPPU4/HygtiBeghAoKCpKPj48OHz7s1PZnzpyRJFWrVi3XuuDgYPv6qwIDA3Nt5+npqfT09FuYNm+1a9fWp59+qsqVK2vw4MGqXbu2ateurXfeeeeGzztz5ky+53F1/bWuP5er9wcV5FxsNpv69OmjDz74QDNmzFDdunXVsmXLPLfdtm2bOnbsKOnKu8G2bNmi7du3a+TIkQU+bl7neaMZY2JidPnyZVWtWpV7XfCHRbwAJZSbm5vatWunnTt35rrhNi9X/wI/ceJErnW//PKLgoKCXDabl5eXJCkjI8Nh+fX31UhSy5YttXbtWp0/f15bt25V8+bNNXToUC1dujTf/QcGBuZ7HpJcei7XiomJ0enTpzVjxgz16dMn3+2WLl0qd3d3rVu3Tt26dVOLFi3UpEmTWzpmXjc+5+fEiRMaPHiwGjVqpDNnzuiVV165pWMCpiNegBJsxIgRsixLcXFxed7gmpmZqbVr10qS2rZtK0n2G26v2r59u5KTk9WuXTuXzXX1HTPfffedw/Krs+TFzc1NzZo109SpUyVJu3btynfbdu3aadOmTfZYuWrhwoXy8fEpsrcRh4SEKD4+Xp07d1bv3r3z3c5ms6ls2bJyc3OzL0tPT9eiRYtybeuqq1nZ2dl6+umnZbPZtH79eo0fP15TpkzRypUrC71vwDR8zgtQgjVv3lzTp0/XoEGDFBkZqYEDB6pBgwbKzMzU7t27NWvWLN19993q3Lmz6tWrp/79+2vKlCkqU6aMoqKidOTIEY0aNUqhoaF66aWXXDbXo48+qoCAAMXGxur1119X2bJltWDBAh07dsxhuxkzZmjTpk3q1KmTwsLCdPnyZfs7etq3b5/v/hMSErRu3Tq1adNGo0ePVkBAgBYvXqyPPvpIiYmJ8vf3d9m5XG/ChAk33aZTp076+9//rh49eqh///46c+aM3n777Tzfzn7PPfdo6dKlWrZsmWrVqiUvL69buk8lISFBX331lT755BNVrVpVw4YN0xdffKHY2FhFRESoZs2aBd4nYCriBSjh4uLi1LRpU02aNEkTJ05Uamqq3N3dVbduXfXo0UPPP/+8fdvp06erdu3amjt3rqZOnSp/f3898sgjGj9+fJ73uNwqPz8/bdiwQUOHDtUzzzyjChUqqF+/foqKilK/fv3s2zVq1EiffPKJEhISlJqaKl9fX919991as2aN/Z6RvNSrV0/ffPONXnvtNQ0ePFjp6ekKDw/X/PnzC/RJtUWlbdu2mjdvniZOnKjOnTsrJCREcXFxqly5smJjYx22HTt2rE6cOKG4uDj99ttvql69usPn4Dhj48aNGj9+vEaNGuVwBW3BggWKiIhQ9+7d9fXXX8vDw8MVpweUeDbLuuYTlQAAAEo47nkBAABGIV4AAIBRiBcAAGAU4gUAABiFeAEAAEYhXgAAgFGIFwAAYJRS+SF13hHP33wjAEY6u+294h4BQBHxdnduO668AAAAoxAvAADAKMQLAAAwCvECAACMQrwAAACjEC8AAMAoxAsAADAK8QIAAIxCvAAAAKMQLwAAwCjECwAAMArxAgAAjEK8AAAAoxAvAADAKMQLAAAwCvECAACMQrwAAACjEC8AAMAoxAsAADAK8QIAAIxCvAAAAKMQLwAAwCjECwAAMArxAgAAjEK8AAAAoxAvAADAKMQLAAAwCvECAACMQrwAAACjEC8AAMAoxAsAADAK8QIAAIxCvAAAAKMQLwAAwCjECwAAMArxAgAAjEK8AAAAoxAvAADAKMQLAAAwCvECAACMQrwAAACjEC8AAMAoxAsAADAK8QIAAIxCvAAAAKMQLwAAwCjECwAAMArxAgAAjEK8AAAAoxAvAADAKMQLAAAwCvECAACMQrwAAACjEC8AAMAoxAsAADAK8QIAAIxCvAAAAKMQLwAAwCjECwAAMArxAgAAjEK8AAAAoxAvAADAKMQLAAAwCvECAACMQrwAAACjEC8AAMAoxAsAADAK8QIAAIxCvAAAAKMQLwAAwCjECwAAMArxAgAAjEK8AAAAoxAvAADAKMQLAAAwCvECAACMQrwAAACjEC8AAMAoxAsAADAK8QIAAIxCvAAAAKMQLwAAwCjECwAAMArxAgAAjEK8AAAAoxAvAADAKGWLewD8sbzSt6Oi2zZU3RpVlJ6RqW/3HtLId/6pfx896bDdyOceVewTD6hCeW9t33dUQ8cvU/KhVPt6D/eymvDyn/Tkw5Hy9nLX5m3/0tC3lunnk+ckSS0j79Qnc4bkOcODPRO184cUh2UB/uW0bdlfFFKloqq2jNf5i+muPXEAdjt3bNf78+cq+Yd9OnXqlP7+zlS1bdfevr7R3fXyfN7Ql+MV07efzp8/p+lTpyjpm6/1n9RUVahQUW3attegF4aofPnyt+s0UIyIF9xWLRvX0YxlX2rn/qMqW9ZNYwZ31rrpzyui6xu6dPl3SdKwmPZ68Zk26p/wgf599KT+EveIPprxgu6Nfl0XL2VIkv4n/gl1euhu9RoxX2fPpWnCy3/SincHqEWPicrJsbR17yHVaD/C4dijBz2mts3q5QoXSZqR0EPf//sXhVSpWPS/BOAPLj39kurWq6cu0V017KUXcq3/9POvHR5//dWXGjt6pNp3eFiSdOrkSZ06eVIvvzJctWrV0YkTP+uN18fo1KmTenvSu7fjFFDMiBfcVl2en+bw+LkxH+jYpgmKqB+qLbt+kiQN7tFGiXP/T//ctFeS1G/UIh397C11j2qiuSu2yM/XSzHRzRX714Xa/O0BSVLfvy7Uv9ePU9tmd+nTpGRlZmXrP2d+sx+nbNky6tTqHs1Y9mWumeKefFD+5X301qz1euTBBkV16gD+vwdbttKDLVvluz4oqJLD4883f6b7mjbTHaGhkqQ6d9bV3yZPsa8PDQvT8y8O1ci/xCsrK0tly/JXW2lXrPe8HD9+XCNHjlSbNm0UHh6u+vXrq02bNho5cqSOHTtWnKPhNvHz9ZIk/Xr+kiSpRkigqlXy16dJP9q3+T0zS1/tPKj7G9aSJEWEh8nDvaw+TUq2b3Pi1Hnt/+kX3d+wZp7HeazVvQqq4KsP1mx1WH5XraoaERelfqMWKifHcum5ASi8M6dP6+svv1B01z/fcLuLv12Ur68v4fIHUWzx8vXXXys8PFyrVq1Sw4YN1atXLz3zzDNq2LChVq9erQYNGmjLli3FNR5uk4nDntCWXQf1w08nJElVg/wkSSfP/uaw3ckzv6lK4JV1VQP9lPF7ps79lp7vNtfrHd1cG5OSdfw/5+zLPNzL6v3xMXpt8modS/3VVacEwIXWrFklH59yate+Y77bnDv3q2bPnKYnnux+GydDcSq2RH3ppZfUr18/TZo0Kd/1Q4cO1fbt22+4n4yMDGVkZDgss3KyZSvj5rJZUTQm/aWb7rkzWO365P5nwLIcr4LYbLmXXc9msymvLUIqV1CH5uF6Zvg8h+XjXnxcBw7/R0s/vvE/YwCKzz9XrdCjj3WWp6dnnusvXryoFwY9p1q1a+u5gc/f5ulQXIrtysu+ffs0YMCAfNc/99xz2rdv3033M378ePn7+zv8ZP1npytHRRH4+/An9Vire/Rw3Lv2dwhJUurpC5KU6wpKpYDy9qsxqWcuyNPDXRXKe1+3ja9OnrmQ61jPdrlfZ86nad0X3zksb3VfXXVtH6Hftr+j37a/o/Uzr9w4eHzzBP11wKOFPkcAhbNr5w4dOXxYf+r6ZJ7r09IuatBz/eTj46O/vzNV7u7ut3lCFJdii5dq1arpm2++yXd9UlKSqlWrdtP9jBgxQufPn3f4KVsl0pWjwsUmDX9SXdo21CPPvaujv5xxWHfk5zM6ceq82t1/l32Ze1k3tYyso617D0mSdien6PfMLIdtqgb5qUHtYG3dezjX8Xo9fr/+sW6bsrJyHJY//cocNe0+Xs2emqBmT03QwNf/IUlqHztZM/O4sRfA7bVq5YeqX7+B6t11V651Fy9e1MD+sXJ3d9fkKdPzvTKD0qnYXjZ65ZVXNGDAAO3cuVMdOnRQlSpVZLPZlJqaqo0bN2rOnDmaPHnyTffj6emZ6x9aXjIquSaP6KbuUU305EuzdDHtsqoEXvlMhvMXL+tyRqYkaeo/Nis+tqMOppzUwZRTejX2YaVfztSy9TskSRcuXtaC1Uma8HJXnTmfpl/PX9L4l/6kfQd/0aZvf3Q4XuumdVXzjiAtWJ07lA8fP+3wOLCCryTpx0OpfM4LUIQuXUpTSsp/P7Lg55+P68cfk+Xv769q1YIlXYmTjZ9s0LBXhud6flraRQ3s31eX09P15jv/o7S0i0pLuyhJqlgxQG5u/B1Q2hVbvAwaNEiBgYGaNGmSZs6cqezsbEmSm5ubIiMjtXDhQnXr1q24xkMRea7bQ5KkjXOGOiyPG71IH6z9VpL0twWfysvTQ5NHdFdFPx9t33dEjw18z/4ZL5L06tsrlJ2dow8mxsrb012btx1Q/yGLcr1jKCa6hZL2/KQDh/9TtCcGwGn79+1TXN9e9sd/SxwvSerc5U8a9+YESdKG9R9JlqVHHn0s1/N/2L9f33935aMUOj/awWHdR//3mUJC7iiq0VFC2Kyb3QV5G2RmZur06Sv/FRwUFFTo1y29I7hpCyitzm57r7hHAFBEvJ38679EvCHe3d3dqftbAAAA+GJGAABgFOIFAAAYhXgBAABGIV4AAIBRiBcAAGAU4gUAABiFeAEAAEYhXgAAgFGIFwAAYBTiBQAAGIV4AQAARiFeAACAUYgXAABgFOIFAAAYhXgBAABGIV4AAIBRiBcAAGAU4gUAABiFeAEAAEYhXgAAgFGIFwAAYBTiBQAAGIV4AQAARiFeAACAUYgXAABgFOIFAAAYhXgBAABGIV4AAIBRiBcAAGAU4gUAABiFeAEAAEYhXgAAgFGIFwAAYBTiBQAAGIV4AQAARiFeAACAUYgXAABgFOIFAAAYhXgBAABGIV4AAIBRiBcAAGAU4gUAABiFeAEAAEYhXgAAgFGIFwAAYBTiBQAAGIV4AQAARiFeAACAUVwSL+fOnXPFbgAAAG6qwPEyceJELVu2zP64W7duCgwMVEhIiPbu3evS4QAAAK5X4HiZOXOmQkNDJUkbN27Uxo0btX79ekVFRSk+Pt7lAwIAAFyrbEGfcOLECXu8rFu3Tt26dVPHjh1Vo0YNNWvWzOUDAgAAXKvAV14qVqyoY8eOSZI2bNig9u3bS5Isy1J2drZrpwMAALhOga+8dO3aVT169NCdd96pM2fOKCoqSpK0Z88e1alTx+UDAgAAXKvA8TJp0iTVqFFDx44dU2Jionx9fSVdeTlp0KBBLh8QAADgWjbLsqziHsLVvCOeL+4RABSRs9veK+4RABQRb3fntnPqysuaNWucPvDjjz/u9LYAAAAF5VS8REdHO7Uzm83GTbsAAKBIORUvOTk5RT0HAACAUwr19QCXL1921RwAAABOKXC8ZGdna9y4cQoJCZGvr68OHTokSRo1apTmzp3r8gEBAACuVeB4efPNN7VgwQIlJibKw8PDvvyee+7RnDlzXDocAADA9QocLwsXLtSsWbPUs2dPubm52Zffe++9+vHHH106HAAAwPUKHC8///xznp+km5OTo8zMTJcMBQAAkJ8Cx0uDBg301Vdf5Vr+v//7v4qIiHDJUAAAAPkp8NcDJCQk6Nlnn9XPP/+snJwcrVy5UgcOHNDChQu1bt26opgRAADArsBXXjp37qxly5bp448/ls1m0+jRo5WcnKy1a9eqQ4cORTEjAACAHd9tBMAofLcRUHq59LuN8rJjxw4lJyfLZrMpPDxckZGRt7orAAAApxU4Xo4fP66nn35aW7ZsUYUKFSRJ586dU4sWLbRkyRKFhoa6ekYAAAC7At/z0rdvX2VmZio5OVlnz57V2bNnlZycLMuyFBsbWxQzAgAA2BX4nhdvb2998803ud4WvWvXLj3wwANKT0936YC3gntegNKLe16A0svZe14KfOUlLCwszw+jy8rKUkhISEF3BwAAUCAFjpfExES98MIL2rFjh65etNmxY4eGDBmit99+2+UDAgAAXMupl40qVqwom81mf5yWlqasrCyVLXvlft+r/79cuXI6e/Zs0U3rJF42AkovXjYCSi+XvlV68uTJhRgFAADAdZyKl969exf1HAAAAE655Q+pk6T09PRcN+/6+fkVaiAAAIAbKfANu2lpaXr++edVuXJl+fr6qmLFig4/AAAARanA8fLqq69q06ZNmjZtmjw9PTVnzhyNHTtWwcHBWrhwYVHMCAAAYFfgl43Wrl2rhQsXqnXr1urbt69atmypOnXqqHr16lq8eLF69uxZFHMCAABIuoUrL2fPnlXNmjUlXbm/5epbox988EF9+eWXrp0OAADgOgWOl1q1aunIkSOSpPr162v58uWSrlyRufpFjQAAAEWlwPHSp08f7d27V5I0YsQI+70vL730kuLj410+IAAAwLUK/MWM10tJSdGOHTtUu3ZtNWzY0FVzFQqfsAuUXnzCLlB6FdkXM14vLCxMXbt2VUBAgPr27VvY3QEAANxQoa+8XLV37141btxY2dnZrthdoZz6Lau4RwBQRMp7F+qzNQGUYF5O/vEu9JUXAACA24l4AQAARiFeAACAUZx+8bhr1643XH/u3LnCzgIAAHBTTseLv7//Tdf36tWr0AMBAADciMvebVSS8G4joPTi3UZA6cW7jQAAQKlEvAAAAKMQLwAAwCjECwAAMArxAgAAjHJL8bJo0SI98MADCg4O1tGjRyVJkydP1j//+U+XDgcAAHC9AsfL9OnT9fLLL+vRRx/VuXPn7F/EWKFCBU2ePNnV8wEAADgocLxMmTJFs2fP1siRI+Xm5mZf3qRJE33//fcuHQ4AAOB6BY6Xw4cPKyIiItdyT09PpaWluWQoAACA/BQ4XmrWrKk9e/bkWr5+/XrVr1/fFTMBAADkq8Cfsx0fH6/Bgwfr8uXLsixL27Zt05IlSzR+/HjNmTOnKGYEAACwu6XvNpo9e7beeOMNHTt2TJIUEhKiMWPGKDY21uUD3gq+2wgovfhuI6D0cva7jQr1xYynT59WTk6OKleufKu7KBLEC1B6ES9A6eVsvBTq3wJBQUGFeToAAECBFTheatasKZvNlu/6Q4cOFWogAACAGylwvAwdOtThcWZmpnbv3q0NGzYoPj7eVXMBAADkqcDxMmTIkDyXT506VTt27Cj0QAAAADdSqBt2r3Xo0CE1atRIFy5ccMXuCoUbdoHSixt2gdLL2Rt2Xfat0h9++KECAgJctTsAAIA8Ffg/YSIiIhxu2LUsS6mpqTp16pSmTZvm0uEAAACuV+B4iY6OdnhcpkwZVapUSa1bt9Zdd93lqrkAAADyVKB4ycrKUo0aNfTwww+ratWqRTUTAABAvgp8w66Pj4+Sk5NVvXr1opqp0LhhFyi9uGEXKL2K7IbdZs2aaffu3QV9GgAAgEsU+D9hBg0apGHDhun48eOKjIxUuXLlHNbfe++9LhsOAADgek6/bNS3b19NnjxZFSpUyL0Tm02WZclmsyk7O9vVMxYYLxsBpRcvGwGll8u/VdrNzU0nTpxQenr6DbcrCffCEC9A6UW8AKWXy79V+mrjlIQ4AQAAf1wFumH3Rt8mDQAAcDsU6Ppr3bp1bxowZ8+eLdRAAAAAN1KgeBk7dqz8/f2LahYAAICbcvqG3TJlyig1NVWVK1cu6pkKjRt2gdKLG3aB0svlH1LH/S4AAKAkcDpeCvgtAgAAAEXC6euvOTk5RTkHAACAUwr83UYAAADFiXgBAABGIV4AAIBRiBcAAGAU4gUAABiFeAEAAEYhXgAAgFGIFwAAYBTiBQAAGIV4AQAARiFeAACAUYgXAABgFOIFAAAYhXgBAABGIV4AAIBRiBcAAGAU4gUAABiFeAEAAEYhXgAAgFGIFwAAYBTiBQAAGIV4AQAARiFeAACAUYgXAABgFOIFAAAYhXgBAABGIV4AAIBRiBcAAGAU4gUAABiFeAEAAEYhXgAAgFGIFwAAYBTiBQAAGIV4AQAARiFeAACAUYgXAABgFOIFAAAYhXgBAABGIV4AAIBRiBcAAGAU4gUAABiFeAEAAEYhXgAAgFGIFwAAYBTiBQAAGIV4AQAARiFeAACAUYgXAABgFOIFAAAYpUTHy7Fjx9S3b98bbpORkaELFy44/GRkZNymCQEAwO1WouPl7Nmzev/992+4zfjx4+Xv7+/w887fJt6mCQEAwO1msyzLKq6Dr1mz5obrDx06pGHDhik7OzvfbTIyMnJdabnwu5s8PT1dMiOAkqW8d9niHgFAEfFy8o93scZLmTJlZLPZdKMRbDbbDeMlL6d+yyrsaABKKOIFKL2cjZdifdmoWrVqWrFihXJycvL82bVrV3GOBwAASqBijZfIyMgbBsrNrsoAAIA/nmK9/hofH6+0tLR819epU0ebN2++jRMBAICSrljveSkq3PMClF7c8wKUXkbc8wIAAFBQxAsAADAK8QIAAIxCvAAAAKMQLwAAwCjECwAAMArxAgAAjEK8AAAAoxAvAADAKMQLAAAwCvECAACMQrwAAACjEC8AAMAoxAsAADAK8QIAAIxCvAAAAKMQLwAAwCjECwAAMArxAgAAjEK8AAAAoxAvAADAKMQLAAAwCvECAACMQrwAAACjEC8AAMAoxAsAADAK8QIAAIxCvAAAAKMQLwAAwCjECwAAMArxAgAAjEK8AAAAoxAvAADAKMQLAAAwCvECAACMQrwAAACjEC8AAMAoxAsAADAK8QIAAIxCvAAAAKMQLwAAwCjECwAAMArxAgAAjEK8AAAAoxAvAADAKMQLAAAwCvECAACMQrwAAACjEC8AAMAoxAsAADAK8QIAAIxCvAAAAKMQLwAAwCjECwAAMArxAgAAjEK8AAAAoxAvAADAKMQLAAAwCvECAACMQrwAAACjEC8AAMAoxAsAADAK8QIAAIxCvAAAAKMQLwAAwCjECwAAMArxAgAAjEK8AAAAoxAvAADAKMQLAAAwCvECAACMQrwAAACjEC8AAMAoxAsAADAK8QIAAIxCvAAAAKMQLwAAwCjECwAAMArxAgAAjEK8AAAAoxAvAADAKMQLAAAwCvECAACMQrwAAACjEC8AAMAoxAsAADCKzbIsq7iHAG5VRkaGxo8frxEjRsjT07O4xwHgQvz5Rn6IFxjtwoUL8vf31/nz5+Xn51fc4wBwIf58Iz+8bAQAAIxCvAAAAKMQLwAAwCjEC4zm6emphIQEbuYDSiH+fCM/3LALAACMwpUXAABgFOIFAAAYhXgBAABGIV4AAIBRiBcYbdq0aapZs6a8vLwUGRmpr776qrhHAlBIX375pTp37qzg4GDZbDatXr26uEdCCUO8wFjLli3T0KFDNXLkSO3evVstW7ZUVFSUUlJSins0AIWQlpamhg0b6r333ivuUVBC8VZpGKtZs2Zq3Lixpk+fbl8WHh6u6OhojR8/vhgnA+AqNptNq1atUnR0dHGPghKEKy8w0u+//66dO3eqY8eODss7duyob775ppimAgDcDsQLjHT69GllZ2erSpUqDsurVKmi1NTUYpoKAHA7EC8wms1mc3hsWVauZQCA0oV4gZGCgoLk5uaW6yrLyZMnc12NAQCULsQLjOTh4aHIyEht3LjRYfnGjRvVokWLYpoKAHA7lC3uAYBb9fLLL+vZZ59VkyZN1Lx5c82aNUspKSkaMGBAcY8GoBAuXryogwcP2h8fPnxYe/bsUUBAgMLCwopxMpQUvFUaRps2bZoSExN14sQJ3X333Zo0aZIeeuih4h4LQCF8/vnnatOmTa7lvXv31oIFC27/QChxiBcAAGAU7nkBAABGIV4AAIBRiBcAAGAU4gUAABiFeAEAAEYhXgAAgFGIFwAAYBTiBcAtGzNmjBo1amR/HBMTo+jo6Ns+x5EjR2Sz2bRnz54iO8b153orbsecwB8B8QKUMjExMbLZbLLZbHJ3d1etWrX0yiuvKC0trciP/c477zj9Cai3+y/y1q1ba+jQobflWACKFt9tBJRCjzzyiObPn6/MzEx99dVX6tevn9LS0jR9+vRc22ZmZsrd3d0lx/X393fJfgDgRrjyApRCnp6eqlq1qkJDQ9WjRw/17NlTq1evlvTflz/mzZunWrVqydPTU5Zl6fz58+rfv78qV64sPz8/tW3bVnv37nXY74QJE1SlShWVL19esbGxunz5ssP66182ysnJ0cSJE1WnTh15enoqLCxMb775piSpZs2akqSIiAjZbDa1bt3a/rz58+crPDxcXl5euuuuuzRt2jSH42zbtk0RERHy8vJSkyZNtHv37kL/zoYPH666devKx8dHtWrV0qhRo5SZmZlru5kzZyo0NFQ+Pj568sknde7cOYf1N5v9Wr/++qt69uypSpUqydvbW3feeafmz59f6HMBSjuuvAB/AN7e3g5/ER88eFDLly/XihUr5ObmJknq1KmTAgIC9PHHH8vf318zZ85Uu3bt9K9//UsBAQFavny5EhISNHXqVLVs2VKLFi3Su+++q1q1auV73BEjRmj27NmaNGmSHnzwQZ04cUI//vijpCsB0rRpU3366adq0KCBPDw8JEmzZ89WQkKC3nvvPUVERGj37t2Ki4tTuXLl1Lt3b6Wlpemxxx5T27Zt9cEHH+jw4cMaMmRIoX9H5cuX14IFCxQcHKzvv/9ecXFxKl++vF599dVcv7e1a9fqwoULio2N1eDBg7V48WKnZr/eqFGj9MMPP2j9+vUKCgrSwYMHlZ6eXuhzAUo9C0Cp0rt3b6tLly72x99++60VGBhodevWzbIsy0pISLDc3d2tkydP2rf57LPPLD8/P+vy5csO+6pdu7Y1c+ZMy7Isq3nz5taAAQMc1jdr1sxq2LBhnse+cOGC5enpac2ePTvPOQ8fPmxJsnbv3u2wPDQ01PrHP/7hsGzcuHFW8+bNLcuyrJkzZ1oBAQFWWlqaff306dPz3Ne1WrVqZQ0ZMiTf9ddLTEy0IiMj7Y8TEhIsNzc369ixY/Zl69evt8qUKWOdOHHCqdmvP+fOnTtbffr0cXomAFdw5QUohdatWydfX19lZWUpMzNTXbp00ZQpU+zrq1evrkqVKtkf79y5UxcvXlRgYKDDftLT0/XTTz9JkpKTkzVgwACH9c2bN9fmzZvznCE5OVkZGRlq166d03OfOnVKx44dU2xsrOLi4uzLs7Ky7PfTJCcnq2HDhvLx8XGYo7A+/PBDTZ48WQcPHtTFixeVlZUlPz8/h23CwsJ0xx13OBw3JydHBw4ckJub201nv97AgQP1xBNPaNeuXerYsaOio6PVokWLQp8LUNoRL0Ap1KZNG02fPl3u7u4KDg7OdUNuuXLlHB7n5OSoWrVq+vzzz3Ptq0KFCrc0g7e3d4Gfk5OTI+nKyy/NmjVzWHf15S3Lsm5pnhvZunWrnnrqKY0dO1YPP/yw/P39tXTpUv3tb3+74fNsNpv9f52Z/XpRUVE6evSoPvroI3366adq166dBg8erLffftsFZwWUXsQLUAqVK1dOderUcXr7xo0bKzU1VWXLllWNGjXy3CY8PFxbt25Vr1697Mu2bt2a7z7vvPNOeXt767PPPlO/fv1yrb96j0t2drZ9WZUqVRQSEqJDhw6pZ8+eee63fv36WrRokdLT0+2BdKM5nLFlyxZVr15dI0eOtC87evRoru1SUlL0yy+/KDg4WJKUlJSkMmXKqG7duk7NnpdKlSopJiZGMTExatmypeLj44kX4CaIFwBq3769mjdvrujoaE2cOFH16tXTL7/8oo8//ljR0dFq0qSJhgwZot69e6tJkyZ68MEHtXjxYu3fvz/fG3a9vLw0fPhwvfrqq/Lw8NADDzygU6dOaf/+/YqNjVXlypXl7e2tDRs26I477pCXl5f8/f01ZswYvfjii/Lz81NUVJQyMjK0Y8cO/frrr3r55ZfVo0cPjRw5UrGxsfrrX/+qI0eOOP2X/alTp3J9rkzVqlVVp04dpaSkaOnSpbrvvvv00UcfadWqVXmeU+/evfX222/rwoULevHFF9WtWzdVrVpVkm46+/VGjx6tyMhINWjQQBkZGVq3bp3Cw8OdOhfgD624b7oB4FrX37B7vYSEBIebbK+6cOGC9cILL1jBwcGWu7u7FRoaavXs2dNKSUmxb/Pmm29aQUFBlq+vr9W7d2/r1VdfzfeGXcuyrOzsbOuNN96wqlevbrm7u1thYWHWW2+9ZV8/e/ZsKzQ01CpTpozVqlUr+/LFixdbjRo1sjw8PKyKFStaDz30kLVy5Ur7+qSkJKthw4aWh4eH1ahRI2vFihVO3bArKddPQkKCZVmWFR8fbwUGBlq+vr5W9+7drUmTJln+/v65fm/Tpk2zgoODLS8vL6tr167W2bNnHY5zo9mvv2F33LhxVnh4uOXt7W0FBARYXbp0sQ4dOpTvOQC4wmZZRfACMgAAQBHhQ+oAAIBRiBcAAGAU4gUAABiFeAEAAEYhXgAAgFGIFwAAYBTiBQAAGIV4AQAARiFeAACAUYgXAABgFOIFAAAYhXgBAABG+X9xfnFfJNSA0QAAAABJRU5ErkJggg==",
+      "text/plain": [
+       "<Figure size 640x480 with 1 Axes>"
+      ]
+     },
+     "metadata": {},
+     "output_type": "display_data"
+    }
+   ],
+   "source": [
+    "from sklearn.neighbors import KNeighborsClassifier\n",
+    "from sklearn.metrics import confusion_matrix\n",
+    "import seaborn as sns\n",
+    "import matplotlib.pyplot as plt\n",
+    "model =  KNeighborsClassifier(3)\n",
+    "model.fit(X_train, y_train)\n",
+    "y_pred = model.predict(X_test)\n",
+    "accuracy = accuracy_score(y_test, y_pred)\n",
+    "print(\"Accuracy:\", accuracy)\n",
+    "conf_matrix = confusion_matrix(y_test, y_pred)\n",
+    "sns.heatmap(conf_matrix, annot=True, fmt=\"d\", cmap=\"Blues\", cbar=False)\n",
+    "plt.title(\"Confusion Matrix\")\n",
+    "plt.xlabel(\"Predicted Labels\")\n",
+    "plt.ylabel(\"True Labels\")\n",
+    "plt.show()"
+   ]
+  },
+  {
+   "cell_type": "markdown",
+   "metadata": {},
+   "source": [
+    "## 练习5：调整模型的标准\n",
+    "\n",
+    "银行通常会有更严格的要求，因为fraud带来的后果通常比较严重，一般我们会调整模型的标准。<br>\n",
+    "\n",
+    "比如在logistic regression当中，一般我们的概率判定边界为0.5，但是我们可以把阈值设定低一些，来提高模型的“敏感度”，试试看把阈值设定为0.3，再看看这时的评估指标(主要是准确率和召回率)。\n",
+    "\n",
+    "- 提示：<span style='color:white'>sklearn的很多分类模型，predict_prob可以拿到预估的概率，可以根据它和设定的阈值大小去判断最终结果(分类类别)('Gender') </span>"
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": 26,
+   "metadata": {
+    "jupyter": {
+     "outputs_hidden": true
+    }
+   },
+   "outputs": [
+    {
+     "name": "stderr",
+     "output_type": "stream",
+     "text": [
+      "D:\\Anaconda\\envs\\taiji\\Lib\\site-packages\\sklearn\\utils\\validation.py:767: FutureWarning: is_sparse is deprecated and will be removed in a future version. Check `isinstance(dtype, pd.SparseDtype)` instead.\n",
+      "  if not hasattr(array, \"sparse\") and array.dtypes.apply(is_sparse).any():\n",
+      "D:\\Anaconda\\envs\\taiji\\Lib\\site-packages\\sklearn\\utils\\validation.py:605: FutureWarning: is_sparse is deprecated and will be removed in a future version. Check `isinstance(dtype, pd.SparseDtype)` instead.\n",
+      "  if is_sparse(pd_dtype):\n",
+      "D:\\Anaconda\\envs\\taiji\\Lib\\site-packages\\sklearn\\utils\\validation.py:614: FutureWarning: is_sparse is deprecated and will be removed in a future version. Check `isinstance(dtype, pd.SparseDtype)` instead.\n",
+      "  if is_sparse(pd_dtype) or not is_extension_array_dtype(pd_dtype):\n",
+      "D:\\Anaconda\\envs\\taiji\\Lib\\site-packages\\sklearn\\utils\\validation.py:605: FutureWarning: is_sparse is deprecated and will be removed in a future version. Check `isinstance(dtype, pd.SparseDtype)` instead.\n",
+      "  if is_sparse(pd_dtype):\n",
+      "D:\\Anaconda\\envs\\taiji\\Lib\\site-packages\\sklearn\\utils\\validation.py:614: FutureWarning: is_sparse is deprecated and will be removed in a future version. Check `isinstance(dtype, pd.SparseDtype)` instead.\n",
+      "  if is_sparse(pd_dtype) or not is_extension_array_dtype(pd_dtype):\n"
+     ]
+    },
+    {
+     "name": "stdout",
+     "output_type": "stream",
+     "text": [
+      "Accuracy: 0.9307\n",
+      "Precision: 0.4409\n",
+      "Recall: 0.0553\n"
+     ]
+    },
+    {
+     "name": "stderr",
+     "output_type": "stream",
+     "text": [
+      "D:\\Anaconda\\envs\\taiji\\Lib\\site-packages\\sklearn\\linear_model\\_logistic.py:458: ConvergenceWarning: lbfgs failed to converge (status=1):\n",
+      "STOP: TOTAL NO. of ITERATIONS REACHED LIMIT.\n",
+      "\n",
+      "Increase the number of iterations (max_iter) or scale the data as shown in:\n",
+      "    https://scikit-learn.org/stable/modules/preprocessing.html\n",
+      "Please also refer to the documentation for alternative solver options:\n",
+      "    https://scikit-learn.org/stable/modules/linear_model.html#logistic-regression\n",
+      "  n_iter_i = _check_optimize_result(\n",
+      "D:\\Anaconda\\envs\\taiji\\Lib\\site-packages\\sklearn\\utils\\validation.py:767: FutureWarning: is_sparse is deprecated and will be removed in a future version. Check `isinstance(dtype, pd.SparseDtype)` instead.\n",
+      "  if not hasattr(array, \"sparse\") and array.dtypes.apply(is_sparse).any():\n",
+      "D:\\Anaconda\\envs\\taiji\\Lib\\site-packages\\sklearn\\utils\\validation.py:605: FutureWarning: is_sparse is deprecated and will be removed in a future version. Check `isinstance(dtype, pd.SparseDtype)` instead.\n",
+      "  if is_sparse(pd_dtype):\n",
+      "D:\\Anaconda\\envs\\taiji\\Lib\\site-packages\\sklearn\\utils\\validation.py:614: FutureWarning: is_sparse is deprecated and will be removed in a future version. Check `isinstance(dtype, pd.SparseDtype)` instead.\n",
+      "  if is_sparse(pd_dtype) or not is_extension_array_dtype(pd_dtype):\n",
+      "D:\\Anaconda\\envs\\taiji\\Lib\\site-packages\\sklearn\\utils\\validation.py:605: FutureWarning: is_sparse is deprecated and will be removed in a future version. Check `isinstance(dtype, pd.SparseDtype)` instead.\n",
+      "  if is_sparse(pd_dtype):\n",
+      "D:\\Anaconda\\envs\\taiji\\Lib\\site-packages\\sklearn\\utils\\validation.py:614: FutureWarning: is_sparse is deprecated and will be removed in a future version. Check `isinstance(dtype, pd.SparseDtype)` instead.\n",
+      "  if is_sparse(pd_dtype) or not is_extension_array_dtype(pd_dtype):\n",
+      "D:\\Anaconda\\envs\\taiji\\Lib\\site-packages\\sklearn\\utils\\validation.py:605: FutureWarning: is_sparse is deprecated and will be removed in a future version. Check `isinstance(dtype, pd.SparseDtype)` instead.\n",
+      "  if is_sparse(pd_dtype):\n",
+      "D:\\Anaconda\\envs\\taiji\\Lib\\site-packages\\sklearn\\utils\\validation.py:614: FutureWarning: is_sparse is deprecated and will be removed in a future version. Check `isinstance(dtype, pd.SparseDtype)` instead.\n",
+      "  if is_sparse(pd_dtype) or not is_extension_array_dtype(pd_dtype):\n",
+      "D:\\Anaconda\\envs\\taiji\\Lib\\site-packages\\sklearn\\utils\\validation.py:605: FutureWarning: is_sparse is deprecated and will be removed in a future version. Check `isinstance(dtype, pd.SparseDtype)` instead.\n",
+      "  if is_sparse(pd_dtype):\n",
+      "D:\\Anaconda\\envs\\taiji\\Lib\\site-packages\\sklearn\\utils\\validation.py:614: FutureWarning: is_sparse is deprecated and will be removed in a future version. Check `isinstance(dtype, pd.SparseDtype)` instead.\n",
+      "  if is_sparse(pd_dtype) or not is_extension_array_dtype(pd_dtype):\n",
+      "D:\\Anaconda\\envs\\taiji\\Lib\\site-packages\\sklearn\\utils\\validation.py:605: FutureWarning: is_sparse is deprecated and will be removed in a future version. Check `isinstance(dtype, pd.SparseDtype)` instead.\n",
+      "  if is_sparse(pd_dtype):\n",
+      "D:\\Anaconda\\envs\\taiji\\Lib\\site-packages\\sklearn\\utils\\validation.py:614: FutureWarning: is_sparse is deprecated and will be removed in a future version. Check `isinstance(dtype, pd.SparseDtype)` instead.\n",
+      "  if is_sparse(pd_dtype) or not is_extension_array_dtype(pd_dtype):\n",
+      "D:\\Anaconda\\envs\\taiji\\Lib\\site-packages\\sklearn\\utils\\validation.py:605: FutureWarning: is_sparse is deprecated and will be removed in a future version. Check `isinstance(dtype, pd.SparseDtype)` instead.\n",
+      "  if is_sparse(pd_dtype):\n",
+      "D:\\Anaconda\\envs\\taiji\\Lib\\site-packages\\sklearn\\utils\\validation.py:614: FutureWarning: is_sparse is deprecated and will be removed in a future version. Check `isinstance(dtype, pd.SparseDtype)` instead.\n",
+      "  if is_sparse(pd_dtype) or not is_extension_array_dtype(pd_dtype):\n",
+      "D:\\Anaconda\\envs\\taiji\\Lib\\site-packages\\sklearn\\utils\\validation.py:605: FutureWarning: is_sparse is deprecated and will be removed in a future version. Check `isinstance(dtype, pd.SparseDtype)` instead.\n",
+      "  if is_sparse(pd_dtype):\n",
+      "D:\\Anaconda\\envs\\taiji\\Lib\\site-packages\\sklearn\\utils\\validation.py:614: FutureWarning: is_sparse is deprecated and will be removed in a future version. Check `isinstance(dtype, pd.SparseDtype)` instead.\n",
+      "  if is_sparse(pd_dtype) or not is_extension_array_dtype(pd_dtype):\n",
+      "D:\\Anaconda\\envs\\taiji\\Lib\\site-packages\\sklearn\\utils\\validation.py:605: FutureWarning: is_sparse is deprecated and will be removed in a future version. Check `isinstance(dtype, pd.SparseDtype)` instead.\n",
+      "  if is_sparse(pd_dtype):\n",
+      "D:\\Anaconda\\envs\\taiji\\Lib\\site-packages\\sklearn\\utils\\validation.py:614: FutureWarning: is_sparse is deprecated and will be removed in a future version. Check `isinstance(dtype, pd.SparseDtype)` instead.\n",
+      "  if is_sparse(pd_dtype) or not is_extension_array_dtype(pd_dtype):\n",
+      "D:\\Anaconda\\envs\\taiji\\Lib\\site-packages\\sklearn\\utils\\validation.py:605: FutureWarning: is_sparse is deprecated and will be removed in a future version. Check `isinstance(dtype, pd.SparseDtype)` instead.\n",
+      "  if is_sparse(pd_dtype):\n",
+      "D:\\Anaconda\\envs\\taiji\\Lib\\site-packages\\sklearn\\utils\\validation.py:614: FutureWarning: is_sparse is deprecated and will be removed in a future version. Check `isinstance(dtype, pd.SparseDtype)` instead.\n",
+      "  if is_sparse(pd_dtype) or not is_extension_array_dtype(pd_dtype):\n",
+      "D:\\Anaconda\\envs\\taiji\\Lib\\site-packages\\sklearn\\utils\\validation.py:605: FutureWarning: is_sparse is deprecated and will be removed in a future version. Check `isinstance(dtype, pd.SparseDtype)` instead.\n",
+      "  if is_sparse(pd_dtype):\n",
+      "D:\\Anaconda\\envs\\taiji\\Lib\\site-packages\\sklearn\\utils\\validation.py:614: FutureWarning: is_sparse is deprecated and will be removed in a future version. Check `isinstance(dtype, pd.SparseDtype)` instead.\n",
+      "  if is_sparse(pd_dtype) or not is_extension_array_dtype(pd_dtype):\n",
+      "D:\\Anaconda\\envs\\taiji\\Lib\\site-packages\\sklearn\\utils\\validation.py:605: FutureWarning: is_sparse is deprecated and will be removed in a future version. Check `isinstance(dtype, pd.SparseDtype)` instead.\n",
+      "  if is_sparse(pd_dtype):\n",
+      "D:\\Anaconda\\envs\\taiji\\Lib\\site-packages\\sklearn\\utils\\validation.py:614: FutureWarning: is_sparse is deprecated and will be removed in a future version. Check `isinstance(dtype, pd.SparseDtype)` instead.\n",
+      "  if is_sparse(pd_dtype) or not is_extension_array_dtype(pd_dtype):\n",
+      "D:\\Anaconda\\envs\\taiji\\Lib\\site-packages\\sklearn\\utils\\validation.py:605: FutureWarning: is_sparse is deprecated and will be removed in a future version. Check `isinstance(dtype, pd.SparseDtype)` instead.\n",
+      "  if is_sparse(pd_dtype):\n",
+      "D:\\Anaconda\\envs\\taiji\\Lib\\site-packages\\sklearn\\utils\\validation.py:614: FutureWarning: is_sparse is deprecated and will be removed in a future version. Check `isinstance(dtype, pd.SparseDtype)` instead.\n",
+      "  if is_sparse(pd_dtype) or not is_extension_array_dtype(pd_dtype):\n",
+      "D:\\Anaconda\\envs\\taiji\\Lib\\site-packages\\sklearn\\utils\\validation.py:605: FutureWarning: is_sparse is deprecated and will be removed in a future version. Check `isinstance(dtype, pd.SparseDtype)` instead.\n",
+      "  if is_sparse(pd_dtype):\n",
+      "D:\\Anaconda\\envs\\taiji\\Lib\\site-packages\\sklearn\\utils\\validation.py:614: FutureWarning: is_sparse is deprecated and will be removed in a future version. Check `isinstance(dtype, pd.SparseDtype)` instead.\n",
+      "  if is_sparse(pd_dtype) or not is_extension_array_dtype(pd_dtype):\n",
+      "D:\\Anaconda\\envs\\taiji\\Lib\\site-packages\\sklearn\\utils\\validation.py:605: FutureWarning: is_sparse is deprecated and will be removed in a future version. Check `isinstance(dtype, pd.SparseDtype)` instead.\n",
+      "  if is_sparse(pd_dtype):\n",
+      "D:\\Anaconda\\envs\\taiji\\Lib\\site-packages\\sklearn\\utils\\validation.py:614: FutureWarning: is_sparse is deprecated and will be removed in a future version. Check `isinstance(dtype, pd.SparseDtype)` instead.\n",
+      "  if is_sparse(pd_dtype) or not is_extension_array_dtype(pd_dtype):\n",
+      "D:\\Anaconda\\envs\\taiji\\Lib\\site-packages\\sklearn\\utils\\validation.py:605: FutureWarning: is_sparse is deprecated and will be removed in a future version. Check `isinstance(dtype, pd.SparseDtype)` instead.\n",
+      "  if is_sparse(pd_dtype):\n",
+      "D:\\Anaconda\\envs\\taiji\\Lib\\site-packages\\sklearn\\utils\\validation.py:614: FutureWarning: is_sparse is deprecated and will be removed in a future version. Check `isinstance(dtype, pd.SparseDtype)` instead.\n",
+      "  if is_sparse(pd_dtype) or not is_extension_array_dtype(pd_dtype):\n",
+      "D:\\Anaconda\\envs\\taiji\\Lib\\site-packages\\sklearn\\utils\\validation.py:605: FutureWarning: is_sparse is deprecated and will be removed in a future version. Check `isinstance(dtype, pd.SparseDtype)` instead.\n",
+      "  if is_sparse(pd_dtype):\n",
+      "D:\\Anaconda\\envs\\taiji\\Lib\\site-packages\\sklearn\\utils\\validation.py:614: FutureWarning: is_sparse is deprecated and will be removed in a future version. Check `isinstance(dtype, pd.SparseDtype)` instead.\n",
+      "  if is_sparse(pd_dtype) or not is_extension_array_dtype(pd_dtype):\n"
+     ]
+    }
+   ],
+   "source": [
+    "from sklearn.model_selection import train_test_split\n",
+    "from sklearn.linear_model import LogisticRegression\n",
+    "from sklearn.metrics import accuracy_score, precision_score, recall_score\n",
+    "import numpy as np\n",
+    "\n",
+    "# Split the data into training and testing sets\n",
+    "X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.2, random_state=42)\n",
+    "\n",
+    "# Instantiate the logistic regression model\n",
+    "model = LogisticRegression()\n",
+    "\n",
+    "# Train the model on the training data\n",
+    "model.fit(X_train, y_train)\n",
+    "\n",
+    "# Set a lower threshold for classification\n",
+    "threshold = 0.3\n",
+    "y_prob = model.predict_proba(X_test)[:, 1]  # Get the probabilities of the positive class\n",
+    "y_pred_lower_threshold = (y_prob >= threshold).astype(int)\n",
+    "\n",
+    "# Evaluate the model's performance with the lower threshold\n",
+    "accuracy = accuracy_score(y_test, y_pred_lower_threshold)\n",
+    "precision = precision_score(y_test, y_pred_lower_threshold)\n",
+    "recall = recall_score(y_test, y_pred_lower_threshold)\n",
+    "\n",
+    "print(f\"Accuracy: {accuracy:.4f}\")\n",
+    "print(f\"Precision: {precision:.4f}\")\n",
+    "print(f\"Recall: {recall:.4f}\")"
+   ]
+  }
+ ],
+ "metadata": {
+  "kernelspec": {
+   "display_name": "taiji",
+   "language": "python",
+   "name": "taiji"
+  },
+  "language_info": {
+   "codemirror_mode": {
+    "name": "ipython",
+    "version": 3
+   },
+   "file_extension": ".py",
+   "mimetype": "text/x-python",
+   "name": "python",
+   "nbconvert_exporter": "python",
+   "pygments_lexer": "ipython3",
+   "version": "3.11.5"
+  }
+ },
+ "nbformat": 4,
+ "nbformat_minor": 4
+}

	SeriousDlqin2yrs	RevolvingUtilizationOfUnsecuredLines	age	NumberOfTime30-59DaysPastDueNotWorse	DebtRatio	MonthlyIncome	NumberOfOpenCreditLinesAndLoans	NumberOfTimes90DaysLate	NumberRealEstateLoansOrLines	NumberOfDependents
0	1	0.766127	45.0	2.0	0.802982	9120.0	13.0	0.0	6.0	2.0
1	0	0.957151	40.0	0.0	0.121876	2600.0	4.0	0.0	0.0	1.0
2	0	0.658180	38.0	1.0	0.085113	3042.0	2.0	1.0	0.0	0.0
3	0	0.233810	30.0	0.0	0.036050	3300.0	5.0	0.0	0.0	0.0
4	0	0.907239	49.0	1.0	0.024926	63588.0	7.0	0.0	1.0	0.0