Application Python

Voici l’algorithme entraîné mis en fonction grâce à une régression logistique qui permet de répondre de façon binaire VRAIS ou FAUX billet.

Notebooks et outils de programmation :

import numpy as np
import pandas as pd
import pickle

In [2]:

# load
with open("model.pkl", "rb") as f:
    clf2 = pickle.load(f)

In [3]:

file_t = pd.read_csv("billets_test.csv")
file_t.head()

Out[3]:

diagonalheight_leftheight_rightmargin_lowmargin_uplengthid
0172.09103.95103.734.393.09113.19B_1
1171.52104.17104.035.273.16111.82B_2
2171.78103.80103.753.813.24113.39B_3
3172.02104.08103.995.573.30111.10B_4
4171.79104.34104.375.003.07111.87B_5

In [4]:

file_test = file_t[["height_right","margin_low","margin_up","length"]]

In [13]:

def detect_billet(list_billet:list, file_t):
    preds = clf2.predict(list_billet) 
    list_preds = []
    for i in preds:
        if i == 1:
            list_preds.append("Vrai billet")
        else:
            list_preds.append("Faux billet")
    file_t["pred"] = list_preds
    file_t["proba"] = np.round(clf2.predict_proba(list_billet)[:,1],3)
    return file_t

In [14]:

detect_billet(file_test, file_t)

Out[14]:

diagonalheight_leftheight_rightmargin_lowmargin_uplengthidpredproba
0172.09103.95103.734.393.09113.19B_1Vrai billet0.991
1171.52104.17104.035.273.16111.82B_2Faux billet0.005
2171.78103.80103.753.813.24113.39B_3Vrai billet0.999
3172.02104.08103.995.573.30111.10B_4Faux billet0.000
4171.79104.34104.375.003.07111.87B_5Faux billet0.014

In [ ]:

 

Présentation client par un PowerPoint.

Lien vers la première partie d’apprentissage et de régression linéaire.

Retour vers Data-Analyst