๋ณธ๋ฌธ ๋ฐ”๋กœ๊ฐ€๊ธฐ

728x90

๋จธ์‹ ๋Ÿฌ๋‹

4/7 ๋ชฉ ๋ชฉ์š”์ผ! ์˜ค๋Š˜๋„ Multinomial Classification๋ฅผ ๋Œ€ํ‘œ์ ์ธ ์˜ˆ์ œ(MNIST)๋ฅผ ํ†ตํ•ด ๋ฐฐ์šด๋‹ค~ ์†์œผ๋กœ ์“ด ์ˆซ์ž๋“ค๋กœ ์ด๋ฃจ์–ด์ง„ ๋Œ€ํ˜• ๋ฐ์ดํ„ฐ๋ฒ ์ด์Šค~ MNIST ์ด๋ฏธ์ง€๋Š” ๊ทธ ์ž์ฒด๊ฐ€ 2์ฐจ์›์ด๊ณ  ๊ทธ๋Ÿฐ ๊ฒŒ ์—ฌ๋Ÿฟ์ด๊ธฐ ๋•Œ๋ฌธ์— 3์ฐจ์›. ์ด๋ฏธ์ง€๋ฅผ 1์ฐจ์›์œผ๋กœ ravel() ํ•ด์•ผ ํ•จ https://www.kaggle.com/competitions/digit-recognizer/data?select=test.csv Digit Recognizer | Kaggle www.kaggle.com Tensorflow Ver. 1.15์€ ๋ฐฐ์šด ์ด๋ก ์„ ์ฝ”๋“œ๋กœ ์ดํ•ดํ•˜๊ธฐ์—๋Š” ์ข‹์ง€๋งŒ ์ฝ”๋“œ๊ฐ€ ๋„ˆ๋ฌด ์–ด๋ ต๋‹ค. 1. Multinomial Classification by Tensorflow Ver. 1.15 - MNIST import nump.. ๋”๋ณด๊ธฐ
4/6 ์ˆ˜ ์ˆ˜์š”์ผ! ์˜ค๋Š˜์€ Multinomial Classification์„ ๋ฐฐ์šด๋‹ค. Linear Regression(์—ฐ์†์ ์ธ ์ˆซ์ž ๊ฐ’ ์˜ˆ์ธก)์ด ๋ฐœ์ „ํ•œ ๊ฒƒ์ด Logistic Regression → Classification(๋ถ„๋ฅ˜๋ฅผ ํŒ๋‹จํ•˜๋Š” ์˜ˆ์ธก) - Binary Classification(์ดํ•ญ๋ถ„๋ฅ˜) - Multinomial Classification(๋‹คํ•ญ๋ถ„๋ฅ˜) Logistic Regression์€ ์ด์ง„ ๋ถ„๋ฅ˜์— ํŠนํ™”๋จ SKlearn์ด ์ œ๊ณตํ•˜๋Š” ๋ถ„๋ฅ˜๊ธฐ์ธ Gradient Descent(๊ฒฝ์‚ฌํ•˜๊ฐ•๋ฒ•)๊ฐ€ ๋ฐœ์ „ํ•œ ํ˜•ํƒœ์ธ SGD Classifier(Stochastic Gradient Descent, ํ™•๋ฅ ์  ๊ฒฝ์‚ฌํ•˜๊ฐ•๋ฒ•) 1. Binary Classification - ์œ„์Šค์ฝ˜์‹  ์œ ๋ฐฉ์•” ๋ฐ์ดํ„ฐ by Gradient Descent Cl.. ๋”๋ณด๊ธฐ
4/5 ํ™” ํ™”์š”์ผ! Logistic Regression์„ ํ™œ์šฉํ•ด ๋จธ์‹ ๋Ÿฌ๋‹ ์ง„ํ–‰ ์‹œ ์ฃผ์˜์‚ฌํ•ญ์„ ์•Œ์•„๋ณธ๋‹ค. ์•ž์œผ๋กœ ์šฐ๋ฆฌ๋Š” Classification(์ดํ•ญ๋ถ„๋ฅ˜)์˜ Metrics๋กœ Accuracy๋ฅผ ์‚ฌ์šฉํ•  ์˜ˆ์ •์ด๋‹ค. ๋ชจ๋ธ ํ‰๊ฐ€ ์ „ ๊ณ ๋ คํ•ด์•ผ ํ•˜๋Š” ๊ฒƒ๋“ค 1. learning rate(ํ•™์Šต๋ฅ ) : loss ๊ฐ’์„ ๋ณด๋ฉด์„œ ํ•™์Šต๋ฅ ์„ ์กฐ์ •ํ•ด์•ผ ํ•จ. ๋ณดํ†ต 1์˜ ๋งˆ์ด๋„ˆ์Šค 4์Šน์œผ๋กœ ์žก์Œ ํ•™์Šต๋ฅ ์ด ๋„ˆ๋ฌด ํฌ๋‹ค๋ฉด global minima(W')๋ฅผ ์ฐพ์„ ์ˆ˜ ์—†๊ฒŒ ๋จ → OverShooting ๋ฐœ์ƒ ํ•™์Šต๋ฅ ์ด ์•„์ฃผ ์ž‘๋‹ค๋ฉด local minima ์ฐพ๊ฒŒ ๋จ 2. Normalization(์ •๊ทœํ™”) : MinMax Scaling - 0 ~ 1. ์ด์ƒ์น˜์— ๋ฏผ๊ฐํ•จ Standardization - ํ‘œ์ค€ํ™”, Z-Score. ์ƒ๋Œ€์ ์œผ๋กœ ์ด์ƒ์น˜์— ๋‘”๊ฐํ•จ, ๋ชจ๋“  ์นผ๋Ÿผ์—.. ๋”๋ณด๊ธฐ
7ํšŒ ์ฐจ | 4/4 ์›” 7ํšŒ ์ฐจ! ๋ฒŒ์จ ์Šคํ„ฐ๋”” 4์ฃผ ์ฐจ๋‹ค~ ์ฒซ์งธ ์ฃผ๋Š” ํƒ€์ดํƒ€๋‹‰, ๋‘˜์งธ ์ฃผ๋Š” MovieLens EDA · ์‹œ๊ฐํ™” · ๊ธฐ์ˆ ํ†ต๊ณ„, ์…‹์งธ ์ฃผ๋Š” ์บ๊ธ€ ๋ฐ ๋ฐ์ด์ฝ˜์˜ ์˜ˆ์ œ ํ˜น์€ ๊ฐ์ž ์ˆ˜์ง‘ํ•œ ๋ฐ์ดํ„ฐ์…‹์œผ๋กœ ์ง„ํ–‰ํ•˜๋Š” ๋จธ์‹ ๋Ÿฌ๋‹ ํ”„๋กœ์ ํŠธ๊ฐ€ ์žˆ์—ˆ๋‹ค. ์ด๋ฒˆ ์ฃผ๋Š” ์ง€๋‚œ๋ฒˆ์— ์ˆ˜์ •ํ•œ ์ปค๋ฆฌํ˜๋Ÿผ์— ๋”ฐ๋ผ ๋ฉ€์บ  ์ฃผ๊ฐ„ ์ˆ˜์—…์—์„œ ๋ฐฐ์šด ์ธ๊ณต์‹ ๊ฒฝ๋ง์„ ๋ณต์Šตํ•˜๊ณ  ํ”„๋ ˆ์ž„์›Œํฌ๋ฅผ ๊ณต๋ถ€ํ•˜๋Š” ๊ฒƒ์œผ๋กœ ๊ฐ€๋‹ฅ์„ ์žก์•˜์—ˆ๋‹ค. ํ•˜์ง€๋งŒ ์•„์ง ์„ฑ๋Šฅํ‰๊ฐ€(Metrics)๋ฅผ ๋ฐฐ์šฐ๊ณ  ์žˆ์–ด, ์•„์ง ๋”ฅ๋Ÿฌ๋‹์œผ๋กœ ์ง„๋„๊ฐ€ ๋‚˜๊ฐ€์ง€ ์•Š์•˜๋‹ค. ๋‚ด์ผ๊นŒ์ง€ ์ œ์ถœํ•ด์•ผ ํ•˜๋Š” ์ˆ˜ํ–‰ํ‰๊ฐ€๋„ ์žˆ์–ด, ์ด์— ๋Œ€ํ•œ ๊ฐ์ž์˜ ์ง„ํ–‰ ์ƒํ™ฉ์„ ๋ฆฌ๋ทฐํ•˜๋Š” ๊ฒƒ์œผ๋กœ ๋Œ€์ฒดํ–ˆ๋‹ค. (๋ฐ์ดํ„ฐ ๋ถ„์„, ๋ฐ˜๋ณต๋ฌธ, ๋ถˆ๋ฆฐ ์ธ๋ฑ์‹ฑ, ์ „์ฒ˜๋ฆฌ, ์ •๊ทœํ™”, ๊ฒฐ์ธก์น˜ · ์ด์ƒ์น˜ ์ฒ˜๋ฆฌ ๋“ฑ ์Šคํƒ€์ผ์ด ๋‹ค ๋‹ค๋ฅด๋‹ค. ์ฐธ๊ณ ํ•ด์„œ ์ตœ์ ์˜ ๋ฐฉ๋ฒ•์„ ์ตํžˆ์ž) ์šด์˜์ง„ ํšŒ์˜๋ฅผ ๊ฑฐ์ณ ์ •ํ•œ ์ปค.. ๋”๋ณด๊ธฐ
4/4 ์›” ์›”์š”์ผ! ์˜ค๋Š˜์€ ๊ธˆ์š”์ผ์— ์‹ค์Šต ์˜ˆ์ œ๋กœ ์ฃผ์–ด์กŒ๋˜ admission(๋Œ€ํ•™์› ํ•ฉ๊ฒฉ ์—ฌ๋ถ€) ๋ฐ์ดํ„ฐ์…‹์„ Sklearn, Tensorflow๋กœ ๊ตฌํ˜„ํ•˜๊ณ , ์ง€๋‚œ์ฃผ์— ๋ฐฐ์šด Logistic Regression์„ ํ™œ์šฉํ•ด ํ‰๊ฐ€์ง€ํ‘œ(Metrics)๋ฅผ ์•Œ์•„๋ณธ๋‹ค. 1. Logistic Regression by Sklearn import numpy as np import pandas as pd import tensorflow as tf from sklearn import linear_model from sklearn.preprocessing import MinMaxScaler from scipy import stats import matplotlib.pyplot as plt import warnings warnings.filter.. ๋”๋ณด๊ธฐ
6ํšŒ ์ฐจ | 4/1 ๊ธˆ 6ํšŒ ์ฐจ! ์Šคํ„ฐ๋”” ์ถœ์„๋ถ€ ๋ณด๋‹ค ๋ณด๋‹ˆ, ๊ต์œก์ด ์–ผ๋งˆ ๋‚จ์ง€ ์•Š์„ ๊ฑธ ์‹ค๊ฐํ•œ๋‹ค. 6์›” 28์ผ์— ๋๋‚˜๋Š” ๊ต์œก ๊ธฐ๊ฐ„ ์ค‘ AI์™€ ์œต๋ณตํ•ฉ ํ”„๋กœ์ ํŠธ์— ๋“ค์–ด๊ฐ€๋Š” ๊ธฐ๊ฐ„์„ ์ œ์™ธํ•˜๋ฉด, ์ˆ˜์—…์€ 16์ผ ๋‚จ์•˜๋‹ค. (๋จธ์‹ ๋Ÿฌ๋‹ ๊ต์œก 2์ผ + ๋”ฅ๋Ÿฌ๋‹ ๊ต์œก 14์ผ) ํ”„๋กœ์ ํŠธ ์‹œ์ž‘ ์ „ ๋‚จ์€ ์Šคํ„ฐ๋”” ํšŸ์ˆ˜๋„ 6๋ฒˆ.. ๊ธฐ๊ฐ„์ด ์งง๊ณ  ๋ฐฐ์šธ ์–‘์ด ๋งŽ๊ณ  ์‹ฌ๋„ ์žˆ๋Š” ๋‚ด์šฉ์ด๋‹ค ๋ณด๋‹ˆ ๋ฐฐ์šธ ๋•Œ ์ œ๋Œ€๋กœ ์ฒด๋‚ดํ™” ํ•ด์•ผ ํ•œ๋‹ค!! ๐Ÿฑ‍๐Ÿ‰ ์˜ค๋Š˜์€ ์ง€๋‚œ์ฃผ ๊ธˆ์š”์ผ์— ๊ฐ์ž ์„ ์ •ํ•œ ๋จธ์‹ ๋Ÿฌ๋‹ ํ”„๋กœ์ ํŠธ๋ฅผ ๋ฐœํ‘œํ–ˆ๋‹ค. ์Šคํ„ฐ๋”” ์ธ์› ๋Œ€๋ถ€๋ถ„์ด ๋ฐ์ด์ฝ˜ ์˜ˆ์ œ๋กœ ํ”„๋กœ์ ํŠธ๋ฅผ ์ง„ํ–‰ํ•˜๊ณ  ์‹ค์ œ๋กœ ์™„์„ฑํ•œ ์ฝ”๋“œ ์ œ์ถœ๋„ ํ•˜์˜€๋”๋ผ! ๐Ÿ‘ ๊ฒฐ์ธก์น˜๊ฐ€ ์žˆ๊ณ , feature ๋ณ„๋กœ ํ˜•ํƒœ๊ฐ€ ๋‹ค๋ฅด๊ฑฐ๋‚˜, ์šฐ๋ฆฌ๊ฐ€ ๋ฐฐ์šด ํšŒ๊ท€๋งŒ ์“ฐ๋Š” ๊ฒƒ์ด ์•„๋‹Œ ๋ถ„๋ฅ˜ ๋“ฑ ๋‹ค์–‘ํ•œ ๋ชจ๋ธ์„ ํ•™์Šตํ•˜๋Š”๋ฐ ์“ฐ๋Š” ๋™๋ฃŒ๋“ค. ๋‚˜๋Š” ์ž‘๊ณ  ์†Œ์ค‘ํ•˜๊ณ  ๊น”๋”ํ•œ ๋ฐ์ดํ„ฐ .. ๋”๋ณด๊ธฐ
4/1 ๊ธˆ ๊ธˆ์š”์ผ! ๐Ÿ˜Ž ์–ด์ œ ์ž ๊น ์†Œ๊ฐœํ•œ Logistic Regression์„ ๋ฐฐ์šด๋‹ค~ Linear Regression(์—ฐ์†์ ์ธ ์ˆซ์ž ๊ฐ’ ์˜ˆ์ธก)์ด ๋ฐœ์ „ํ•œ ๊ฒƒ์ด Logistic Regression → Classification(๋ถ„๋ฅ˜๋ฅผ ํŒ๋‹จํ•˜๋Š” ์˜ˆ์ธก) - Binary Classification(์ดํ•ญ๋ถ„๋ฅ˜) - Multinomial Classification(๋‹คํ•ญ๋ถ„๋ฅ˜) ๊ทธ๋ž˜ํ”„๋ฅผ ๋ณผ ์ˆ˜ ์žˆ๋Š” ์œ ํ‹ธ๋ฆฌํ‹ฐ ๋ชจ๋“ˆ(mglearn)์„ ์ถ”๊ฐ€์ ์œผ๋กœ ์„ค์น˜ํ•˜์ž conda activate maching_TF15 pip install mglearn conda install์€ ์ด๋ฏธ ์„ค์น˜๋˜์–ด ์žˆ๋Š” ๋ชจ๋“ˆ, ํŒจํ‚ค์ง€์— ๋Œ€ํ•œ Dependency๋ฅผ ๊ณ ๋ คํ•ด์„œ ์ตœ์ ์ธ ๋ฒ„์ „์„ ์„ค์น˜, pip install์€ ๊ทธ๋ƒฅ ๊น”์•„๋ฒ„๋ฆผ Logistic Regression : L.. ๋”๋ณด๊ธฐ
3/31 ๋ชฉ ๋ชฉ์š”์ผ! ์˜ค๋Š˜์€ ์šฐ๋ฆฌ๊ฐ€ ์ฃผ๋ ฅ์œผ๋กœ ์‚ฌ์šฉํ•  Tensorflow๋ฅผ ๋ฐฐ์šด๋‹ค! ๐Ÿฑ‍๐Ÿ ์ˆ˜ํ–‰ํ‰๊ฐ€ ๋˜ ๋‚˜์™”๋„น.. ๋ฐ์ดํ„ฐ ํ•ธ๋“ค๋ง 2 + ๋จธ์‹ ๋Ÿฌ๋‹(๋‹ค์ค‘์„ ํ˜•ํšŒ๊ท€) 1. 4/5 ํ™”์š”์ผ๊นŒ์ง€ ์ œ์ถœ!! Ozone ๋ฐ์ดํ„ฐ๋กœ ๋‹ค์ค‘์„ ํ˜•ํšŒ๊ท€๋ฅผ 3๊ฐ€์ง€ ๋ฐฉ๋ฒ•(Python, Sklearn, Tensorflow)์œผ๋กœ ๊ตฌํ˜„, ์˜ˆ์ธก์น˜๊ฐ€ ๋น„์Šทํ•˜๊ฒŒ ๋‚˜์™€์•ผ ํ•œ๋‹ค! ๋‹น์—ฐํžˆ ๋ฐ์ดํ„ฐ ์ „์ฒ˜๋ฆฌ(๊ฒฐ์น˜๊ฐ’, ์ด์ƒ์น˜, ์ •๊ทœํ™”)๋„~ ๊ฐ€์žฅ ๋งŽ์ด ์“ฐ์ด๋Š” ๋”ฅ๋Ÿฌ๋‹ ์˜คํ”ˆ์†Œ์Šค ๋ผ์ด๋ธŒ๋Ÿฌ๋ฆฌ๋Š” Google์˜ Tensorflow์™€ Facebook์˜ PyTorch~ Sklearn์€ ๋ฐ์ดํ„ฐ ์–‘๊ณผ ๋ณ€์ˆ˜๊ฐ€ ๋งŽ์•„์ง€๋ฉด ์†๋„๊ฐ€ ๊ต‰์žฅํžˆ ๋Š๋ ค์ง€๊ธฐ ๋•Œ๋ฌธ์—, Tensorflow๋ฅผ ์ด์šฉํ•œ๋‹ค. Tensorflow 2.0 ver.์ด ๋“ฑ์žฅํ•˜๋ฉด์„œ ์ด์ „ ๋ฒ„์ „๊ณผ๋Š” ์™„์ „ํžˆ ๋‹ค๋ฅด๊ฒŒ ๋ฐ”๋€Œ์—ˆ๋‹ค. ๊ธฐ์กด์— ๋งŒ๋“ค์—ˆ๋˜ ๊ฐ€์ƒํ™˜๊ฒฝ(ma.. ๋”๋ณด๊ธฐ

728x90