๋ณธ๋ฌธ ๋ฐ”๋กœ๊ฐ€๊ธฐ

728x90

๋ฉ€ํ‹ฐ์บ ํผ์Šค

4/14 ๋ชฉ ๋ชฉ์š”์ผ! ์˜ค๋Š˜์€ CNN์„ ์‹ค์ œ ์ฝ”๋“œ๋กœ ๊ตฌํ˜„ํ•ด๋ณธ๋‹ค! ์ด๋ฏธ์ง€ ํ•œ ์žฅ : 2์ฐจ์›(X, T) → Convolution Layer :: Feature Map : 2์ฐจ์› ์—ฌ๋Ÿฌ ๊ฐœ → Activation Map : 3์ฐจ์› → X ๋ฐ์ดํ„ฐ(์ด๋ฏธ์ง€ ์ •๋ณด. ์ด๋ฏธ์ง€ ๊ฐœ์ˆ˜, ์„ธ๋กœ, ๊ฐ€๋กœ, Channel) : 4์ฐจ์› Data โ–ถ conv : ํŠน์ง•์„ ๋ฝ‘์•„๋‚ธ ์ด๋ฏธ์ง€๊ฐ€ ์—ฌ๋Ÿฌ ์žฅ์ด ๋˜๋„๋ก ๋ฐ˜๋ณต ์ž‘์—…(์ด๋ฏธ์ง€ ๊ฐœ์ˆ˜, Feature Map ์„ธ๋กœ, Feature Map ๊ฐ€๋กœ, filter์˜ ๊ฐœ์ˆ˜) โ–ถ Pooling Layer :: conv ์ž‘์—…์„ ๊ฑฐ์นœ ์—ฌ๋Ÿฌ ์žฅ์˜ ๋ฐ์ดํ„ฐ ์‚ฌ์ด์ฆˆ๋ฅผ ์ค„์ž„ โ–ถ conv :: Pooling Layer๋ฅผ ๊ฑฐ์นœ ๋ฐ์ดํ„ฐ์˜ ํŠน์ง•์„ ๋˜ ๋ฝ‘์•„๋ƒ„ โ–ถ FLATTEN :: 4์ฐจ์› → 2์ฐจ์›(batch size ํฌํ•จํ•  ๋•Œ) 1. Cha.. ๋”๋ณด๊ธฐ
4/13 ์ˆ˜ ์ˆ˜์š”์ผ! CNN(Convolutional Neural Network, convnet. ํ•ฉ์„ฑ๊ณฑ ์‹ ๊ฒฝ๋ง)์œผ๋กœ ๋“ค์–ด๊ฐ„๋‹ค! Deep Learning(Deep Neural Network)์˜ ์ข…๋ฅ˜ : - Computer Vision : ์ปดํ“จํ„ฐ๊ฐ€ ์ด๋ฏธ์ง€๋‚˜ ๋น„๋””์˜ค๋ฅผ ๋ณด๊ณ  ์—ฌ๋Ÿฌ ๊ฐ์ฒด๋ฅผ ์‹๋ณ„ํ•˜๊ณ  ํŒŒ์•…ํ•  ์ˆ˜ ์žˆ๋„๋ก ์ง€์›ํ•˜๋Š” Computer Science ๋ถ„์•ผ. ๋Œ€ํ‘œ์ ์ธ ์•Œ๊ณ ๋ฆฌ์ฆ˜์€ CNN. ๋ชฉ์ ์€ pixel์„ ์ดํ•ดํ•˜๋Š” ๋ฐฉ๋ฒ•์„ ๊ณ ์•ˆํ•˜๋Š” ๊ฒƒ - NLP(Natural Language Process) : ์ž์—ฐ์–ด ์ฒ˜๋ฆฌ. ๋Œ€ํ‘œ์ ์ธ ์•Œ๊ณ ๋ฆฌ์ฆ˜์€ RNN, LSTM ์ด๋ฏธ์ง€๋ฅผ ์ด๋ฃจ๋Š” ๊ฐ€์žฅ ์ž‘์€ ๋‹จ์œ„ → pixel ์ด๋ฏธ์ง€ ์ขŒํ‘œ๊ณ„ (Image coordinate) - 2์ฐจ์› ndarray๋กœ ํ‘œํ˜„ - pixel (์„ธ๋กœ, ๊ฐ€๋กœ) 1. ์ด๋ฏธ์ง€ ์ฒ˜๋ฆฌ(Im.. ๋”๋ณด๊ธฐ
4/12 ํ™” ํ™”์š”์ผ! ํ˜„์žฌ์˜ Deep Learning์ด ์–ด๋Š ์ •๋„ ํšจ์œจ์„ ๋‚ด๊ธฐ ์‹œ์ž‘ํ•œ ์ด์œ ์— ๋Œ€ํ•ด ๊ณต๋ถ€ํ•œ๋‹ค. Weight์™€ Bias๋ฅผ ๋žœ๋ค ์ดˆ๊ธฐ๊ฐ’์œผ๋กœ ์‚ฌ์šฉํ•˜๋˜ ๊ฒƒ์„ Xievier/He's Initialization์œผ๋กœ ๋Œ€์ฒดํ•˜๊ณ , Vanishing Gradient ํ˜„์ƒ์„ Back-Propagation(ํ–‰๋ ฌ์—ฐ์‚ฐ, ์—ญ์˜ ๋ฐฉํ–ฅ์œผ๋กœ W, b๋ฅผ Update)๊ณผ Activation ํ•จ์ˆ˜๋ฅผ Sigmoid ๋Œ€์‹  ReLU๋ฅผ ์‚ฌ์šฉํ•˜๊ณ , ๊ณ„์‚ฐํ•ด์•ผ ํ•˜๋Š” W, b๋ฅผ Drop-out์œผ๋กœ ์—ฐ์‚ฐ์— ์‚ฌ์šฉ๋˜๋Š” Node๋ฅผ ์ค„์ž„์œผ๋กœ์จ ํ•ด๊ฒฐํ•จ 1. Multinomial Classification by Tensorflow 1.15 ver. import numpy as np import pandas as pd import tensorflow as tf imp.. ๋”๋ณด๊ธฐ
4/11 ์›” ์›”์š”์ผ! ์˜ค๋Š˜์€ ๋”ฅ๋Ÿฌ๋‹๊ณผ ์‹ ๊ฒฝ๋ง์— ๋Œ€ํ•ด ๋ฐฐ์šด๋‹ค! Perceptron์€ Neuron ํ•œ ๊ฐœ Deep Learning : ํ•œ ๊ฐœ์˜ Logistic Regression์„ ํ‘œํ˜„ํ•˜๋Š” node๊ฐ€ ์„œ๋กœ ์—ฐ๊ฒฐ๋˜์–ด ์žˆ๋Š” ์‹ ๊ฒฝ๋ง ๊ตฌ์กฐ๋ฅผ ๋ฐ”ํƒ•์œผ๋กœ ํ•œ ๊ฐœ์˜ ์ž…๋ ฅ์ธต, ํ•œ ๊ฐœ ์ด์ƒ์˜ ์€๋‹‰์ธต(๋งŽ์„์ˆ˜๋ก ํ•™์Šต์ด ์ž˜ ๋จ. 1~3๊ฐœ๊ฐ€ ์ ๋‹น), ํ•œ ๊ฐœ์˜ ์ถœ๋ ฅ์ธต์œผ๋กœ ์ด๋ฃจ์–ด์ง feed forward(propagation). XOR ๋ฌธ์ œ๋Š” node ํ•œ ๊ฐœ(perceptron)๋กœ๋Š” ํ•™์Šต์ด ์•ˆ ๋จ 1. Perceptron. GATE ์—ฐ์‚ฐ(AND, OR, XOR ์—ฐ์‚ฐ์ž)์„ Logistic Regression๊ณผ Tensorflow 1.5 Ver.์œผ๋กœ ๊ตฌํ˜„ import numpy as np import tensorflow as tf from sklear.. ๋”๋ณด๊ธฐ
4/8 ๊ธˆ ๊ธˆ์š”์ผ! ๐Ÿฑ‍๐Ÿ ์˜ค๋Š˜์€ Regression์„ ๋๋‚ธ๋‹ค~~ 4/11 ์›”์š”์ผ์€ ๋จธ์‹ ๋Ÿฌ๋‹ ํ•„๋‹ต ํ‰๊ฐ€, 4/17 ์ผ์š”์ผ์€ ์ˆ˜ํ–‰ํ‰๊ฐ€ 4๊ฐ€์ง€ ์ œ์ถœ์ด ์žˆ๋‹ค. ๊ฒฐ์ธก์น˜ ์ฒ˜๋ฆฌ๋Š” ์‚ญ์ œํ•˜๊ฑฐ๋‚˜, imputation(๋ณด๊ฐ„, ๋Œ€์ฒด) - ํ‰๊ท ํ™” ๊ธฐ๋ฒ•(๋…๋ฆฝ๋ณ€์ˆ˜๋ฅผ ๋Œ€ํ‘œ๊ฐ’์œผ๋กœ ๋Œ€์ฒด), ๋จธ์‹ ๋Ÿฌ๋‹ ๊ธฐ๋ฒ•(์ข…์†๋ณ€์ˆ˜๊ฐ€ ๋Œ€์ƒ. KNN) KNN(K-Nearest Neighbors, K-์ตœ๊ทผ์ ‘ ์ด์›ƒ) : hyperparameter๋Š” k(=1์ผ ๋•Œ ์–ด๋Š ์ •๋„์˜ ์„ฑ๋Šฅ ๋ณด์žฅ)์™€ ๊ฑฐ๋ฆฌ์ธก์ • ๋ฐฉ์‹(์ฃผ๋กœ ์œ ํด๋ผ๋””์•ˆ ์‚ฌ์šฉ) ๋ฐ˜๋“œ์‹œ ์ •๊ทœํ™”๋ฅผ ์ง„ํ–‰ํ•ด์•ผ ํ•จ. ๋ชจ๋“  ๋ฐ์ดํ„ฐ์— ๋Œ€ํ•ด ๊ฑฐ๋ฆฌ๋ฅผ ๊ณ„์‚ฐํ•ด์•ผ ํ•˜๋ฏ€๋กœ ์‹œ๊ฐ„์ด ์˜ค๋ž˜ ๊ฑธ๋ฆด ์ˆ˜ ์žˆ์Œ 1. Logistic Regression + KNN - BMI data import numpy as np import pandas as pd fro.. ๋”๋ณด๊ธฐ
4/7 ๋ชฉ ๋ชฉ์š”์ผ! ์˜ค๋Š˜๋„ Multinomial Classification๋ฅผ ๋Œ€ํ‘œ์ ์ธ ์˜ˆ์ œ(MNIST)๋ฅผ ํ†ตํ•ด ๋ฐฐ์šด๋‹ค~ ์†์œผ๋กœ ์“ด ์ˆซ์ž๋“ค๋กœ ์ด๋ฃจ์–ด์ง„ ๋Œ€ํ˜• ๋ฐ์ดํ„ฐ๋ฒ ์ด์Šค~ MNIST ์ด๋ฏธ์ง€๋Š” ๊ทธ ์ž์ฒด๊ฐ€ 2์ฐจ์›์ด๊ณ  ๊ทธ๋Ÿฐ ๊ฒŒ ์—ฌ๋Ÿฟ์ด๊ธฐ ๋•Œ๋ฌธ์— 3์ฐจ์›. ์ด๋ฏธ์ง€๋ฅผ 1์ฐจ์›์œผ๋กœ ravel() ํ•ด์•ผ ํ•จ https://www.kaggle.com/competitions/digit-recognizer/data?select=test.csv Digit Recognizer | Kaggle www.kaggle.com Tensorflow Ver. 1.15์€ ๋ฐฐ์šด ์ด๋ก ์„ ์ฝ”๋“œ๋กœ ์ดํ•ดํ•˜๊ธฐ์—๋Š” ์ข‹์ง€๋งŒ ์ฝ”๋“œ๊ฐ€ ๋„ˆ๋ฌด ์–ด๋ ต๋‹ค. 1. Multinomial Classification by Tensorflow Ver. 1.15 - MNIST import nump.. ๋”๋ณด๊ธฐ
4/6 ์ˆ˜ ์ˆ˜์š”์ผ! ์˜ค๋Š˜์€ Multinomial Classification์„ ๋ฐฐ์šด๋‹ค. Linear Regression(์—ฐ์†์ ์ธ ์ˆซ์ž ๊ฐ’ ์˜ˆ์ธก)์ด ๋ฐœ์ „ํ•œ ๊ฒƒ์ด Logistic Regression → Classification(๋ถ„๋ฅ˜๋ฅผ ํŒ๋‹จํ•˜๋Š” ์˜ˆ์ธก) - Binary Classification(์ดํ•ญ๋ถ„๋ฅ˜) - Multinomial Classification(๋‹คํ•ญ๋ถ„๋ฅ˜) Logistic Regression์€ ์ด์ง„ ๋ถ„๋ฅ˜์— ํŠนํ™”๋จ SKlearn์ด ์ œ๊ณตํ•˜๋Š” ๋ถ„๋ฅ˜๊ธฐ์ธ Gradient Descent(๊ฒฝ์‚ฌํ•˜๊ฐ•๋ฒ•)๊ฐ€ ๋ฐœ์ „ํ•œ ํ˜•ํƒœ์ธ SGD Classifier(Stochastic Gradient Descent, ํ™•๋ฅ ์  ๊ฒฝ์‚ฌํ•˜๊ฐ•๋ฒ•) 1. Binary Classification - ์œ„์Šค์ฝ˜์‹  ์œ ๋ฐฉ์•” ๋ฐ์ดํ„ฐ by Gradient Descent Cl.. ๋”๋ณด๊ธฐ
4/5 ํ™” ํ™”์š”์ผ! Logistic Regression์„ ํ™œ์šฉํ•ด ๋จธ์‹ ๋Ÿฌ๋‹ ์ง„ํ–‰ ์‹œ ์ฃผ์˜์‚ฌํ•ญ์„ ์•Œ์•„๋ณธ๋‹ค. ์•ž์œผ๋กœ ์šฐ๋ฆฌ๋Š” Classification(์ดํ•ญ๋ถ„๋ฅ˜)์˜ Metrics๋กœ Accuracy๋ฅผ ์‚ฌ์šฉํ•  ์˜ˆ์ •์ด๋‹ค. ๋ชจ๋ธ ํ‰๊ฐ€ ์ „ ๊ณ ๋ คํ•ด์•ผ ํ•˜๋Š” ๊ฒƒ๋“ค 1. learning rate(ํ•™์Šต๋ฅ ) : loss ๊ฐ’์„ ๋ณด๋ฉด์„œ ํ•™์Šต๋ฅ ์„ ์กฐ์ •ํ•ด์•ผ ํ•จ. ๋ณดํ†ต 1์˜ ๋งˆ์ด๋„ˆ์Šค 4์Šน์œผ๋กœ ์žก์Œ ํ•™์Šต๋ฅ ์ด ๋„ˆ๋ฌด ํฌ๋‹ค๋ฉด global minima(W')๋ฅผ ์ฐพ์„ ์ˆ˜ ์—†๊ฒŒ ๋จ → OverShooting ๋ฐœ์ƒ ํ•™์Šต๋ฅ ์ด ์•„์ฃผ ์ž‘๋‹ค๋ฉด local minima ์ฐพ๊ฒŒ ๋จ 2. Normalization(์ •๊ทœํ™”) : MinMax Scaling - 0 ~ 1. ์ด์ƒ์น˜์— ๋ฏผ๊ฐํ•จ Standardization - ํ‘œ์ค€ํ™”, Z-Score. ์ƒ๋Œ€์ ์œผ๋กœ ์ด์ƒ์น˜์— ๋‘”๊ฐํ•จ, ๋ชจ๋“  ์นผ๋Ÿผ์—.. ๋”๋ณด๊ธฐ

728x90