๋ณธ๋ฌธ ๋ฐ”๋กœ๊ฐ€๊ธฐ

728x90

์œ ์‚ฌ๋„

11/2 ์ˆ˜ 1. ์œ ์‚ฌ๋„ - ์ž์นด๋“œ ์œ ์‚ฌ๋„(Jaccard Similarity): ์ง‘ํ•ฉ ๊ธฐ๋ฐ˜ A, B ๋‘ ๊ฐœ์˜ ์ง‘ํ•ฉ์ด ์žˆ๋‹ค๊ณ  ํ•  ๋•Œ, ํ•ฉ์ง‘ํ•ฉ์—์„œ ๊ต์ง‘ํ•ฉ์˜ ๋น„์œจ์„ ๊ตฌํ•˜๋Š” ๊ฒƒ 0~1 ์‚ฌ์ด์˜ ๊ฐ’์„ ๊ฐ€์ง (๋‘ ์ง‘ํ•ฉ์ด ๋™์ผํ•˜๋ฉด 1, ๋‘ ์ง‘ํ•ฉ์˜ ๊ต์ง‘ํ•ฉ์ด ์—†๋‹ค๋ฉด 0 ๊ฐ’์„ ๊ฐ€์ง) ๊ฐ’์ด 1์— ๊ฐ€๊นŒ์šธ์ˆ˜๋ก ๋‘ ๋ฌธ์žฅ์ด ์œ ์‚ฌํ•œ ๋ฌธ์žฅ, 0์— ๊ฐ€๊นŒ์šธ์ˆ˜๋ก ์œ ์‚ฌํ•˜์ง€ ์•Š์€ ๋ฌธ์žฅ์œผ๋กœ ๋ถ„๋ฅ˜ ๊ฐ ์•„์ดํ…œ(๋‹จ์–ด)๋“ค์„ Binary๊ฐ’(0,1)์œผ๋กœ ๋ณ€ํ™˜ํ•˜์—ฌ ๊ต์ง‘ํ•ฉ/ํ•ฉ์ง‘ํ•ฉ์„ ๊ตฌํ•จ - ์ฝ”์‚ฌ์ธ ์œ ์‚ฌ๋„(Cosine Similarity): ๋ฒกํ„ฐ(๊ฐ๋„) ๊ธฐ๋ฐ˜ ๋‘ ๋ฒกํ„ฐ(๋ฐฉํ–ฅ์„ฑ)๊ฐ„์˜ ์ฝ”์‚ฌ์ธ ๊ฐ๋„๋ฅผ ์ด์šฉํ•˜์—ฌ ๊ตฌํ•˜๋Š” ๋‘ ๋ฒกํ„ฐ์˜ ์œ ์‚ฌ๋„๋ฅผ ์˜๋ฏธ -1~1 ์‚ฌ์ด์˜ ๊ฐ’์„ ๊ฐ€์ง ๊ฐ’์ด 1์— ๊ฐ€๊นŒ์šธ์ˆ˜๋ก(๋‘ ๋ฒกํ„ฐ์˜ ๋ฐฉํ–ฅ์ด ๊ฐ™์„์ˆ˜๋ก) ์œ ์‚ฌ์„ฑ์ด ๋†’์Œ (๋‘ ๋ฒกํ„ฐ๊ฐ€ ์„œ๋กœ 90°์˜ ๊ฐ์„ ์ด๋ฃจ๋ฉด ์ฝ”์‚ฌ์ธ ์œ ์‚ฌ.. ๋”๋ณด๊ธฐ

728x90