Notice
Recent Posts
Recent Comments
Link
ยซ   2024/09   ยป
์ผ ์›” ํ™” ์ˆ˜ ๋ชฉ ๊ธˆ ํ† 
1 2 3 4 5 6 7
8 9 10 11 12 13 14
15 16 17 18 19 20 21
22 23 24 25 26 27 28
29 30
Tags more
Archives
Today
Total
๊ด€๋ฆฌ ๋ฉ”๋‰ด

๋ชฉ๋กAI (2)

๐Ÿ’ป ๐Ÿง

Seq2Seq

Seq2Seq ํƒœ๊ทธ: Attention Sequence-to-Sequence Structure ์ž…๋ ฅ ๋œ ์‹œํ€€์Šค๋กœ๋ถ€ํ„ฐ ๋‹ค๋ฅธ ๋„๋ฉ”์ธ์˜ ์‹œํ€€์Šค๋ฅผ ์ถœ๋ ฅํ•˜๋Š” ๋ชจ๋ธ. ์ฑ—๋ด‡๊ณผ ๋ฒˆ์—ญ์ด ๋Œ€ํ‘œ์ ์ธ ์˜ˆ์‹œ์ธ๋ฐ, ์ž…๋ ฅ์‹œํ€€์Šค์™€ ์ถœ๋ ฅ ์‹œํ€€์Šค๋ฅผ ๊ฐ๊ฐ ์งˆ๋ฌธ๊ณผ ๋Œ€๋‹ต์œผ๋กœ ๊ตฌ์„ฑํ•˜๋ฉด ์ฑ—๋ด‡์„ ๋งŒ๋“ค ์ˆ˜ ์žˆ๊ณ  ๋ฒˆ์—ญ๊ธฐ๋กœ๋„ ์‚ฌ์šฉ ํ•  ์ˆ˜ ์žˆ๋‹ค. ์ค„์—ฌ์„œ Seq2Seq ๋ถ€๋ฅธ๋‹ค. ๋ฒˆ์—ญ๊ธฐ๊ฐ€ Seq2Seq ๋ฅผ ํ†ตํ•ด ๋ฒˆ์—ญ์„ ํ•˜๋Š” ๊ตฌ์กฐ์ธ๋ฐ, ๊ทธ ์ž์„ธํ•œ ๋‚ด๋ถ€ ๊ตฌ์กฐ๊ฐ€ ์–ด๋–ป๊ฒŒ ์ƒ๊ฒผ๋Š”์ง€๋ฅผ ์•Œ์•„์•ผ ํ•œ๋‹ค. Seq2Seq ๋Š” ์ธ์ฝ”๋”์™€ ๋””์ฝ”๋”๋กœ ์ด๋ฃจ์–ด์ง€๋Š”๋ฐ ์ธ์ฝ”๋”๋Š” ์ž…๋ ฅ ๋ฌธ์žฅ์˜ ๋ชจ๋“  ๋‹จ์–ด๋“ค์„ ์ˆœ์ฐจ์ ์œผ๋กœ ์ž…๋ ฅ๋ฐ›์€ ๋’ค์— ํ•˜๋‚˜์˜ ๋ฒกํ„ฐ , ์˜๋ฏธ๋ฒกํ„ฐ ( Context Vector ) ๋กœ ๋งŒ๋“ ๋‹ค. ์ปจํ…์ŠคํŠธ ๋ฒกํ„ฐ๋กœ ์••์ถ•๋˜๋ฉด ๋””์ฝ”๋”๋กœ ๋ณด๋‚ด๋Š”๋ฐ ๋””์ฝ”๋”๋Š” ์ด ์ปจํ…์ŠคํŠธ ๋ฒกํ„ฐ๋ฅผ ๋””์ฝ”๋”์— ํ•™์Šต ๋œ ๋‹ค..

AI/Machine Learning 2024. 1. 28. 16:42
Learning Rate ( lr , ํ•™์Šต๋ฅ  )

ํ•™์Šต๋ฅ  ( Learning Rate ) Gradient Descent :๊ฒฝ์‚ฌํ•˜๊ฐ•๋ฒ• ์•Œ๊ณ ๋ฆฌ์ฆ˜์€ ๊ธฐ์šธ๊ธฐ์— ํ•™์Šต๋ฅ ๋˜๋Š” ๋ณดํญ์˜ ์Šค์นผ๋ผ๋ฅผ ๊ณฑํ•ด ๊ฐ’์˜ ๋‹ค์Œ์ง€์ ์„ ๊ฒฐ์ •ํ•œ๋‹ค. :Local Minimum ์— ํšจ์œจ์ ์œผ๋กœ ๋„๋‹ฌ ํ•  ์ˆ˜ ์žˆ๋„๋ก, ๋„ˆ๋ฌด ํฌ์ง€๋„ ์ž‘์ง€๋„ ์•Š์€ ์ ์ ˆํ•œ ๊ฐ’ ( lr ) ์„ ๊ฒฐ์ • ํ•ด์•ผ ํ•จ. ์‚ฌ์šฉ ๋œ ํ•™์Šต๋ฅ ์ด ํšจ์œจ์ ์ธ์ง€, ์ ์ ˆํ•œ์ง€๋Š” ๋‹ค์Œ์˜ ๊ทธ๋ฆผ์„ ํ†ตํ•ด์„œ ์•Œ ์ˆ˜ ์žˆ๋‹ค. low learning rate : ์†์‹ค ๊ฐ์†Œ๊ฐ€ ์„ ํ˜•์˜ ํ˜•ํƒœ๋ฅผ ๋ณด์ด๋ฉฐ ์ฒœ์ฒœํžˆ ํ•™์Šต ๋จ -> ๋„ˆ๋ฌด ๋Š๋ ค์„œ ์ž‘์—…์ด ์™„๋ฃŒ๋˜์ง€ ์•Š์„ ๊ฐ€๋Šฅ์„ฑ์ด ์žˆ์Œ. high leaning rate : ์†์‹ค ๊ฐ์†Œ๊ฐ€ ์ง€์ˆ˜์ ์ธ ํ˜•ํƒœ๋ฅผ ๋ณด์ด๋ฉฐ ๋น ๋ฅธ ํ•™์Šต์ด๊ฑฐ๋‚˜ ์™„์ „ ์ •์ฒด -> ํŠน์ • ๊ฐ’์— ์ˆ˜๋ ดํ•˜์ง€ ์•Š๊ณ  ๋ฐœ์‚ฐ ํ•  ๊ฐ€๋Šฅ์„ฑ์ด ํผ very high leaning rate : ๋งค์šฐ ๋†’..

AI 2020. 10. 29. 16:39