AI ๋ฐ ML ๋‰ด์Šค

๐Ÿค– ์ƒ์„ฑ์  AI 100์ผ - 3์ผ์ฐจ - ์ฃผ์˜๋ ฅ์ด ์ „๋ถ€์ž…๋‹ˆ๋‹ค ๐Ÿค–

'Attention Is All You Need'๋ผ๋Š” ํ•œ ์—ฐ๊ตฌ ๋…ผ๋ฌธ์€ ๋ชจ๋‘๊ฐ€ ์ฝ์–ด์•ผ ํ•˜๋Š” ๋…ผ๋ฌธ์ž…๋‹ˆ๋‹ค. ์ด ๋…ผ๋ฌธ์€ GPT(Generative Pre-trained Transformer)์—์„œ 'T'๋ฅผ ๋‚˜ํƒ€๋‚ด๋Š” Transformer ๊ตฌ์กฐ๋ฅผ ์†Œ๊ฐœํ–ˆ์Šต๋‹ˆ๋‹ค. ์ด ๋…ผ๋ฌธ์€ ๊ฝค ๋ณต์žกํ•˜๋ฏ€๋กœ, ๊ทธ๋ž˜ํ”ฝ์Šค์™€ ๋” ์‰ฌ์šด ํ…์ŠคํŠธ๋ฅผ ํฌํ•จํ•˜๋Š” Jay์˜ ์ž‘์—…์„ ํ™•์ธํ•  ๊ฒƒ์„ ์ถ”์ฒœํ•ฉ๋‹ˆ๋‹ค. โœ… ์ง€๊ธˆ๊นŒ์ง€์˜ ๋‚˜์˜ ์ดํ•ด ์š”์•ฝ ์ด ๋…ผ๋ฌธ์€ ์ž์—ฐ์–ด ์ฒ˜๋ฆฌ(NLP) ๋ถ„์•ผ์—์„œ ํš๊ธฐ์ ์ธ ๋ชจ๋ธ์ธ Transformer๋ฅผ ์†Œ๊ฐœํ•ฉ๋‹ˆ๋‹ค. ๊ธฐ์กด์˜ ์‹œํ€€์Šค-ํˆฌ-์‹œํ€€์Šค ๋ชจ๋ธ์ด ์žฌ๊ท€ ์‹ ๊ฒฝ๋ง(RNN) ๋˜๋Š” ํ•ฉ์„ฑ ์‹ ๊ฒฝ๋ง(CNN)์— ์˜์กดํ•˜๋Š” ๋ฐ˜๋ฉด, Transformer๋Š” ์ž…๋ ฅ๊ณผ ์ถœ๋ ฅ ๊ฐ„์˜ ์˜์กด์„ฑ์„ ์ฒ˜๋ฆฌํ•˜๋Š” ๋ฐ ์žˆ์–ด ์‹œํ€€์Šค ๊ฑฐ๋ฆฌ๋ฅผ ๊ณ ๋ คํ•˜์ง€ ์•Š๊ณ  ์ž์ฒด ์ฃผ์˜ ๋ฉ”์ปค๋‹ˆ์ฆ˜์„ ์‚ฌ์šฉํ•ฉ๋‹ˆ๋‹ค. ์ด๋Ÿฌํ•œ ๊ตฌ์กฐ๋Š” ํ›ˆ๋ จ ์ค‘์— ๋” ๋งŽ์€ ๋ณ‘๋ ฌํ™”๋ฅผ ํ—ˆ์šฉํ•˜์—ฌ ํ›ˆ๋ จ ์†๋„๋ฅผ ํฌ๊ฒŒ ๊ฐœ์„ ํ•ฉ๋‹ˆ๋‹ค. ์ด ๋ชจ๋ธ์€ ๋‹ค์–‘ํ•œ ํƒœ์Šคํฌ์—์„œ ์ตœ๊ณ ์˜ ์„ฑ๊ณผ๋ฅผ ๋‹ฌ์„ฑํ•ฉ๋‹ˆ๋‹ค. ํŠนํžˆ ๊ธฐ๊ณ„ ๋ฒˆ์—ญ์—์„œ ๊ทธ๋ ‡์Šต๋‹ˆ๋‹ค. โœ… ๋‹ค๋ฅธ ์ฃผ์š” ํ•˜์ด๋ผ์ดํŠธ 1๏ธโƒฃ ์ž์ฒด ์ฃผ์˜ ๋ฉ”์ปค๋‹ˆ์ฆ˜: ์ด ๋ฉ”์ปค๋‹ˆ์ฆ˜์€ ๋ชจ๋ธ์ด ๋ฌธ์žฅ์—์„œ ๋‹ค๋ฅธ ๋‹จ์–ด์˜ ์ค‘์š”์„ฑ์„ ํ‰๊ฐ€ํ•  ์ˆ˜ ์žˆ๋„๋ก ํ—ˆ์šฉํ•˜์—ฌ ํšจ์œจ์ ์œผ๋กœ่ฟœ่ท็ฆป ์˜์กด์„ฑ์„ ํฌ์ฐฉํ•ฉ๋‹ˆ๋‹ค. 2๏ธโƒฃ ๋ณ‘๋ ฌํ™”: Transformer ๋ชจ๋ธ์€ ์‹œํ€€์Šค์—์„œ ๋ชจ๋“  ๋‹จ์–ด๋ฅผ ๋™์‹œ์— ์ฒ˜๋ฆฌํ•˜์—ฌ RNNs์™€ CNNs์— ๋น„ํ•ด ํ›ˆ๋ จ ์‹œ๊ฐ„์„ ํฌ๊ฒŒ ๋‹จ์ถ•ํ•ฉ๋‹ˆ๋‹ค. 3๏ธโƒฃ ์„ฑ๋Šฅ: ๊ธฐ๊ณ„ ๋ฒˆ์—ญ ํƒœ์Šคํฌ์—์„œ ์ตœ๊ณ ์˜ ์„ฑ๊ณผ๋ฅผ ๋‹ฌ์„ฑํ•˜์—ฌ WMT 2014 ์˜์–ด-๋…์ผ์–ด ๋ฐ ์˜์–ด-ํ”„๋ž‘์Šค์–ด ๋ฒˆ์—ญ ๋ฐ์ดํ„ฐ์…‹์—์„œ ์ƒˆ๋กœ์šด ๋ฒค์น˜๋งˆํฌ๋ฅผ ์„ค์ •ํ•ฉ๋‹ˆ๋‹ค. ๐Ÿ”— ์ฐธ์กฐ ๋…ผ๋ฌธ: https://proceedings.neurips.cc/paper_files/paper/2017/file/3f5ee243547dee91fbd053c1c4a845aa-Paper.pdf ๐Ÿ”— Jay ๋ธ”๋กœ๊ทธ: https://jalammar.github.io/illustrated-transformer/
favicon
dev.to
๐Ÿค– 100 Days of Generative AIโ€Š-โ€ŠDay 3โ€Š-โ€ŠAttention Is All You Needย ๐Ÿค–
Create attached notes ...