Uh oh! Wolfram|Alpha doesn't run without JavaScript.
Please enable JavaScript. If you don't know how, you can find instructions
here
.
Once you've done that, refresh this page to start using Wolfram|Alpha.
In einem Transformer: Attention-Score für Q = (1, 1) , K = (3, 4) ist Q * K / \sqrt 2
In einem Transformer: Attention-Score für Q = (1, 1) , K = (3, 4) ist Q * K / \sqrt 2
Natural Language
Math Input
Extended Keyboard