Posted inTechnology News
anton on Twitter: “Interesting, George Hotz mentioning GPT-4 size/architecture in a recent podcase …
anton on Twitter: "Interesting, George Hotz mentioning GPT-4 size/architecture in a recent podcase ... GPT-4: 8 x 220B experts trained with different data/task distributions and 16-iter inference. Glad that Geohot…