cover_sfi19.png

19th edition

of SFI IT Academic Festival

cover_sfi19.png

19th edition

2024

Czy modele językowe potrafią knuć?

Edition: 19th SFI Academic Festiwal

Date: April 5, 2024, 8:30 p.m.

Type: Lightning Talks

Category: AI

cover_sfi19.png
Speaker
Abstract

I will present experiments, where language models of different architectures need to solve a multi-step task, but doing all the steps in memory, without writing them down. Recurrent architectures enable such hidden reasoning, which is risky, because it means that we don't always have access into the model's "thoughts". On the other hand transformers (f.e. GPT) are forced to write down intermediate steps, which usually gives us such access (but not always)

Duration
30 min

Gold Sponsors