Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Many-shot ICL #53

Open
agarwl opened this issue Jun 5, 2024 · 1 comment
Open

Many-shot ICL #53

agarwl opened this issue Jun 5, 2024 · 1 comment

Comments

@agarwl
Copy link

agarwl commented Jun 5, 2024

Seems like many-shot prompting seems to help on several of the existings tasks here (Big-bench hard, MATH, GSM8K, GPQA).

Not sure what's the process but seems like worth a mention / including it here.

https://arxiv.org/abs/2404.11018

Also, works for Claude-3 (many-shot jailbreaking paper) and gpt-4o in multimodal tasks (many-shot ICL in multimodal tasks).

@Harsha-Nori
Copy link
Collaborator

Hey Rishabh!! Been a long time since we chatted about NAMs/GAMs :)

Wow, this is a really cool paper -- thanks for sharing! As corroborating evidence, in medprompt we did ablations up to k=20 few shots and found continued performance improvements (e.g. 90.2 -> 90.6 on medQA when going from 5 shots to 20 shots), but wanted to keep the inference budget reasonable for the "standard" algorithm configuration. We didn't ablate beyond that, so it's really cool to see it studied so rigorously.

image

https://arxiv.org/pdf/2311.16452

Happy to add a link to your paper in the readme when I'm back at my desk, and excited to read it more thoroughly too. Would be fun to catch up sometime!

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants