Hacker Times
new
|
past
|
comments
|
ask
|
show
|
jobs
|
submit
login
umanwizard
11 months ago
|
parent
|
context
|
favorite
| on:
AccountingBench: Evaluating LLMs on real long-hori...
It gets it right for me...
https://chatgpt.com/share/687e8c28-7714-800c-abf4-e9cd3ce87b...
yoyohello13
11 months ago
|
next
[–]
Ah, wouldn’t be an LLM discussion thread without one of these “it works/doesn’t” conversations.
mdaniel
11 months ago
|
parent
|
next
[–]
If it makes you feel any better, the other infamous one "I spend so much time chasing hallucinations, I could have done it myself" is currently a sibling comment
riku_iki
11 months ago
|
prev
[–]
There were so many embarrassing topics about this, that openai for sure added it to training dataset with high priority
Guidelines
|
FAQ
|
Lists
|
API
|
Security
|
Legal
|
Apply to YC
|
Contact
Search: