One of the contributors here. We have a statistical test script we can run on our branches to measure compile time, cost, and reliability. We want to try tree-of-thought prompting, and also something like https://www.reddit.com/r/ChatGPT/comments/14d7pfz/become_god.... That said, we found that when we asked GPT to first explain why a test case is failing and then correct that failure, rather than just asking it to correct the failure directly, costs unexpectedly went up and reliability went down.
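To make the comparison concrete, here is a minimal sketch of the two prompt styles being contrasted (direct fix vs. explain-then-fix). The function names and template wording are hypothetical, not the project's actual prompts:

```python
def direct_fix_prompt(code: str, failure: str) -> str:
    # Variant 1 (hypothetical): ask the model to repair the code in one step.
    return (
        "The following code fails a test.\n"
        f"Code:\n{code}\n"
        f"Test failure:\n{failure}\n"
        "Return a corrected version of the code."
    )

def explain_then_fix_prompt(code: str, failure: str) -> str:
    # Variant 2 (hypothetical): ask for a diagnosis first, then the fix.
    # This is the variant that, counterintuitively, raised cost and
    # lowered reliability in our tests.
    return (
        "The following code fails a test.\n"
        f"Code:\n{code}\n"
        f"Test failure:\n{failure}\n"
        "First, explain step by step why the test fails. "
        "Then return a corrected version of the code."
    )
```

The second variant produces longer completions (the explanation is billed as output tokens), which accounts for part of the cost increase on its own.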