Commit graph

30 commits

Author SHA1 Message Date
buzz-lightsnack-2007
39516acf9a add: output results for various models and datasets 2024-12-07 23:45:28 +08:00
buzz-lightsnack-2007
a028413832 update answer format prompt for clarity and brevity 2024-12-07 23:45:12 +08:00
buzz-lightsnack-2007
00bade7489 remove: original notebook
The notebook's functionality was split to the various files.
2024-12-07 23:44:16 +08:00
buzz-lightsnack-2007
27e6dd6653 add: implement results analysis and grading functionality 2024-12-07 23:43:42 +08:00
buzz-lightsnack-2007
9fd769e5b9 add: testing results
These are large files and will need further processing.
2024-12-07 21:39:16 +08:00
buzz-lightsnack-2007
f5c6380b77 add: testing program
This script contains prompt generation and LLM testing.
2024-12-07 21:37:30 +08:00
buzz-lightsnack-2007
c9efe75b77 update prompts to make clear the usage of chain-of-thought 2024-12-05 23:52:41 +08:00
buzz-lightsnack-2007
1c03a7eeaf add outputs for descriptions tests 2024-12-04 13:40:09 +08:00
buzz-lightsnack-2007
ff5d24997a create a folder for prompts cache 2024-12-04 13:38:49 +08:00
buzz-lightsnack-2007
b5970cac26 move models and prompts to testing config folder 2024-12-04 13:08:55 +08:00
buzz-lightsnack-2007
3b94f13adc add testing for description 2024-12-04 13:03:02 +08:00
buzz-lightsnack-2007
14854ff20a updated ignored files 2024-12-04 11:41:11 +08:00
buzz-lightsnack-2007
adcd95107f split data segregation into another notebook 2024-12-04 11:41:06 +08:00
buzz-lightsnack-2007
d127b07d23 refactor sources data structure
identify the data source
2024-12-03 20:44:38 +08:00
buzz-lightsnack-2007
05b9abe3a6 force LLM to output the results properly 2024-08-30 23:32:27 +08:00
buzz-lightsnack-2007
9757b0beeb Generate initial blind testing results 2024-08-30 23:32:15 +08:00
buzz-lightsnack-2007
4a87bfad7c Refactor: Remove Gemini and use Gemma 2024-08-30 23:31:51 +08:00
buzz-lightsnack-2007
ab77659f14 make "answer format" prompt more specific 2024-08-25 12:50:17 +00:00
buzz-lightsnack-2007
c489bb4ee1 add LLM observation output 2024-08-24 06:13:06 +00:00
buzz-lightsnack-2007
67d11cd9cc add answer format prompt 2024-08-24 06:11:48 +00:00
buzz-lightsnack-2007
a0b15a48de add LLM prompts 2024-08-24 05:38:52 +00:00
buzz-lightsnack-2007
d3984ec9e1 secure the API keys 2024-08-23 09:31:51 +00:00
buzz-lightsnack-2007
9ee26671b1 change local provider from gpt4all to ollama 2024-08-22 15:50:27 +00:00
buzz-lightsnack-2007
6754eb8b0b write model testing section 2024-08-22 15:31:33 +00:00
buzz-lightsnack-2007
3d6a1f6b42 add Gemini API python library as a dep 2024-08-22 15:22:38 +00:00
buzz-lightsnack-2007
c034b70c9a add data preview and environment set-up notebook 2024-08-22 11:16:29 +00:00
buzz-lightsnack-2007
f14ee25d5b identify all Kaggle sources 2024-08-22 10:55:15 +00:00
buzz-lightsnack-2007
12dc7e7901 list global dependencies 2024-08-22 10:26:08 +00:00
buzz-lightsnack-2007
1ef7f83a00 update ignored files configuration 2024-08-22 10:25:52 +00:00
H. Saw
28c35ebc29 initial writeup 2024-08-22 09:33:58 +00:00