Commit graph

343 commits

Author SHA1 Message Date
Adil Hafeez
5ed7e0b185 Merge branch 'adil/log_improvements' into adil/add_acm_demo 2025-01-23 14:23:06 -08:00
Adil Hafeez
4ab7665c30 log improvements 2025-01-23 14:22:16 -08:00
Adil Hafeez
411b6904b7 fix http hosts header 2025-01-23 11:05:24 -08:00
Adil Hafeez
613192f71c Merge branch 'main' into adil/add_acm_demo 2025-01-22 14:10:18 -08:00
Adil Hafeez
6740a09952
add docker-compose file for honeycomb tracing (#377) 2025-01-22 14:02:59 -08:00
Adil Hafeez
6887d52750
When using ollama token count was not coming in (#375)
When using ollama token count was not coming in resulting in token count and other metrics to show up as zero. This was not causing tracing to break.
2025-01-21 18:01:56 -08:00
Aayush
fcd8cfb9fc
add in honeycomb support for weather-forecast demo (#345) 2025-01-21 17:15:27 -08:00
Salman Paracha
bea0dd4a83
Update README.md 2025-01-21 10:56:17 -08:00
Salman Paracha
4bbf6c382e
Update README.md 2025-01-20 15:03:19 -08:00
Salman Paracha
8d1f132b75
Update README.md 2025-01-20 15:02:43 -08:00
Salman Paracha
966901d2a5
Update README.md 2025-01-20 14:46:14 -08:00
Salman Paracha
0fe0e775ee
Update README.md 2025-01-20 14:45:51 -08:00
Salman Paracha
e2ec2f6bb8
Salmanap/fix readme 019a (#373)
* updated README based on feedback on reddit

* fixed typo

* updating README with minor fixes

* more fixes to README

* updated README

* updated README

* updated README

---------

Co-authored-by: Salman Paracha <salmanparacha@MacBook-Pro-261.local>
2025-01-20 14:44:40 -08:00
Salman Paracha
c8b5137d37
updated README based on feedback on reddit (#372)
* updated README based on feedback on reddit

* fixed typo

---------

Co-authored-by: Salman Paracha <salmanparacha@MacBook-Pro-261.local>
2025-01-20 13:56:09 -08:00
Adil Hafeez
452084423c
add PR to release 0.1.9 (#371) 2025-01-17 18:47:26 -08:00
Adil Hafeez
253017e93d Merge branch 'main' into adil/add_acm_demo 2025-01-17 18:26:27 -08:00
Adil Hafeez
07ef3149b8
add support for using custom upstream llm (#365) 2025-01-17 18:25:55 -08:00
Adil Hafeez
5c066c9825 add more changes 2025-01-17 18:24:59 -08:00
Adil Hafeez
c532a5f4c7 Merge branch 'adil/fix_prompt_target_name' into adil/add_acm_demo 2025-01-17 17:50:40 -08:00
Adil Hafeez
1b3c1b8ba5 fix tests 2025-01-17 16:52:38 -08:00
Adil Hafeez
36c9c0d414 fix tests 2025-01-17 16:41:55 -08:00
Adil Hafeez
c235eaf762 fix test 2025-01-17 16:39:13 -08:00
Adil Hafeez
aca1631b49 add more changes 2025-01-17 16:33:17 -08:00
Adil Hafeez
a7b9458e5a fix rust tests 2025-01-17 11:00:36 -08:00
Adil Hafeez
46cca42040 fix more 2025-01-16 18:31:40 -08:00
Adil Hafeez
aa649d5d80 add schema validator for provider 2025-01-16 17:18:26 -08:00
Adil Hafeez
00e4ba55a8 ensure that only openai is used for provider 2025-01-16 17:13:45 -08:00
Adil Hafeez
c7f8c2cef9 add demo for ollama 2025-01-16 16:34:17 -08:00
Adil Hafeez
2928b7630f Merge branch 'main' into adil/fix_prompt_target_name 2025-01-16 15:14:57 -08:00
Adil Hafeez
3fc21de60c
Send per prompt target system prompt (#368)
* update prompt target name after arch_fc has identified tool

* add test for currency exchange
2025-01-16 15:11:37 -08:00
Adil Hafeez
5017e7931e fix tracing 2025-01-16 14:37:09 -08:00
Adil Hafeez
2413f56980 pending changes 2025-01-16 14:33:59 -08:00
Adil Hafeez
1a10b82724 pending changes 2025-01-14 16:32:52 -08:00
Adil Hafeez
9570b167db fix tracing 2025-01-14 11:56:10 -08:00
Adil Hafeez
35065e2e41 Merge branch 'main' into adil/fix_prompt_target_name 2025-01-14 10:20:49 -08:00
Adil Hafeez
a24d62af1a
add github pull request in vscode (#367) 2025-01-14 10:20:27 -08:00
Adil Hafeez
b8474f42c9 Merge branch 'main' into adil/fix_prompt_target_name 2025-01-13 15:53:40 -08:00
Shuguang Chen
88a02dc478
Some fixes on model server (#362)
* Some fixes on model server

* Remove prompt_prefilling message

* Fix logging

* Fix poetry issues

* Improve logging and update the support for text truncation

* Fix tests

* Fix tests

* Fix tests

* Fix modelserver tests

* Update modelserver tests
2025-01-10 16:45:36 -08:00
Salman Paracha
ebda682b30
updated docs for 0.1.8 support (#366)
* updated docs for 0.1.8 support

* updated REAMDE on root

* updated version reference to 0.1.8 in other parts of the repo

---------

Co-authored-by: Salman Paracha <salmanparacha@MacBook-Pro-261.local>
2025-01-10 16:38:48 -08:00
Adil Hafeez
516d9a7c4a update prompt target name after arch_fc has identified tool 2025-01-10 15:28:33 -08:00
Adil Hafeez
42ab061971 pending 2025-01-10 12:52:29 -08:00
Adil Hafeez
e55127d325 remove extra http_method 2025-01-08 16:58:43 -08:00
Adil Hafeez
68097fde07 Merge branch 'main' into adil/add_acm_demo 2025-01-08 16:55:07 -08:00
Adil Hafeez
dae6239b81
use per user docker socket if system docker socket doesn't exist (#361)
* use per user docker socket if system docker socket doesn't exist

* add retry
2025-01-08 14:55:42 -08:00
Adil Hafeez
aa11113cea
pin poetry to 1.8.5 (#358) 2025-01-06 14:23:46 -08:00
Adil Hafeez
8407edae99
only test currency exchange in demo test (#348) 2024-12-21 11:33:08 -08:00
Shuguang Chen
ba7279becb
Use intent model from archfc to pick prompt gateway (#328) 2024-12-20 13:25:01 -08:00
Aayush
67b8fd635e
add more granular bucket sizes for ttft (#343)
* add more granular bucket sizes for ttft
2024-12-12 14:38:36 -08:00
José Ulises Niño Rivera
cd1b561192
Break apart metrics into their own module (#335)
Signed-off-by: José Ulises Niño Rivera <junr03@users.noreply.github.com>
2024-12-09 10:46:46 -08:00
José Ulises Niño Rivera
d002b2042a
Break apart common_types mod (#334)
Signed-off-by: José Ulises Niño Rivera <junr03@users.noreply.github.com>
2024-12-06 17:25:42 -08:00