Adil Hafeez
5421953ea9
Merge branch 'main' into collect-stats-in-stream-context
2024-11-12 11:01:16 -08:00
Adil Hafeez
d87105882b
update rust toolchain to 1.82 ( #255 )
...
* update rust to 1.82 pin it, also update envoy to 1.32 and python to 3.13
* use python:3.12
2024-11-12 10:35:14 -08:00
aayushwhiz
6fc32b0152
update weather_forecast demo to spin up grafana and prometheus when using monitoring profile
...
has full dashboard with total requests, time per output token, time to
first token, total latency, output sequence length, and input sequence
length.
2024-11-11 17:00:48 -08:00
Salman Paracha
4b2b371876
removing dependency on mistral keys ( #256 )
2024-11-08 16:09:04 -08:00
Adil Hafeez
6b62662e01
update docs with weather_forecast path ( #253 )
2024-11-08 10:00:15 -08:00
Adil Hafeez
a72bb804eb
add support for jaeger tracing ( #229 )
2024-11-07 22:11:00 -06:00
Salman Paracha
21f4e7a5e4
fixing ports in arch_config for demos ( #241 )
...
Co-authored-by: Salman Paracha <salmanparacha@MacBook-Pro-261.local>
2024-10-31 14:37:04 -07:00
Salman Paracha
3bff4a597b
fix ports and update README for paths to agent/chat ( #240 )
...
* fix ports and update README for paths to agent/chat
* minor fix
---------
Co-authored-by: Salman Paracha <salmanparacha@MacBook-Pro-261.local>
2024-10-31 09:25:24 -07:00
Salman Paracha
dab7a44053
several fixes to demos ( #238 )
...
Co-authored-by: Salman Paracha <salmanparacha@MacBook-Pro-261.local>
2024-10-30 18:38:18 -07:00
Adil Hafeez
e462e393b1
Use large github action machine to run e2e tests ( #230 )
2024-10-30 17:54:51 -07:00
Salman Paracha
bb882fb59b
Updated hr_agent to be full stack: gradio + fastAPI ( #235 )
...
* committing to remove
* fix
* updating hr_agent
---------
Co-authored-by: Salman Paracha <salmanparacha@MacBook-Pro-261.local>
Co-authored-by: Adil Hafeez <adil@katanemo.com>
2024-10-30 15:05:34 -07:00
Salman Paracha
bb9a774a72
moving chatbot-ui in demos and out of root project structure ( #228 )
...
Co-authored-by: Salman Paracha <salmanparacha@MacBook-Pro-261.local>
2024-10-29 12:05:29 -07:00
Adil Hafeez
60299244b9
Improve Gradio UI and fix arch_state bug ( #227 )
2024-10-29 11:27:13 -07:00
José Ulises Niño Rivera
662a840ac5
Add support for streaming and fixes few issues (see description) ( #202 )
2024-10-28 17:05:06 -07:00
Salman Paracha
ff6e9bd9bd
add README for hr_agent ( #224 )
...
* add README for hr_agent
* fixed sample prompt for hr_agent in README
* added screenshot and updated README.md
---------
Co-authored-by: Salman Paracha <salmanparacha@MacBook-Pro-261.local>
2024-10-24 18:21:52 -07:00
Shuguang Chen
5f3aff4922
Update chatbot UI and update hallucination check ( #218 )
...
* update chatbot UI
* Update docker-compose for demos
* Fix bugs
* fix for metadata (#219 )
* fix for metadata
* fix
* revert
* merge main
---------
Co-authored-by: CTran <cotran2@utexas.edu>
2024-10-24 14:11:53 -07:00
Salman Paracha
708fa15a9b
HR agent demo ( #206 )
...
* committing my hr_agent branch
* updating the HR agent config
* pushing to remote
* fix hr agent
* committing to merge with main
* updating to merge from main
* updating the demo and model-server-tests to pull from poetry
* updating the poetry.lock files
* updating based on feedback
* updated system prompt for hr_agent
---------
Co-authored-by: Salman Paracha <salmanparacha@MacBook-Pro-261.local>
Co-authored-by: Adil Hafeez <adil@katanemo.com>
2024-10-23 14:32:40 -07:00
Adil Hafeez
faf64960df
update observability and dashboards ( #198 )
2024-10-18 15:07:49 -07:00
Adil Hafeez
f72588c6da
fix lint
2024-10-10 23:34:21 -07:00
Adil Hafeez
7b51cce2f7
fix demo broken links
2024-10-10 17:52:33 -07:00
Adil Hafeez
50bea9135c
update broken link
2024-10-10 17:50:57 -07:00
Adil Hafeez
7d5f760884
Improve cli ( #179 )
2024-10-10 17:44:41 -07:00
Aayush
ceca0dba28
fix prometheus target and update dashboard. ( #165 )
...
* fix prometheus target and update dashboard.
* Update envoy_overview.json with whitespace at the end
2024-10-10 14:57:20 -07:00
Co Tran
2c45de26e6
fix for linux ( #175 )
...
* fix for linux
* fix pre commit
* fix
* fix extra white space
* fix commit
2024-10-10 14:56:23 -07:00
Salman Paracha
95a0f1be5b
updated archgw cli to pull from archgw_modelserver from pypi ( #169 )
...
* updated archgw cli to pull from archgw_modelserver from pypi
* fix image name
* update rev
---------
Co-authored-by: Salman Paracha <salmanparacha@MacBook-Pro-261.local>
Co-authored-by: Adil Hafeez <adil@katanemo.com>
2024-10-09 21:00:26 -07:00
Adil Hafeez
6b70768170
make ratelimit section optional ( #168 )
2024-10-09 19:53:00 -07:00
Salman Paracha
dc3a9813c3
fixed function calling arch config yaml
2024-10-09 17:10:54 -07:00
Adil Hafeez
3e9327cf36
fix bug in jinja template for tracing
2024-10-09 16:44:50 -07:00
Salman Paracha
1acf43ff7a
fixed cli to use poetry as well. this way we make it easy to have the… ( #160 )
2024-10-09 15:53:12 -07:00
Adil Hafeez
e81ca8d5cf
llm listener split ( #155 )
2024-10-09 15:47:32 -07:00
Co Tran
5c4a6bc8ff
lint + formatting with black ( #158 )
...
* lint + formatting with black
* add black as pre commit
2024-10-09 11:25:07 -07:00
Salman Paracha
42d4a28e13
updated all demo READMEs and minor doc changes ( #154 )
...
* updated all demo READMEs and minor doc changes
* minor typo fixes
* updated main Readme
* fixed README and docs
* fixed README and docs
---------
Co-authored-by: Salman Paracha <salmanparacha@MacBook-Pro-261.local>
2024-10-08 23:58:55 -07:00
Salman Paracha
b63a01fe82
Salmanap/fix network agent demo ( #153 )
...
* staging my changes to re-based from main
* adding debug statements to rust
* merged with main
* ready to push network agent
* removed the incomplete sql example
---------
Co-authored-by: Salman Paracha <salmanparacha@MacBook-Pro-261.local>
2024-10-08 22:19:20 -07:00
Adil Hafeez
3b26b16fc8
add days and units to api server ( #150 )
...
* add days and units to api server
* add more stuff
* fix more
2024-10-08 18:14:06 -07:00
Adil Hafeez
285aa1419b
Split listener ( #141 )
2024-10-08 16:24:08 -07:00
Salman Paracha
82fc91495e
Salmanap/fix demos ( #140 )
...
* Committing to bring in main
* insurance agent updated
* updated the insurance agent
---------
Co-authored-by: Salman Paracha <salmanparacha@MacBook-Pro-261.local>
2024-10-07 20:27:07 -07:00
Co Tran
b1fa127704
Hallucination integration with rust ( #122 )
2024-10-07 18:38:55 -07:00
Shuguang Chen
5bfccd3959
Update arch_config.yaml ( #137 )
2024-10-07 16:01:12 -07:00
Adil Hafeez
2a747df7c0
don't compute embeddings for names and other fixes see description ( #126 )
...
* serialize tools - 2
* fix int tests
* fix int test
* fix unit tests
2024-10-05 19:25:16 -07:00
Salman Paracha
0e5ea3d6db
Salmanap/fix config generator ( #124 )
...
* fixed environment variables issue with build. Now llm provider access keys are being written correctly
* fixed and verified that keys are being properly set when archgw is booted up
* removing leaf reference to a staged config file. not needed anymore
* minor fixes to get the build in more stable state
* minor fixes based on feedback
---------
Co-authored-by: Salman Paracha <salmanparacha@MacBook-Pro-261.local>
2024-10-05 10:49:47 -07:00
Salman Paracha
dc57f119a0
archgw cli ( #117 )
...
* initial commit of the insurance agent demo, with the CLI tool
* committing the cli
* fixed some field descriptions for generate-prompt-targets
* CLI works with build, up and down commands. Function calling example works stand-alone
* fixed README to install archgw cli
* fixing based on feedback
* fixing based on feedback
---------
Co-authored-by: Salman Paracha <salmanparacha@MacBook-Pro-261.local>
2024-10-03 18:21:27 -07:00
José Ulises Niño Rivera
8ea917aae5
Add the ability to use LLM Providers from the Arch config ( #112 )
...
Signed-off-by: José Ulises Niño Rivera <junr03@users.noreply.github.com>
2024-10-03 10:57:01 -07:00
Adil Hafeez
1b57a49c9d
add support for default target ( #111 )
...
* add support for default target
* add more fixes
2024-10-02 20:43:16 -07:00
Co Tran
ed50d29ccf
fix fc integration ( #110 )
...
* fix fc integration
* fix integration
* remove file
* Update arch_fc.py
2024-10-01 19:20:28 -07:00
Aayush
f0fb4a48cd
change root to config in config_generator as well as remove old line 17 from function calling docker compose ( #108 )
2024-10-01 15:40:55 -07:00
Adil Hafeez
dbb8f87787
update path for arch_config.yaml file ( #107 )
2024-10-01 13:28:53 -07:00
Co Tran
17a643c410
ArchFC endpoint integration ( #94 )
...
* integration
* modify docker file
* add params and fix python lint
* fix empty context and tool calls
* address comments
* revert port
* fix bug merge
* fix environment
* fix bug
* fix compose
* fix merge
2024-10-01 12:47:26 -07:00
Adil Hafeez
1a7c1ad0a5
rename archgw_model_sever => model_server ( #106 )
2024-10-01 11:24:43 -07:00
Salman Paracha
8654d3d5c5
simplify developer getting started experience ( #102 )
...
* Fixed build. Now, we have a bare bones version of the docker-compose file with only two services, archgw and archgw-model-server. Tested using CLI
* some pre-commit fixes
* fixed cargo formatting issues
* fixed model server conflict changes
---------
Co-authored-by: Salman Paracha <salmanparacha@MacBook-Pro-261.local>
2024-10-01 10:02:23 -07:00
Adil Hafeez
41cdef590a
arch schema validator ( #105 )
...
* add arch schema validator
* schema validator
2024-10-01 09:22:08 -07:00