Commit graph

48 commits

Author SHA1 Message Date
Adil Hafeez
71cdf69f77
dont send default target to archfc (#166) 2024-10-09 17:43:02 -07:00
Adil Hafeez
3e9327cf36 fix bug in jinja template for tracing 2024-10-09 16:44:50 -07:00
Salman Paracha
0ed88def8f updated settuptools packages 2024-10-09 16:31:35 -07:00
Adil Hafeez
6991fbb7a7 rename 2024-10-09 16:24:40 -07:00
Adil Hafeez
c254dfb16a
update cli and update docs (#161)
* add services to cli

* more changes
2024-10-09 16:22:27 -07:00
Salman Paracha
1acf43ff7a
fixed cli to use poetry as well. this way we make it easy to have the… (#160) 2024-10-09 15:53:12 -07:00
Adil Hafeez
e81ca8d5cf
llm listener split (#155) 2024-10-09 15:47:32 -07:00
Co Tran
8b5db45507
Fix gpu dependency and only leverage onnx when GPU is available (#157)
* replacing appending instead of write

* fix eetq dependency

* gpu guard required eetq

* fix bug when gpu is available

* fix for gpu device

* reverse

* fix

* replace gpu -> cuda
2024-10-09 11:42:05 -07:00
Co Tran
5c4a6bc8ff
lint + formating with black (#158)
* lint + formating with black

* add black as pre commit
2024-10-09 11:25:07 -07:00
Salman Paracha
498e7f9724
minor fixes to README (#156)
Co-authored-by: Salman Paracha <salmanparacha@MacBook-Pro-261.local>
2024-10-09 10:12:36 -07:00
Salman Paracha
b63a01fe82
Salmanap/fix network agent demo (#153)
* staging my changes to re-based from main

* adding debug statements to rust

* merged with main

* ready to push network agent

* removed the incomplete sql example

---------

Co-authored-by: Salman Paracha <salmanparacha@MacBook-Pro-261.local>
2024-10-08 22:19:20 -07:00
Adil Hafeez
6acfea7787 bug fix - send all parameter irrespective of type
earlier we were only sending parameter if the type is string
2024-10-08 20:28:32 -07:00
Adil Hafeez
e08d406be5
Update README.md 2024-10-08 17:20:51 -07:00
Adil Hafeez
ede125a4f3
ensure that tracing is optional in arch_config (#149) 2024-10-08 17:15:40 -07:00
Adil Hafeez
285aa1419b
Split listener (#141) 2024-10-08 16:24:08 -07:00
Adil Hafeez
fab71abdac update readme for filter 2024-10-08 15:32:49 -07:00
Co Tran
80d2229053
Cotran/onnx conversion (#145)
* onnx replacement

* onnx conversion for nli and embedding model

* fix naming

* fix naming

* fix naming

* pin version
2024-10-08 14:37:48 -07:00
Salman Paracha
3ed50e61d2
ensure that we can call the new api.fc.archgw.com url, logging fixes … (#142)
* ensure that we can call the new api.fc.archgw.com url, logging fixes and minor cli bug fixes

* fixed a bug where model_server printed on terminal after start script stopped running

* updating the logo and fixing the website styles

* updated the branch with feedback from Co and Adil

---------

Co-authored-by: Salman Paracha <salmanparacha@MacBook-Pro-261.local>
2024-10-08 12:40:24 -07:00
Salman Paracha
82fc91495e
Salmanap/fix demos (#140)
* Comitting to bring in main

* insurance agent updated

* updated the insurance agent

---------

Co-authored-by: Salman Paracha <salmanparacha@MacBook-Pro-261.local>
2024-10-07 20:27:07 -07:00
Co Tran
b1fa127704
Hallucination integration with rust (#122) 2024-10-07 18:38:55 -07:00
Adil Hafeez
43dc2a0a73
system prompt (keep system prompt at the top) (#139)
* add system prompt

* fix
2024-10-07 17:50:07 -07:00
Adil Hafeez
422efd3887
add system prompt (#138) 2024-10-07 17:25:37 -07:00
José Ulises Niño Rivera
c1cfbcd44d
Implement Client trait for StreamContext (#134)
Signed-off-by: José Ulises Niño Rivera <junr03@users.noreply.github.com>
2024-10-07 16:50:15 -07:00
Adil Hafeez
96686dc606
Serialize tool calls for Arch FC (#131)
* Serialize tool calls

* fix int tests
2024-10-07 00:03:25 -07:00
Adil Hafeez
b43f687b85
add files for local development (#133) 2024-10-06 23:10:13 -07:00
Salman Paracha
b60ceb9168
model server build (#127)
* first commit to have model_server not be dependent on Docker

* making changes to fix the docker-compose file for archgw to set DNS_V4 and minor fixes with the build

* additional fixes for model server to be separated out in the build

* additional fixes for model server to be separated out in the build

* fix to get model_server to be built as a separate python process. TODO: fix the embeddings logs after cli completes

* fixing init to pull tempfile using the tempfile python package

---------

Co-authored-by: Salman Paracha <salmanparacha@MacBook-Pro-261.local>
2024-10-06 18:21:43 -07:00
Adil Hafeez
2a747df7c0
don't compute embeddings for names and other fixes see description (#126)
* serialize tools - 2

* fix int tests

* fix int test

* fix unit tests
2024-10-05 19:25:16 -07:00
Salman Paracha
0e5ea3d6db
Salmanap/fix config generator (#124)
* fixed environment variables issue with build. Now llm provider access keys are being written correctly

* fixed and verified that keys are being properly set when archgw is booted up

* removing leaf reference to a staged config file. not needed anymore

* minor fixes to get the build in more stable state

* minor fixes based on feedback

---------

Co-authored-by: Salman Paracha <salmanparacha@MacBook-Pro-261.local>
2024-10-05 10:49:47 -07:00
Adil Hafeez
5ba7db21d0
fix pkg_resources error (#123) 2024-10-04 20:24:08 -07:00
José Ulises Niño Rivera
2a9b9486f3
Clean up Embeddings Store (#121)
Signed-off-by: José Ulises Niño Rivera <junr03@users.noreply.github.com>
2024-10-04 19:33:52 -07:00
José Ulises Niño Rivera
10b5c5b42c
[BUG FIX] Add missing chat completions request hint (#116)
Signed-off-by: José Ulises Niño Rivera <junr03@users.noreply.github.com>
2024-10-04 19:19:44 -07:00
Adil Hafeez
07ef1af24f
fix embeddings not found bug (#120) 2024-10-04 17:07:59 -07:00
José Ulises Niño Rivera
093891b5f8
Remove hardcoded Access Key Envvars (#118)
Signed-off-by: José Ulises Niño Rivera <junr03@users.noreply.github.com>
2024-10-04 11:10:59 -07:00
Salman Paracha
dc57f119a0
archgw cli (#117)
* initial commit of the insurange agent demo, with the CLI tool

* committing the cli

* fixed some field descriptions for generate-prompt-targets

* CLI works with buil, up and down commands. Function calling example works stand-alone

* fixed README to install archgw cli

* fixing based on feedback

* fixing based on feedback

---------

Co-authored-by: Salman Paracha <salmanparacha@MacBook-Pro-261.local>
2024-10-03 18:21:27 -07:00
José Ulises Niño Rivera
af018e5fd8
Remove optional PromptGuards from Stream Context (#113)
Signed-off-by: José Ulises Niño Rivera <junr03@users.noreply.github.com>
2024-10-03 12:21:35 -07:00
José Ulises Niño Rivera
8ea917aae5
Add the ability to use LLM Providers from the Arch config (#112)
Signed-off-by: José Ulises Niño Rivera <junr03@users.noreply.github.com>
2024-10-03 10:57:01 -07:00
Adil Hafeez
1b57a49c9d
add support for default target (#111)
* add support for default target

* add more fixes
2024-10-02 20:43:16 -07:00
Aayush
c8d0dbec26
change default stat_prefix from ingress_http to arch (#109)
* change default stat_prefix from ingress_http to arch

* Update arch/envoy.template.yaml

Co-authored-by: Adil Hafeez <adil@katanemo.com>

---------

Co-authored-by: Adil Hafeez <adil@katanemo.com>
2024-10-02 18:21:33 -07:00
Aayush
f0fb4a48cd
change root to config in config_generator as well as remove old line 17 from function calling docker compose (#108) 2024-10-01 15:40:55 -07:00
Co Tran
17a643c410
ArchFC endpoint integration (#94)
* integration

* mopdify docker file

* add params and fix python lint

* fix empty context and tool calls

* address comments

* revert port

* fix bug merge

* fix environment

* fix bug

* fix compose

* fix merge
2024-10-01 12:47:26 -07:00
Adil Hafeez
1a7c1ad0a5
rename archgw_model_sever => model_server (#106) 2024-10-01 11:24:43 -07:00
Salman Paracha
8654d3d5c5
simplify developer getting started experience (#102)
* Fixed build. Now, we have a bare bones version of the docker-compose file with only two services, archgw and archgw-model-server. Tested using CLI

* some pre-commit fixes

* fixed cargo formatting issues

* fixed model server conflict changes

---------

Co-authored-by: Salman Paracha <salmanparacha@MacBook-Pro-261.local>
2024-10-01 10:02:23 -07:00
Adil Hafeez
15869825e3
add messages in params when making api calls (#104) 2024-10-01 09:14:01 -07:00
Adil Hafeez
f4395d39f9
Fold function_resolver into model_server (#103) 2024-10-01 09:13:50 -07:00
Adil Hafeez
2207021b9c
remove method type (#101) 2024-09-30 17:59:29 -07:00
José Ulises Niño Rivera
f154bc3741
Remove unnecessary envoy.yaml (#99)
* Remove unnecessary envoy.yaml

Signed-off-by: José Ulises Niño Rivera <junr03@users.noreply.github.com>

* more

Signed-off-by: José Ulises Niño Rivera <junr03@users.noreply.github.com>

---------

Signed-off-by: José Ulises Niño Rivera <junr03@users.noreply.github.com>
2024-09-30 17:49:25 -07:00
Adil Hafeez
cc35eb0cd7
update config (#93) 2024-09-30 17:49:05 -07:00
Adil Hafeez
ea86f73605
rename envoyfilter => arch (#91)
* rename envoyfilter => arch

* fix more files

* more fixes

* more renames
2024-09-27 16:41:39 -07:00