Commit graph

628 commits

Author SHA1 Message Date
Adil Hafeez
e81ca8d5cf
llm listener split (#155) 2024-10-09 15:47:32 -07:00
Co Tran
8b5db45507
Fix gpu dependency and only leverage onnx when GPU is available (#157)
* replacing appending instead of write

* fix eetq dependency

* gpu guard required eetq

* fix bug when gpu is available

* fix for gpu device

* reverse

* fix

* replace gpu -> cuda
2024-10-09 11:42:05 -07:00
Co Tran
5c4a6bc8ff
lint + formatting with black (#158)
* lint + formatting with black

* add black as pre commit
2024-10-09 11:25:07 -07:00
Salman Paracha
498e7f9724
minor fixes to README (#156)
Co-authored-by: Salman Paracha <salmanparacha@MacBook-Pro-261.local>
2024-10-09 10:12:36 -07:00
Salman Paracha
42d4a28e13
updated all demo READMEs and minor doc changes (#154)
* updated all demo READMEs and minor doc changes

* minor typo fixes

* updated main Readme

* fixed README and docs

* fixed README and docs

---------

Co-authored-by: Salman Paracha <salmanparacha@MacBook-Pro-261.local>
2024-10-08 23:58:55 -07:00
Salman Paracha
b63a01fe82
Salmanap/fix network agent demo (#153)
* staging my changes to re-based from main

* adding debug statements to rust

* merged with main

* ready to push network agent

* removed the incomplete sql example

---------

Co-authored-by: Salman Paracha <salmanparacha@MacBook-Pro-261.local>
2024-10-08 22:19:20 -07:00
Adil Hafeez
6acfea7787
bug fix - send all parameter irrespective of type
earlier we were only sending parameter if the type is string
2024-10-08 20:28:32 -07:00
Adil Hafeez
47c9c0aafc
fix lock file (#151) 2024-10-08 18:16:00 -07:00
Adil Hafeez
3b26b16fc8
add days and units to api server (#150)
* add days and units to api server

* add more stuff

* fix more
2024-10-08 18:14:06 -07:00
Adil Hafeez
e08d406be5
Update README.md 2024-10-08 17:20:51 -07:00
Adil Hafeez
ede125a4f3
ensure that tracing is optional in arch_config (#149) 2024-10-08 17:15:40 -07:00
Co Tran
e62c6e75ea
fix dependency + log info (#148) 2024-10-08 16:42:40 -07:00
Adil Hafeez
285aa1419b
Split listener (#141) 2024-10-08 16:24:08 -07:00
Co Tran
22bc3d2798
Cotran/prompt guard doc (#147)
* replace prompt injection with jailbreak and removing toxic

* replace prompt injection with jailbreak and removing toxic
2024-10-08 15:58:50 -07:00
Adil Hafeez
fab71abdac
update readme for filter 2024-10-08 15:32:49 -07:00
Co Tran
80d2229053
Cotran/onnx conversion (#145)
* onnx replacement

* onnx conversion for nli and embedding model

* fix naming

* fix naming

* fix naming

* pin version
2024-10-08 14:37:48 -07:00
Shuguang Chen
b30ad791f7
Fix errors and improve Doc (#143)
* Fix link issues and add icons

* Improve Doc

* fix test

* making minor modifications to Shuguang's doc changes

---------

Co-authored-by: Salman Paracha <salmanparacha@MacBook-Pro-261.local>
Co-authored-by: Adil Hafeez <adil@katanemo.com>
2024-10-08 13:18:34 -07:00
Salman Paracha
3ed50e61d2
ensure that we can call the new api.fc.archgw.com url, logging fixes … (#142)
* ensure that we can call the new api.fc.archgw.com url, logging fixes and minor cli bug fixes

* fixed a bug where model_server printed on terminal after start script stopped running

* updating the logo and fixing the website styles

* updated the branch with feedback from Co and Adil

---------

Co-authored-by: Salman Paracha <salmanparacha@MacBook-Pro-261.local>
2024-10-08 12:40:24 -07:00
Salman Paracha
82fc91495e
Salmanap/fix demos (#140)
* Committing to bring in main

* insurance agent updated

* updated the insurance agent

---------

Co-authored-by: Salman Paracha <salmanparacha@MacBook-Pro-261.local>
2024-10-07 20:27:07 -07:00
Co Tran
b1fa127704
Hallucination integration with rust (#122) 2024-10-07 18:38:55 -07:00
Adil Hafeez
43dc2a0a73
system prompt (keep system prompt at the top) (#139)
* add system prompt

* fix
2024-10-07 17:50:07 -07:00
Adil Hafeez
422efd3887
add system prompt (#138) 2024-10-07 17:25:37 -07:00
José Ulises Niño Rivera
c1cfbcd44d
Implement Client trait for StreamContext (#134)
Signed-off-by: José Ulises Niño Rivera <junr03@users.noreply.github.com>
2024-10-07 16:50:15 -07:00
Shuguang Chen
5bfccd3959
Update arch_config.yaml (#137) 2024-10-07 16:01:12 -07:00
Co Tran
93abe553e3
formatting and monitoring change (#136) 2024-10-07 15:21:05 -07:00
Salman Paracha
976b2eaae0
fixing docs so that GH pages picks up the right CNAME for DNS (#135)
* fixing docs so that GH pages picks up the right CNAME for DNS

* updating workflow to pick CNAME

* making sure to correctly set permissions on the build/html directory

* fixing GH actions issues with CNAME

* updated docker build command to run as local user

* fixed the CNAME issue and updated GH actions

---------

Co-authored-by: Salman Paracha <salmanparacha@MacBook-Pro-261.local>
2024-10-07 11:03:02 -07:00
Adil Hafeez
96686dc606
Serialize tool calls for Arch FC (#131)
* Serialize tool calls

* fix int tests
2024-10-07 00:03:25 -07:00
Adil Hafeez
b43f687b85
add files for local development (#133) 2024-10-06 23:10:13 -07:00
Salman Paracha
b60ceb9168
model server build (#127)
* first commit to have model_server not be dependent on Docker

* making changes to fix the docker-compose file for archgw to set DNS_V4 and minor fixes with the build

* additional fixes for model server to be separated out in the build

* additional fixes for model server to be separated out in the build

* fix to get model_server to be built as a separate python process. TODO: fix the embeddings logs after cli completes

* fixing init to pull tempfile using the tempfile python package

---------

Co-authored-by: Salman Paracha <salmanparacha@MacBook-Pro-261.local>
2024-10-06 18:21:43 -07:00
Salman Paracha
7d21359f5b
added index.html and made minor README.md edits (#130)
* added index.html and made minor README.md edits

* minor fix to the text

---------

Co-authored-by: Salman Paracha <salmanparacha@MacBook-Pro-261.local>
2024-10-06 17:16:48 -07:00
Shuguang Chen
5c7567584d
Doc Update (#129)
* init update

* Update terminology.rst

* fix the branch to create an index.html, and fix pre-commit issues

* Doc update

* made several changes to the docs after Shuguang's revision

* fixing pre-commit issues

* fixed the reference file to the final prompt config file

* added google analytics

---------

Co-authored-by: Salman Paracha <salmanparacha@MacBook-Pro-261.local>
2024-10-06 16:54:34 -07:00
Salman Paracha
2a7b95582c
Build arch docs (#128)
* Build arch docs

The following action would build the docs from main

* triggering a push

* updating the command to run on pull request

* fixed bug

* updated documentation workflow to create a gh-pages branch and update it with the latest build

* fixing the deployment option to pick the right folder

* tested that the pages get build via GH actions, and get published appropriately

---------

Co-authored-by: Salman Paracha <salmanparacha@MacBook-Pro-261.local>
2024-10-06 11:39:22 -07:00
Salman Paracha
af5671290c
fixing CNAME for pre-commit 2024-10-06 09:56:56 -07:00
Salman Paracha
91d7e17c06
Create CNAME 2024-10-06 09:34:29 -07:00
Adil Hafeez
2a747df7c0
don't compute embeddings for names and other fixes see description (#126)
* serialize tools - 2

* fix int tests

* fix int test

* fix unit tests
2024-10-05 19:25:16 -07:00
Salman Paracha
0e5ea3d6db
Salmanap/fix config generator (#124)
* fixed environment variables issue with build. Now llm provider access keys are being written correctly

* fixed and verified that keys are being properly set when archgw is booted up

* removing leaf reference to a staged config file. not needed anymore

* minor fixes to get the build in more stable state

* minor fixes based on feedback

---------

Co-authored-by: Salman Paracha <salmanparacha@MacBook-Pro-261.local>
2024-10-05 10:49:47 -07:00
Adil Hafeez
5ba7db21d0
fix pkg_resources error (#123) 2024-10-04 20:24:08 -07:00
José Ulises Niño Rivera
2a9b9486f3
Clean up Embeddings Store (#121)
Signed-off-by: José Ulises Niño Rivera <junr03@users.noreply.github.com>
2024-10-04 19:33:52 -07:00
José Ulises Niño Rivera
10b5c5b42c
[BUG FIX] Add missing chat completions request hint (#116)
Signed-off-by: José Ulises Niño Rivera <junr03@users.noreply.github.com>
2024-10-04 19:19:44 -07:00
Adil Hafeez
07ef1af24f
fix embeddings not found bug (#120) 2024-10-04 17:07:59 -07:00
Salman Paracha
701187474f
load_models checks for device before getting the BGE or NLI model loaded in memory. Was defaulting to CPU. And removed gunk for load_sql (#119)
Co-authored-by: Salman Paracha <salmanparacha@MacBook-Pro-261.local>
2024-10-04 13:09:35 -07:00
José Ulises Niño Rivera
093891b5f8
Remove hardcoded Access Key Envvars (#118)
Signed-off-by: José Ulises Niño Rivera <junr03@users.noreply.github.com>
2024-10-04 11:10:59 -07:00
Co Tran
7d38ef9719
Cotran/hallucination integration (#115)
* fix fc integration

* fix integration

* remove file

* Update arch_fc.py

* create model server hallucination detection class
2024-10-04 11:05:25 -07:00
Salman Paracha
dc57f119a0
archgw cli (#117)
* initial commit of the insurance agent demo, with the CLI tool

* committing the cli

* fixed some field descriptions for generate-prompt-targets

* CLI works with build, up and down commands. Function calling example works stand-alone

* fixed README to install archgw cli

* fixing based on feedback

* fixing based on feedback

---------

Co-authored-by: Salman Paracha <salmanparacha@MacBook-Pro-261.local>
2024-10-03 18:21:27 -07:00
José Ulises Niño Rivera
af018e5fd8
Remove optional PromptGuards from Stream Context (#113)
Signed-off-by: José Ulises Niño Rivera <junr03@users.noreply.github.com>
2024-10-03 12:21:35 -07:00
José Ulises Niño Rivera
8ea917aae5
Add the ability to use LLM Providers from the Arch config (#112)
Signed-off-by: José Ulises Niño Rivera <junr03@users.noreply.github.com>
2024-10-03 10:57:01 -07:00
Adil Hafeez
1b57a49c9d
add support for default target (#111)
* add support for default target

* add more fixes
2024-10-02 20:43:16 -07:00
Aayush
c8d0dbec26
change default stat_prefix from ingress_http to arch (#109)
* change default stat_prefix from ingress_http to arch

* Update arch/envoy.template.yaml

Co-authored-by: Adil Hafeez <adil@katanemo.com>

---------

Co-authored-by: Adil Hafeez <adil@katanemo.com>
2024-10-02 18:21:33 -07:00
Co Tran
ed50d29ccf
fix fc integration (#110)
* fix fc integration

* fix integration

* remove file

* Update arch_fc.py
2024-10-01 19:20:28 -07:00
Aayush
f0fb4a48cd
change root to config in config_generator as well as remove old line 17 from function calling docker compose (#108) 2024-10-01 15:40:55 -07:00