Commit graph

67 commits

Author SHA1 Message Date
Adil Hafeez
8a265de7c6
Propagate enums from prompt-target to bolt-fc (#56) 2024-09-17 12:03:21 -07:00
Adil Hafeez
63143fbd01
add bolt guard config (#55) 2024-09-17 10:59:50 -07:00
Adil Hafeez
dd35e77a73
add streaming chatbot-ui 2024-09-17 09:01:45 -07:00
Adil Hafeez
b8841fb8f4
rename bolt-config => bolt_config 2024-09-17 08:56:47 -07:00
Adil Hafeez
060a0d665e
improve service names (#54)
- embedding-server => model_server
- public-types => public_types
- chatbot-ui => chatbot_ui
- function-calling => function_calling
2024-09-17 08:47:35 -07:00
Adil Hafeez
215f96e273
rename envoy-1 => bolt-1 (#53) 2024-09-16 23:20:26 -07:00
Adil Hafeez
a0d132f5b6
remove unused demos 2024-09-16 19:21:59 -07:00
Adil Hafeez
ed329bd419
improve docker build speed 2024-09-16 19:20:27 -07:00
Adil Hafeez
9e50957f22
Improve prompt target intent matching (#51) 2024-09-16 19:20:07 -07:00
José Ulises Niño Rivera
8565462ec4
Resetting instead of resuming (#39)
Signed-off-by: José Ulises Niño Rivera <junr03@users.noreply.github.com>
2024-09-14 10:58:25 -07:00
José Ulises Niño Rivera
33a1a68218
Add small clarification to function calling demo (#48)
Signed-off-by: José Ulises Niño Rivera <junr03@users.noreply.github.com>
2024-09-11 11:37:53 -07:00
José Ulises Niño Rivera
3c5ac65e2a
More idiomatic conversion (#47)
* More idiomatic conversion

Signed-off-by: José Ulises Niño Rivera <junr03@users.noreply.github.com>

* errors

Signed-off-by: José Ulises Niño Rivera <junr03@users.noreply.github.com>

---------

Signed-off-by: José Ulises Niño Rivera <junr03@users.noreply.github.com>
2024-09-11 10:00:25 -07:00
Adil Hafeez
de8c706a69
update readme 2024-09-10 14:42:58 -07:00
Adil Hafeez
d95bac7650
Update README.md 2024-09-10 14:27:14 -07:00
Adil Hafeez
b53e1944dd
Update README.md 2024-09-10 14:25:52 -07:00
Adil Hafeez
7b5203a2ce
Add function calling support using bolt-fc-1b (#35) 2024-09-10 14:24:46 -07:00
José Ulises Niño Rivera
fdfad87347
Fix wasm load error (#45)
Signed-off-by: José Ulises Niño Rivera <junr03@users.noreply.github.com>
2024-09-05 16:29:41 -07:00
José Ulises Niño Rivera
dd48689aee
Add Ratelimit on request tokens (#44)
Signed-off-by: José Ulises Niño Rivera <junr03@users.noreply.github.com>
2024-09-04 17:28:12 -07:00
José Ulises Niño Rivera
d98517f240
Move shared types into their own crate (#41)
Signed-off-by: José Ulises Niño Rivera <junr03@users.noreply.github.com>
2024-09-04 15:31:05 -07:00
José Ulises Niño Rivera
4dd1f3693e
Add the Ratelimit module (#34)
Signed-off-by: José Ulises Niño Rivera <junr03@users.noreply.github.com>
2024-08-28 11:11:05 -07:00
José Ulises Niño Rivera
2a4cde950d
Add Ratelimit config (#32)
Signed-off-by: José Ulises Niño Rivera <junr03@users.noreply.github.com>
2024-08-07 14:15:26 -07:00
Adil Hafeez
445b1ea210
Add support for local llm (mistral 7b) (#31) 2024-08-06 23:40:06 -07:00
José Ulises Niño Rivera
b49fc2f264
Simplify Entity struct (#33)
Signed-off-by: José Ulises Niño Rivera <junr03@users.noreply.github.com>
2024-08-06 17:04:32 -07:00
Adil Hafeez
1fa5215753
Update README.md 2024-08-01 13:26:42 -07:00
Adil Hafeez
b01c9949bc
Update README.md 2024-08-01 13:25:42 -07:00
Adil Hafeez
0d02147912
Update README.md 2024-08-01 13:23:12 -07:00
Adil Hafeez
da84b92cca
Update README.md 2024-08-01 13:22:47 -07:00
Adil Hafeez
6f6454069f
Improve demo experience (#28)
* Improve demo experience
- generate envoy.yaml using jinja2 template

* add todo

* add config file
2024-07-31 17:13:39 -07:00
Adil Hafeez
51d6213cdd
update open-message-format (#30) 2024-07-31 15:56:13 -07:00
Adil Hafeez
71613c2789
add error warning if open ai key is missing 2024-07-31 15:51:31 -07:00
Adil Hafeez
56328bfb16
fix break 2024-07-31 14:43:28 -07:00
Adil Hafeez
9774148c75
Remove top level container and start snake-case for config files (#27)
2024-07-31 14:05:52 -07:00
José Ulises Niño Rivera
b8ea65d858
Remove unnecessary clones (#26)
Signed-off-by: José Ulises Niño Rivera <junr03@users.noreply.github.com>
Co-authored-by: Adil Hafeez <adil@katanemo.com>
2024-07-31 11:48:34 -07:00
José Ulises Niño Rivera
c13682a03b
Clean up imports (#25)
Signed-off-by: José Ulises Niño Rivera <junr03@users.noreply.github.com>
2024-07-31 11:33:39 -07:00
Adil Hafeez
5ecdf30fdc
refactor callout context arm handler code 2024-07-30 18:20:57 -07:00
Adil Hafeez
09824c7236
Update README.md 2024-07-30 16:30:19 -07:00
Adil Hafeez
1f544d836c
Update README.md 2024-07-30 16:25:01 -07:00
Adil Hafeez
33f9dd22e6
Add workflow logic for weather forecast demo (#24) 2024-07-30 16:23:23 -07:00
José Ulises Niño Rivera
7ef68eccfb
Improve error handling (#23)
Signed-off-by: José Ulises Niño Rivera <junr03@users.noreply.github.com>
2024-07-29 12:15:26 -07:00
José Ulises Niño Rivera
a51a467cad
Add initial integration style tests (#20)
Signed-off-by: José Ulises Niño Rivera <junr03@users.noreply.github.com>
2024-07-25 14:41:36 -07:00
Adil Hafeez
a0abd9c42d
refactor code a bit (#21) 2024-07-24 14:13:18 -07:00
Adil Hafeez
15a44e680b
update pre-commit to only run fmt on intelligent-prompt-gateway package (#22) 2024-07-23 23:53:02 -07:00
Adil Hafeez
cad38295bf
Use open-message-format to serialize and deserialize embeddings api (#18)
* Use open-message-format to serialize and deserialize embeddings api
2024-07-23 11:56:49 -07:00
José Ulises Niño Rivera
a59c7df2a2
Address comments from clean up PR (#17)
Signed-off-by: José Ulises Niño Rivera <junr03@users.noreply.github.com>
2024-07-22 14:29:12 -07:00
José Ulises Niño Rivera
7ae2d918e8
Delete example code, and move prod code into functional separation (#16)
* Clean up example code

Signed-off-by: José Ulises Niño Rivera <junr03@users.noreply.github.com>

* clean

Signed-off-by: José Ulises Niño Rivera <junr03@users.noreply.github.com>

* rename

---------

Signed-off-by: José Ulises Niño Rivera <junr03@users.noreply.github.com>
Co-authored-by: Adil Hafeez <adil@katanemo.com>
2024-07-21 23:22:47 -07:00
José Ulises Niño Rivera
5b4143d580
Add initial logic to send prompts to LLM API (#9)
Signed-off-by: José Ulises Niño Rivera <junr03@users.noreply.github.com>
2024-07-19 13:14:48 -07:00
Aayush
31c4ac267a
Add testing to GitHub workflow (#15)
* update workflow and pre-commit to test only unit tests

* Update .github/workflows/checks.yml

Co-authored-by: Adil Hafeez <adil@katanemo.com>

* Update .pre-commit-config.yaml

Co-authored-by: Adil Hafeez <adil@katanemo.com>

* update formatting of comments

* Update .pre-commit-config.yaml

* Update checks.yml

---------

Co-authored-by: Adil Hafeez <adil@katanemo.com>
2024-07-18 14:20:33 -07:00
Adil Hafeez
7bf77afa0e
add embedding store (#10) 2024-07-18 14:04:51 -07:00
Adil Hafeez
cc2a496f90
Setup pre-commit so it runs locally before every git push (#12)
* Setup pre-commit so it runs locally before every git push

* Update .pre-commit-config.yaml

* added more checks

* update readme

* Apply suggestions from code review

Co-authored-by: José Ulises Niño Rivera <junr03@users.noreply.github.com>

* remove cargo-check

---------

Co-authored-by: José Ulises Niño Rivera <junr03@users.noreply.github.com>
2024-07-18 11:01:02 -07:00
Adil Hafeez
a386d68b41
Parse katanemo config using serde/yaml package (#6)
* Parse katanemo config using serde/yaml package

- load yaml file into typed classes
- pass katanemo config to plugin using envoy wasm plugin config
- add tests in configuration.rs file
2024-07-16 14:50:32 -07:00