plano

mirror of https://github.com/katanemo/plano.git synced 2026-04-26 01:06:25 +02:00

Author	SHA1	Message	Date
Salman Paracha	1acf43ff7a	fixed cli to use poetry as well. this way we make it easy to have the… (#160 )	2024-10-09 15:53:12 -07:00
Co Tran	8b5db45507	Fix gpu dependency and only leverage onnx when GPU is available (#157 ) * replacing appending instead of write * fix eetq dependency * gpu guard required eetq * fix bug when gpu is available * fix for gpu device * reverse * fix * replace gpu -> cuda	2024-10-09 11:42:05 -07:00
Co Tran	5c4a6bc8ff	lint + formating with black (#158 ) * lint + formating with black * add black as pre commit	2024-10-09 11:25:07 -07:00
Salman Paracha	b63a01fe82	Salmanap/fix network agent demo (#153 ) * staging my changes to re-based from main * adding debug statements to rust * merged with main * ready to push network agent * removed the incomplete sql example --------- Co-authored-by: Salman Paracha <salmanparacha@MacBook-Pro-261.local>	2024-10-08 22:19:20 -07:00
Adil Hafeez	47c9c0aafc	fix lock file (#151 )	2024-10-08 18:16:00 -07:00
Co Tran	e62c6e75ea	fix dependcy + logg info (#148 )	2024-10-08 16:42:40 -07:00
Co Tran	80d2229053	Cotran/onnx conversion (#145 ) * onnx replacement * onnx conversion for nli and embedding model * fix naming * fix naming * fix naming * pin version	2024-10-08 14:37:48 -07:00
Salman Paracha	3ed50e61d2	ensure that we can call the new api.fc.archgw.com url, logging fixes … (#142 ) * ensure that we can call the new api.fc.archgw.com url, logging fixes and minor cli bug fixes * fixed a bug where model_server printed on terminal after start script stopped running * updating the logo and fixing the website styles * updated the branch with feedback from Co and Adil --------- Co-authored-by: Salman Paracha <salmanparacha@MacBook-Pro-261.local>	2024-10-08 12:40:24 -07:00
Co Tran	b1fa127704	Hallucination integration with rust (#122 )	2024-10-07 18:38:55 -07:00
Co Tran	93abe553e3	formating and mointoring change (#136 )	2024-10-07 15:21:05 -07:00
Adil Hafeez	96686dc606	Serialize tool calls for Arch FC (#131 ) * Serialize tool calls * fix int tests	2024-10-07 00:03:25 -07:00
Salman Paracha	b60ceb9168	model server build (#127 ) * first commit to have model_server not be dependent on Docker * making changes to fix the docker-compose file for archgw to set DNS_V4 and minor fixes with the build * additional fixes for model server to be separated out in the build * additional fixes for model server to be separated out in the build * fix to get model_server to be built as a separate python process. TODO: fix the embeddings logs after cli completes * fixing init to pull tempfile using the tempfile python package --------- Co-authored-by: Salman Paracha <salmanparacha@MacBook-Pro-261.local>	2024-10-06 18:21:43 -07:00
Adil Hafeez	2a747df7c0	don't compute embeddings for names and other fixes see description (#126 ) * serialize tools - 2 * fix int tests * fix int test * fix unit tests	2024-10-05 19:25:16 -07:00
Salman Paracha	701187474f	load_models checks for device before getting the BGE or NLI model loaded in memory. Was defaulting to CPU. And removed gunk for load_sql (#119 ) Co-authored-by: Salman Paracha <salmanparacha@MacBook-Pro-261.local>	2024-10-04 13:09:35 -07:00
Co Tran	7d38ef9719	Cotran/hallucination integration (#115 ) * fix fc integration * fix integration * remove file * Update arch_fc.py * create model server hallucination detection class	2024-10-04 11:05:25 -07:00
Salman Paracha	dc57f119a0	archgw cli (#117 ) * initial commit of the insurange agent demo, with the CLI tool * committing the cli * fixed some field descriptions for generate-prompt-targets * CLI works with buil, up and down commands. Function calling example works stand-alone * fixed README to install archgw cli * fixing based on feedback * fixing based on feedback --------- Co-authored-by: Salman Paracha <salmanparacha@MacBook-Pro-261.local>	2024-10-03 18:21:27 -07:00
Adil Hafeez	1b57a49c9d	add support for default target (#111 ) * add support for default target * add more fixes	2024-10-02 20:43:16 -07:00
Co Tran	ed50d29ccf	fix fc integration (#110 ) * fix fc integration * fix integration * remove file * Update arch_fc.py	2024-10-01 19:20:28 -07:00
Co Tran	17a643c410	ArchFC endpoint integration (#94 ) * integration * mopdify docker file * add params and fix python lint * fix empty context and tool calls * address comments * revert port * fix bug merge * fix environment * fix bug * fix compose * fix merge	2024-10-01 12:47:26 -07:00
Salman Paracha	8654d3d5c5	simplify developer getting started experience (#102 ) * Fixed build. Now, we have a bare bones version of the docker-compose file with only two services, archgw and archgw-model-server. Tested using CLI * some pre-commit fixes * fixed cargo formatting issues * fixed model server conflict changes --------- Co-authored-by: Salman Paracha <salmanparacha@MacBook-Pro-261.local>	2024-10-01 10:02:23 -07:00
Adil Hafeez	f4395d39f9	Fold function_resolver into model_server (#103 )	2024-10-01 09:13:50 -07:00
José Ulises Niño Rivera	b0ce5eca93	Rename bolt_config to arch_config (#100 ) Signed-off-by: José Ulises Niño Rivera <junr03@users.noreply.github.com>	2024-09-30 18:47:35 -07:00
Adil Hafeez	ea86f73605	rename envoyfilter => arch (#91 ) * rename envoyfilter => arch * fix more files * more fixes * more renames	2024-09-27 16:41:39 -07:00
Co Tran	d5d79256b0	remove guard config json (#70 ) * remove guard config json * formating	2024-09-24 13:33:31 -07:00
Adil Hafeez	eff4cd9826	improve response handling (#71 )	2024-09-23 22:56:35 -07:00
Co Tran	79b1c5415f	[Kan-103] add support toxic/jailbreak model (#49 ) * add toxic/jailbreak model * fix path loading model * fix syntax * fix bug,lint, format * fix bug * formatting * add parallel + chunking * fix bug * working version * fix onnnx name erorr * device * fix jailbreak config * fix syntax error * format * add requirement + cli download for dockerfile * add task * add skeleton change for envoy filter for prompt guard * fix hardware config * fix bug * add config changes * add gitignore * merge main * integrate arch-guard with filter * add hardware config * nothing * add hardware config feature * fix requirement * fix chat ui * fix onnx * fix lint * remove non intel cpu * remove onnx * working version * modify docker * fix guard time * add nvidia support * remove nvidia * add gpu * add gpu * add gpu support * add gpu support for compose * add gpu support for compose * add gpu support for compose * add gpu support for compose * add gpu support for compose * fix docker file * fix int test * correct gpu docker * upgrad python 10 * fix logits to be gpu compatible * default to cpu dockerfile * resolve comments * fix lint + unused parameters * fix * remove eetq install for cpu * remove deploy gpu --------- Co-authored-by: Adil Hafeez <adil@katanemo.com>	2024-09-23 12:07:31 -07:00
Adil Hafeez	31f26ef7ac	move demo functions out of model_server (#67 ) * pending * remove * fix docker build	2024-09-20 14:38:10 -07:00
Sampreeth Sarma	ed6a9139e6	demos for network copilot and sql analyzer (#57 ) * pulled from main branch after adding enums and made changes * added sql_analyzer folder and built a demo for Employee stats function calling. "top_employees" and "aggregate_stats". * sql_anayzer * After addressing PR comments * PR comments * PR comments * Addeed Network Analyzer FC Code * Added network Analyzer code for diff timeframes * Network Copilot and Employee Details demos are updated with their descriptions and resolved the PR comments * Added 2nd function in network copilot * Added 2nd function in network copilot * Added 2nd function in network copilot * Added 2nd function in network copilot * Added 2nd function in network copilot	2024-09-19 11:40:31 -07:00
Adil Hafeez	215d276acf	send all tools when sending request to arch-fc (#59 )	2024-09-18 15:54:40 -07:00
Adil Hafeez	060a0d665e	improve service names (#54 ) - embedding-server => model_server - public-types => public_types - chatbot-ui => chatbot_ui - function-calling => function_calling	2024-09-17 08:47:35 -07:00

30 commits