Automating Tasks Securely with RAG and a Choice of LLMs @oracledevs

Oracle Developers | Automating Tasks Securely with RAG and a Choice of LLMs @oracledevs | Uploaded May 2024 | Updated October 2024, 6 hours ago.
In the effort to streamline repetitive tasks or automate them entirely, why not enlist the help of AI? Using a foundation model to automate repetitive tasks may sound appealing, but it may put confidential data at risk. Retrieval-augmented generation (RAG) is an alternative to fine-tuning, keeping inference data isolated from a model’s corpus.

We want to keep our inference data and model separated—but we also want a choice in which large language model (LLM) we use and a powerful GPU for efficiency. Imagine if you could do all of this with just one GPU!

In this demo, we’ll show how to deploy a RAG solution using a single NVIDIA A10 GPU; an open source framework such as LangChain, LlamaIndex, Qdrant, or vLLM; and a light 7-billion-parameter LLM from Mistral AI. It’s an excellent balance of price and performance and keeps inference data separated while updating the data as needed.

Day One and Beyond: DNS Architecture on OCI

Strengthen Oracle Database Cyber Defense and Recovery with Zero Data Loss Air Gapped Backups

Unlock the Potential of Digital Assistants in Oracles PeopleSoft

Búsquedas vectorizadas en Oracle Database 23ai con APEX!

Oracle Integration at Oracle CloudWorld 2024

Enterprise Knowledge Q&A with RAG and OCI Generative AI

Cloud Coaching - Oracle APEX-press bitte steigen Sie ein

Day One and Beyond: Deploying Reference Architectures

Cloud Coaching - Real-time insights and anomaly detection in ATM transactions

Build a System to Develop Employee Competencies Using Generative AI

JSON Relational Duality: Data Modeling with Hackolade