Bringing RAG to Joule
Nvidia NeMo Retriever, a semantic-retrieval microservice unveiled last November that helps gen AI applications provide more accurate responses via retrieval-augmented generation (RAG), will bolster SAP’s Joule copilot.
RAG optimizes LLMs by giving them the ability to reference authoritative knowledge bases outside their training data.
“There are tons of documents that are not residing in an SAP system,” Herzig said. “Those might be your HR policy, your travel policy, compliance documents, your legal documents, that might be in a SharePoint or on a portal.”
Joule already has the power to answer simple questions, like, “How many vacation days do I have left?” That’s just a matter of the employee record. But Nvidia’s microservice empowers Joule to go a step further by giving it access to the HR policy and compare that with the employee record, too.
New features for data scientists, developers
The partnership is also exploring more than 20 gen AI use cases aimed at helping customers simplify digital transformation, including automating ERP with intelligent invoice matching in SAP S/4HANA Cloud, improving HR use cases via SAP SuccessFactors, and using gen AI insights from SAP Signavio to process business recommendations and optimize customer support processes.
Meanwhile, SAP is leveraging NVIDIA’s accelerated computing platforms and NVIDIA AI Enterprise data science software, including Nvidia Rapids, Rapids cuDF, and cuML, to make it easier for data scientists to access data and enhance ML workload performance in Datasphere. For developers, Nvidia AI foundry services will help them create domain-specific language code and fine-tune LLMs to write code in SAP’s Advanced Business Application Programming (ABAP) programming language.