Presentations | Hack Dojo

Sort by:

Searching for the Right Words: Bringing NLP to Apache solr through ONNX and OpenNLP

Conference: OpenAI + Data Forum 2022

Authors: Jeff Zemerick

2022-06-23

tldr - powered by Generative AI

Bringing NLP capabilities to Apache Solr through ONNX and OpenNLP

Apache OpenNLP is a Java-based NLP tool that has been around for over a decade and offers various capabilities such as tokenization, document classification, and named entity recognition
Apache Solr depends on Apache Lucene for search functionality, and Apache Lucene has a dependency on Apache OpenNLP for some NLP operations
The ONNX Runtime allows for the use of deep learning models across programming languages, architectures, and platforms, enabling the use of NLP services created in other languages
The speaker demonstrates how a deep learning model trained using PyTorch or Tensorflow can be used for inference from a Java search stack of Apache OpenNLP, Apache Lucene, and Apache Solr
The speaker discusses the challenges and relationships between OpenNLP, Lucene, and Solr, and provides resources for attendees to get started with these open source projects

Tags:

Natural language processing

Show 0 Comments

Accelerating High-Performance Machine Learning at Scale in Kubernetes

Conference: KubeCon + CloudNativeCon Europe 2022

Authors: Alejandro Saucedo, Elena Neroslavskaya

2022-05-18

tldr - powered by Generative AI

The presentation covers machine learning acceleration at scale, optimization of models, deployment to Kubernetes, and introduction of production cloud native tooling.

Running ML server locally is important to ensure everything works and debug any issues before deployment to production.
Other resources for CI/CD for production machine learning at scale, production machine learning monitoring, machine learning security, and machine learning ecosystem and operations.
Collaboration with Hugging Face team to access a pre-trained GPT2 model using their Transformers library.
Optimization of the model using ONNX serialization format.
Deployment to Kubernetes cluster after testing locally to ensure it works.
Anecdote about a computationally intensive dungeon crawler game that uses AI model for personalization.

Tags:

Show 0 Comments

Dates

Author

Conferences

Tags

Searching for the Right Words: Bringing NLP to Apache solr through ONNX and OpenNLP

tldr - powered by Generative AI

Accelerating High-Performance Machine Learning at Scale in Kubernetes

tldr - powered by Generative AI