Sustainability in the Age of Local LLMs: Who’s Watching the Electricity Bill?

tl;dr The rise of AI, particularly large language models (LLMs) like ChatGPT, has been transformative, making advanced technology accessible and widely used. However, with their growing adoption comes a pressing concern: the sustainability of their energy consumption. This article explores a practical approach to evaluating the power usage of different LLMs using a sample chatbot […]

A personal AI assistant for developers that doesn’t phone home

tl;dr It’s no surprise that developers are looking for ways to include powerful new technologies like AI Assistants to improve their workflow and productivity. However, many companies are reluctant to allow such technology due to concerns about privacy, security and IP law. This article addresses the concerns about privacy and security and describes how to […]

Develop faster, operate smart: A Kubernetes-native guide to AI application development

tl;dr A Kubernetes-native software engineering approach for the development of AI applications helps you increase developer productivity, optimize resource consumption as well as simplify operations. A hands-on demo of this approach can be seen here. Two-step development approach The usage of an AI/ML model in an application requires basically a two-step development approach. The first […]

How to install NVIDIA GPU Operator in OpenShift 4

The NVIDIA GPU Operator is used to manage GPU nodes in OpenShift and make these GPUs consumable for application workloads in an OpenShift cluster. There are several use cases which fit e.g, AI/ML workloads, data analysis, 3D processing. All of these can be done within an OpenShift cluster with GPU power enabled. So, today I […]