Tag: GPU

Sustainability in the Age of Local LLMs: Who’s Watching the Electricity Bill?

Post author By Danijel Soldo
Post date June 10, 2024

Created using Microsoft Bing Image Creator with prompt "Johannes Kepler measuring electricity with a llama standing behind him and observing shocked, digital art"

tl;dr The rise of AI, particularly large language models (LLMs) like ChatGPT, has been transformative, making advanced technology accessible and widely used. However, with their growing adoption comes a pressing concern: the sustainability of their energy consumption. This article explores a practical approach to evaluating the power usage of different LLMs using a sample chatbot […]

A personal AI assistant for developers that doesn’t phone home

Post author By Manuel Hahn
Post date November 6, 2023

Created using Microsoft Bing Image Creator with prompt "a robot assistant sitting at his desk programming while his human boss is in a sun chair giving instructions"

tl;dr It’s no surprise that developers are looking for ways to include powerful new technologies like AI Assistants to improve their workflow and productivity. However, many companies are reluctant to allow such technology due to concerns about privacy, security and IP law. This article addresses the concerns about privacy and security and describes how to […]

Develop faster, operate smart: A Kubernetes-native guide to AI application development

Post author By Manuel Hahn
Post date August 22, 2022

Image by Gerd Altmann from Pixabay

tl;dr A Kubernetes-native software engineering approach for the development of AI applications helps you increase developer productivity, optimize resource consumption as well as simplify operations. A hands-on demo of this approach can be seen here. Two-step development approach The usage of an AI/ML model in an application requires basically a two-step development approach. The first […]

How to install NVIDIA GPU Operator in OpenShift 4

Post author By Sebastian Dehn
Post date November 2, 2020

The NVIDIA GPU Operator is used to manage GPU nodes in OpenShift and make these GPUs consumable for application workloads in an OpenShift cluster. There are several use cases which fit e.g, AI/ML workloads, data analysis, 3D processing. All of these can be done within an OpenShift cluster with GPU power enabled. So, today I […]