Aspirations

This cluster is not a finished product. It is a foundation. Here is what it can become.

Multi-tenant hosting platform

The current setup requires SSH access or NST Init to deploy apps. The vision is a self-service platform where any NST student can:

Push code to a GitHub repository
The platform detects it, builds a container image, and deploys it
The app gets a public URL automatically
Logs, metrics, and resource usage are visible in a dashboard

Think Vercel or Railway, but running on your own hardware. Students would learn not just how to use such a platform, but how to build one — because they built it.

GitOps-driven deployments

Fleet is already installed. The next step is connecting student repositories so that pushing to main triggers an automatic deployment. No more SSH-ing in and running kubectl. The Git repository becomes the single source of truth.

This teaches a workflow that is standard in industry: infrastructure as code, declarative deployments, and automated reconciliation.

Monitoring and observability

The cluster currently has no centralized monitoring. Adding Prometheus and Grafana would provide:

Real-time dashboards showing cluster health, resource usage, and pod status
Alerting when nodes go down or resources run low
Historical data for capacity planning

Students interested in SRE (Site Reliability Engineering) could build and maintain this stack.

CI/CD pipelines

The container registry is in place. The next step is building CI/CD pipelines that:

Run tests on every pull request
Build container images automatically
Push to the cluster registry
Deploy to a staging environment
Promote to production on merge

This could use GitHub Actions, Tekton (Kubernetes-native CI), or Drone CI (self-hosted).

LLM and AI workloads

No GPUs yet, but the cluster could support:

CPU-based inference for smaller models (quantized LLMs, embeddings)
STT/TTS pipelines — speech-to-text and text-to-speech using Whisper and similar models
Batch processing for AI experiments and training data preparation

Adding even one GPU node would open up:

Stable Diffusion (already experimented with ComfyUI)
Fine-tuning small language models
Real-time inference APIs for student projects

Distributed computing lab

The cluster is a natural fit for teaching distributed systems concepts:

Map-reduce style workloads across nodes
Distributed databases (CockroachDB, Cassandra)
Message queues (Kafka, RabbitMQ)
Service mesh (Istio, Linkerd)
Chaos engineering — intentionally breaking things to test resilience

Edge computing and IoT

With Raspberry Pi nodes or similar edge devices, the cluster could extend to:

IoT data collection and processing
Edge ML inference
Sensor networks with central aggregation
Campus-wide distributed systems

Internal developer tools

Gitea or Forgejo — self-hosted Git (replace dependency on GitHub)
Minio — S3-compatible object storage
Vault — secrets management
Harbor — enterprise container registry with vulnerability scanning
Argo Workflows — Kubernetes-native workflow engine

Student-run infrastructure

The long-term aspiration is that students own and operate this cluster. Not just deploy apps on it, but:

Perform maintenance and upgrades
Add new nodes and services
Write documentation and runbooks
Respond to incidents
Mentor incoming cohorts

This transforms the cluster from a teaching tool into a living project that teaches infrastructure, operations, teamwork, and institutional knowledge transfer.

Hardware expansion

Current capacity: 5 nodes, 48 GB RAM, no GPUs.

Possible growth paths:

Add more worker nodes (8-16 GB each)
Add a GPU node for AI/ML workloads
Add NAS/SAN for shared storage
Add a dedicated monitoring node
Build a high-availability control plane (3 server nodes)

What you can do today

You do not need to wait for any of this to be built. Pick something from this list and start building it. The cluster is yours.

Aspirations ​

Multi-tenant hosting platform ​

GitOps-driven deployments ​

Monitoring and observability ​

CI/CD pipelines ​

LLM and AI workloads ​

Distributed computing lab ​

Edge computing and IoT ​

Internal developer tools ​

Student-run infrastructure ​

Hardware expansion ​

What you can do today ​