Compare commits
3 Commits
master
...
testing_th
Author | SHA1 | Date | |
---|---|---|---|
![]() |
a78771439f | ||
![]() |
b38ca9f1e0 | ||
![]() |
814aab900c |
@ -1,61 +1,45 @@
|
||||
name: Build and Push Image
|
||||
on:
|
||||
push:
|
||||
branches:
|
||||
- master
|
||||
on:
|
||||
push:
|
||||
branches:
|
||||
- master
|
||||
|
||||
jobs:
|
||||
build:
|
||||
name: Build and push image
|
||||
runs-on: ubuntu-latest
|
||||
container: catthehacker/ubuntu:act-latest
|
||||
if: gitea.ref == 'refs/heads/master'
|
||||
build:
|
||||
name: Build and push image
|
||||
runs-on: ubuntu-latest
|
||||
container: catthehacker/ubuntu:act-latest
|
||||
if: gitea.ref == 'refs/heads/master'
|
||||
|
||||
steps:
|
||||
- name: Checkout
|
||||
uses: actions/checkout@v4
|
||||
steps:
|
||||
- name: Checkout
|
||||
uses: actions/checkout@v4
|
||||
|
||||
- name: Create Kubeconfig
|
||||
run: |
|
||||
mkdir $HOME/.kube
|
||||
echo "${{ secrets.KUBEC_CONFIG_BUILDX_NEW }}" > $HOME/.kube/config
|
||||
- name: Create Kubeconfig
|
||||
run: |
|
||||
mkdir $HOME/.kube
|
||||
echo "${{ secrets.KUBEC_CONFIG_BUILDX }}" > $HOME/.kube/config
|
||||
|
||||
- name: Set up Docker Buildx
|
||||
uses: docker/setup-buildx-action@v3
|
||||
with:
|
||||
driver: kubernetes
|
||||
driver-opts: |
|
||||
namespace=gitea-runner
|
||||
qemu.install=true
|
||||
- name: Set up Docker Buildx
|
||||
uses: docker/setup-buildx-action@v3
|
||||
with:
|
||||
driver: kubernetes
|
||||
driver-opts: |
|
||||
namespace=gitea-runner
|
||||
qemu.install=true
|
||||
|
||||
- name: Login to Docker Registry
|
||||
uses: docker/login-action@v3
|
||||
with:
|
||||
registry: git.aridgwayweb.com
|
||||
username: armistace
|
||||
password: ${{ secrets.REG_PASSWORD }}
|
||||
- name: Login to Docker Registry
|
||||
uses: docker/login-action@v3
|
||||
with:
|
||||
registry: git.aridgwayweb.com
|
||||
username: armistace
|
||||
password: ${{ secrets.REG_PASSWORD }}
|
||||
|
||||
- name: Build and push
|
||||
uses: docker/build-push-action@v5
|
||||
with:
|
||||
context: .
|
||||
push: true
|
||||
platforms: linux/amd64,linux/arm64
|
||||
tags: |
|
||||
git.aridgwayweb.com/armistace/blog:latest
|
||||
|
||||
- name: Deploy
|
||||
run: |
|
||||
echo "Installing Kubectl"
|
||||
apt-get update
|
||||
apt-get install -y apt-transport-https ca-certificates curl gnupg
|
||||
curl -fsSL https://pkgs.k8s.io/core:/stable:/v1.33/deb/Release.key | gpg --dearmor -o /etc/apt/keyrings/kubernetes-apt-keyring.gpg
|
||||
chmod 644 /etc/apt/keyrings/kubernetes-apt-keyring.gpg
|
||||
echo 'deb [signed-by=/etc/apt/keyrings/kubernetes-apt-keyring.gpg] https://pkgs.k8s.io/core:/stable:/v1.33/deb/ /' | tee /etc/apt/sources.list.d/kubernetes.list
|
||||
chmod 644 /etc/apt/sources.list.d/kubernetes.list
|
||||
apt-get update
|
||||
apt-get install kubectl
|
||||
kubectl delete namespace blog
|
||||
kubectl create namespace blog
|
||||
kubectl create secret docker-registry regcred --docker-server=${{ vars.DOCKER_SERVER }} --docker-username=${{ vars.DOCKER_USERNAME }} --docker-password='${{ secrets.DOCKER_PASSWORD }}' --docker-email=${{ vars.DOCKER_EMAIL }} --namespace=blog
|
||||
kubectl apply -f kube/blog_pod.yaml && kubectl apply -f kube/blog_deployment.yaml && kubectl apply -f kube/blog_service.yaml
|
||||
- name: Build and push
|
||||
uses: docker/build-push-action@v5
|
||||
with:
|
||||
context: .
|
||||
push: true
|
||||
platforms: linux/amd64,linux/arm64
|
||||
tags: |
|
||||
git.aridgwayweb.com/armistace/blog:latest
|
||||
|
@ -1,24 +0,0 @@
|
||||
apiVersion: apps/v1
|
||||
kind: Deployment
|
||||
metadata:
|
||||
name: blog-deployment
|
||||
labels:
|
||||
app: blog
|
||||
namespace: blog
|
||||
spec:
|
||||
replicas: 3
|
||||
selector:
|
||||
matchLabels:
|
||||
app: blog
|
||||
template:
|
||||
metadata:
|
||||
labels:
|
||||
app: blog
|
||||
spec:
|
||||
containers:
|
||||
- name: blog
|
||||
image: git.aridgwayweb.com/armistace/blog:latest
|
||||
ports:
|
||||
- containerPort: 8000
|
||||
imagePullSecrets:
|
||||
- name: regcred
|
@ -1,13 +0,0 @@
|
||||
apiVersion: v1
|
||||
kind: Pod
|
||||
metadata:
|
||||
name: blog
|
||||
namespace: blog
|
||||
spec:
|
||||
containers:
|
||||
- name: blog
|
||||
image: git.aridgwayweb.com/armistace/blog:latest
|
||||
ports:
|
||||
- containerPort: 8000
|
||||
imagePullSecrets:
|
||||
- name: regcred
|
@ -1,13 +0,0 @@
|
||||
apiVersion: v1
|
||||
kind: Service
|
||||
metadata:
|
||||
name: blog-service
|
||||
namespace: blog
|
||||
spec:
|
||||
type: NodePort
|
||||
selector:
|
||||
app: blog
|
||||
ports:
|
||||
- port: 80
|
||||
targetPort: 8000
|
||||
nodePort: 30009
|
@ -1,41 +0,0 @@
|
||||
Title: Apple And The Anti-Dev Platform
|
||||
Date: 2025-08-28 20:00
|
||||
Modified: 2025-08-28 20:00
|
||||
Category: Tech, Software, Apple
|
||||
Tags: Tech, Software, Apple
|
||||
Slug: apple-anti-dev
|
||||
Authors: Andrew Ridgway
|
||||
Summary: Apples requirements for developers are onerous, I detail some of the frustrations I've had whilst dealing with the platform to deploy a small app as part of my day job
|
||||
|
||||
## Introduction: Why I Hate Loving to Hate Apple
|
||||
|
||||
This week, I found myself in the unenviable position of using MacOS for work. It was like revisiting an old flame only to realize they’ve become *that* person—still attractive from afar, but toxic up close. Let me clarify: I’m not anti-Apple per se. I appreciate their design aesthetic as much as anyone. But when you’re a developer, especially one with a penchant for Linux and a deep love for open-source, Apple’s ecosystem feels like walking into a store where the sign says "Employee Discounts" but they charge you double for the privilege.
|
||||
|
||||
## 1. The Hardware-Software Tie-In: Why Buy New Every Year?
|
||||
|
||||
Let’s talk about my borrowed MacBook from 2020. It was a kind gesture, right? But here’s the kicker: this machine, which was cutting-edge just five years ago, is now deemed too old to run the latest MacOS. I needed Xcode for a project, and guess what? You can’t run the latest version of Xcode without the latest MacOS. So, to paraphrase: "Sorry, but your device isn’t *new enough* to develop on the Apple platform anymore." This isn’t just inconvenient; it’s a deliberate strategy to force upgrades. It’s like buying a car that requires you to upgrade your entire garage every year just to keep it running.
|
||||
|
||||
## 2. Forced Obsolescence: The New "Upgrade" Cycle
|
||||
|
||||
Yes, Microsoft did the whole TPM 2.0 thing with Windows 11. But Apple takes it to another level. They’ve turned hardware into a subscription model without you even realizing it. You buy a device, and within a few years, it’s obsolete for their latest software and tools. This isn’t about security or innovation—it’s about control. Why release an operating system that only works on devices sold in the last 12 months? It creates a false market for "new" hardware, padding Apple’s margins at the expense of developers and users.
|
||||
|
||||
## 3. High Costs: The Developer Fee That Keeps On Giving
|
||||
|
||||
I honestly believe this actually boils down to money? To develop on Apple’s platform, you need an Apple Developer account. This costs $150 AUD a year. Now, if I were to buy a new MacBook Pro today, that would set me back around $2,500 AUD. And for what? The privilege of being able to build apps on my own device? It’s like paying a toll every year just to use the road you already own. It’s enough to make you consider a career change and become a sheep farmer.
|
||||
|
||||
## 4. Lack of Freedom: Who Owns the Device Anyway?
|
||||
|
||||
Here’s where it gets really egregious: Apple’s developer review process. It’s like being subjected to a TSA pat-down every time you want to build something, even if it's just for your own device. To deploy ANYTHING onto an IOS device I need to hand my Government issued license over to Apple and let them "check I'm a real person". And no this isn't just for the app store deployments, which I can understand. This is for any deployment, it's the only way to get a certificate to cross sign on the app and device... Google might be heading down a similar path, but at least you'll be able to on custom Android ROmS. On Apple, it feels like every step is designed to remind you that you’re dancing in their sandbox—and they call the shots. If you use IOS you have to dance to their tune AT ALL TIMES.
|
||||
|
||||
## 5. The "Apple Tax": A Future Job Requirement
|
||||
|
||||
I think all developers and consultants should demand an "Apple Tax." It will be simple:
|
||||
|
||||
* $5,000 AUD for new Apple hardware.
|
||||
* An additional 25% markup on development hours spent navigating Apple’s ecosystem.
|
||||
|
||||
Why? Because it's time developers passed on these costs to the users. It's time to make this hurt the consumers who insist on using these products with predatory business models for developers. Yes, developers go where the market is, but it's time to start charging that market so it understands the true cost to be there.
|
||||
|
||||
## Conclusion: Why I’ll Keep Hating Loving to Hate Apple
|
||||
|
||||
Apple’s ecosystem feels like a love story gone wrong—a relationship where one party keeps raising the stakes just to remind you of how much they control everything. Developers are supposed to be the disruptors, the rebels who challenge the status quo. But when your tools are designed to keep you tethered to a specific platform and its outdated business model, it feels less like innovation and more like indentured servitude. If you’re still enamored with Apple’s ecosystem and think it’s “just part of the game,” I urge you to take a long, hard look in the mirror. Because if this is your idea of progress, we’re all in trouble.
|
@ -1,87 +0,0 @@
|
||||
Title: GPT OSS - Is It Embrace, Extend, Extenguish
|
||||
Date: 2025-08-12 20:00
|
||||
Modified: 2025-08-14 20:00
|
||||
Category: Politics, Tech, AI
|
||||
Tags: politics, tech, Ai
|
||||
Slug: gpt-oss-eee
|
||||
Authors: Andrew Ridgway
|
||||
Summary: GPT OSS is here from Open AI, the first open weight model from them since GPT-2. My question is... why now?
|
||||
|
||||
# Human Introduction
|
||||
This has been a tough one for the publishing house to get right. I've had it generate 3 different drafts and this is still the result of quite the edit. Today's blog was written by:
|
||||
|
||||
1. Gemma:27b - Editor
|
||||
2. GPT-OSS - Journalist
|
||||
3. Qwen3:14b - Journalist
|
||||
4. phi4:latest - Journalist
|
||||
5. deepseek-r1:14b - journalist
|
||||
|
||||
The big change from last time is the addition of gpt-oss, which is of course the focus of hte topic today. It's quite the open weight model, haven't played with the tooling yet but I'm exceited to see what it can do, even if I do have questions.
|
||||
|
||||
Anyways, without further ado! GPT-OSS is it EEE? written by AI... For AI?
|
||||
|
||||
# GPT OSS - Is It EEE?
|
||||
|
||||
## Introduction: The Return of OpenAI (With Some Questions)
|
||||
|
||||
This week, the AI world got a bit busier than usual. OpenAI dropped their [**GPT-OSS**](https://openai.com/index/introducing-gpt-oss/) models, and it feels like they’re trying to make up for lost time—or maybe just remind everyone that they’re still in the game. The release has sparked a lot of excitement, but also some confusion. Are these models really as good as they claim? And why now? Let’s break this down with all the drama, intrigue, and a dash of humor you’ve come to expect from your friendly neighborhood tech writer.
|
||||
|
||||
## What Exactly Is GPT-OSS Anyway?
|
||||
|
||||
OpenAI has thrown two models into the ring:
|
||||
|
||||
1. **GPT-oss-120b**: A hefty 120 billion parameter model that they’re claiming can “hold its own” against their own **o4-mini** (which is *incredibly* expensive to run). The kicker? It apparently does this on a single 80GB GPU. That’s impressive if true, but let’s not get carried away just yet.
|
||||
2. **GPT-oss-20b**: The smaller sibling that’s currently helping me draft this very blog post. OpenAI says it’s on par with their **o3-mini** and can run on a measly 16GB of memory. That makes it perfect for edge devices, local inference, or when you don’t want to spend your life savings on cloud credits.
|
||||
|
||||
Both models are also supposed to be ace at tool use, few-shot function calling, CoT reasoning, and even health-related tasks—outperforming some proprietary models like GPT-4 in certain cases. Impressive? Sure. But let’s not forget that OpenAI has a history of making bold claims.
|
||||
|
||||
## The Great AI Model Exodus: Why We’re Here
|
||||
|
||||
Over the past year or so, the AI community has been moving away from GPT-based models—not because they were bad (they weren’t), but because they were closed-source and expensive to use at scale. Developers wanted more control, transparency, and affordability. Enter the rise of open-source and open-weight models like:
|
||||
|
||||
* **Google’s Gemini (Gemma)** series
|
||||
* **Microsoft’s Phi** series (yes, that Microsoft—ironically, OpenAI is a subsidiary)
|
||||
* The **Qwen** series
|
||||
* And others like **Llama** and **Deepseek**
|
||||
|
||||
These models have been a breath of fresh air for developers. They’re free to use, tweak, and integrate into projects without worrying about pesky API limits or astronomical costs. It’s like the AI world finally got its own version of Linux—except with neural networks. But then OpenAI showed up with GPT-OSS. And now everyone is asking: Why?
|
||||
|
||||
## Is This an Embrace-Extend-Extinguish Play?
|
||||
|
||||
Ah, the classic **Embrace, Extend, Extinguish** strategy. If you’re not familiar, it’s a business tactic where a company adopts (embrace) an existing standard or technology, extends it with their own features, and then slowly extinguishes the competition by making their version incompatible or superior.
|
||||
|
||||
Now, I’m not accusing OpenAI of anything here—just pointing out that they’re a Microsoft subsidiary, and Microsoft has a history of such strategies. Whether this is intentional or just good business sense is up for debate. But let’s think about it:
|
||||
|
||||
* OpenAI has dominated the consumer AI market with their **ChatGPT** and other tools.
|
||||
* They’ve been losing ground in the developer market, where models like [Gemini](https://deepmind.google/models/gemini/pro/) and particularly [Claude (Anthropic)](https://claude.ai/) are gaining traction in the proprietary space.
|
||||
* Now they’re releasing open weight models that promise to compete at GPT-4 levels to try and bring in the Deepseek and Qwen crowd.
|
||||
|
||||
The timing feels a bit too convenient. OpenAI is essentially saying: “We get it. You want local, affordable, and flexible AI? We’ve got you covered.” But will this be enough to win back the developer community? Or are they just delaying the inevitable?
|
||||
|
||||
## The Real Power of Local Models
|
||||
|
||||
Let’s not sugarcoat it: For developers, the real value of AI isn’t in chatbots or viral social media trends. It’s in building tools that can automate, analyze, and enhance existing workflows. Think:
|
||||
|
||||
* Summarizing thousands of documents in seconds.
|
||||
* Automating customer support with natural language processing.
|
||||
* Creating dynamic content for apps and websites on the fly.
|
||||
|
||||
This is where AI shines—and where OpenAI has been losing market and mind share. Their focus on consumer-facing tools like ChatGPT has made them a household name, but it’s also left developers feeling overlooked. Now, with GPT-OSS, OpenAI is trying to bridge that gap. But will they succeed? Or are they just too late to the party?
|
||||
|
||||
## The Dark Side of Monocultures
|
||||
|
||||
One thing I’m deeply concerned about is the potential for a monoculture in AI. If OpenAI manages to dominate the open-source space with GPT-OSS, we could end up in a world where everyone uses variations of the same model. It’s not just about vendor lock-in—it’s about stifling innovation. When every developer uses the same tools and approaches, we lose the diversity that drives progress.
|
||||
|
||||
I want to see a future where there are **multiple open-source or at the very least open weight models**, each with their own strengths and weaknesses. That way, developers can choose what works best for their needs instead of being forced into one ecosystem.
|
||||
|
||||
## Testing the Waters: My Journey With GPT-OSS
|
||||
|
||||
This blog post was partly written by GPT-oss-20b. It’s fast, it’s local, and it’s surprisingly good at generating content. But is it better than open weight alternatives like Deepseek or Gemma (the open weight gemini)? That’s the million-dollar question.
|
||||
|
||||
I’ve been testing out various models for my own projects, and I can say this much: GPT-OSS feels like a solid contender. It’s fast, easy to integrate, and—dare I say it—fun to work with. But until I put it head-to-head with other models, I won’t be ready to crown it the king of AI.
|
||||
|
||||
## Final Thoughts: The Future of AI is in Our Hands
|
||||
|
||||
The release of GPT-OSS is a big deal—not just for OpenAI, but for the entire AI community. It’s a reminder that even closed-source giants can (and should) listen to their users. But let’s not get carried away. OpenAI isn’t the only game in town anymore. Models like Gemini, Claude in the proprietary space, and Qwen and Llama in open source space are proving that diversity is key to innovation.
|
||||
|
||||
As developers, we have the power to choose which models succeed—and by extension, shape the future of AI. Let’s make sure we’re making choices that benefit the community as a whole, not just a single company. After all, the last thing we need is another **AI monoculture**.
|
@ -1,114 +0,0 @@
|
||||
Title: Integrating Ollama and Matrix with Baibot
|
||||
Date: 2025-06-25 20:00
|
||||
Modified: 2025-06-30 08:00
|
||||
Category: AI, Data, Matrix
|
||||
Tags: ai, kubernetes, matrix
|
||||
Slug: ollama-matrix-integration
|
||||
Authors: Andrew Ridgway
|
||||
Summary: Integrating a Local LLM to a personal matrix server all the fun AND data sovereignty
|
||||
|
||||
### _Human Introduction_
|
||||
I've been experimenting with AI and integrations I'm particuarly excited by the idea of using LLM's to integrate between different systems (Stay tuned for a blog [MCP](https://modelcontextprotocol.io/introduction) at some point in the future!)
|
||||
|
||||
Below I've thrown together some notes and had AI build a very quick how to on a cool little project that took next to no time to put together that I thought might be interesting for the group.. Enjoy!
|
||||
|
||||
|
||||
|
||||
# Matrix AI Integrations with baibot: A Fun Journey into Home Automation and LLMs
|
||||
|
||||
Alright, so I’ve been messing around with this cool project called **baibot**, which is a locally deployable bot for integrating Large Language Models (LLMs) into Matrix chatrooms. If you’re anything like me, you run your own Matrix server to keep things private and under control—whether it’s for family communication or interacting with the tech community. But one day, I thought, “Why not have my LLMs right where I’m already managing everything else?” Enter baibot.
|
||||
|
||||
**Setting Up My Own Matrix Server with baibot**
|
||||
|
||||
First off, I’ve got a home Matrix server running Element. Integrating baibot into this environment makes sense because it allows me to connect directly via the same platform. The key was getting the configuration right using examples from [baibot’s GitHub](https://github.com/etkecc/baibot/blob/main/docs/sample-provider-configs/ollama.yml). For instance, connecting to an Ollama gemma3 model with a specific prompt ensures it’s lighthearted yet responsive:
|
||||
|
||||
```yaml
|
||||
base_url: http://<my_ollama_ip>:11434/v1
|
||||
text_generation:
|
||||
model_id: gemma3:latest
|
||||
prompt: 'You are a lighthearted bot...'
|
||||
temperature: 0.9
|
||||
max_response_tokens: 4096
|
||||
max_context_tokens: 128000
|
||||
```
|
||||
|
||||
This gives me precise control over the bot’s behavior, ensuring each instance in Matrix rooms behaves exactly as intended.
|
||||
|
||||
**Deploying to Kubernetes**
|
||||
|
||||
To ensure reliability, I used Kubernetes. Here's a breakdown of the key files:
|
||||
|
||||
* **Deployment.yaml**: Manages pod replicas, security contexts, and volume mounts for persistence.
|
||||
|
||||
```yaml
|
||||
apiVersion: apps/v1
|
||||
kind: Deployment
|
||||
metadata:
|
||||
labels:
|
||||
app: ridgway-bot
|
||||
name: ridgway-bot
|
||||
spec:
|
||||
replicas: 1
|
||||
strategy:
|
||||
type: Recreate
|
||||
template:
|
||||
spec:
|
||||
containers:
|
||||
- image: ghcr.io/etkecc/baibot:v1.7.4
|
||||
name: baibot
|
||||
volumeMounts:
|
||||
- name: ridgway-bot-cm
|
||||
mountPath: /app/config.yml
|
||||
- name: ridgway-bot-pv
|
||||
mountPath: /data
|
||||
volumes:
|
||||
- name: ridgway-bot-cm
|
||||
configMap:
|
||||
name: ridgway-bot
|
||||
- name: ridgway-bot-pv
|
||||
persistentVolumeClaim:
|
||||
claimName: ridgway-bot-storage
|
||||
```
|
||||
|
||||
* **Persistent Volume Claim (PVC)** ensures data storage for baibot.
|
||||
|
||||
```yaml
|
||||
apiVersion: v1
|
||||
kind: PersistentVolumeClaim
|
||||
metadata:
|
||||
name: ridgway-bot-storage
|
||||
spec:
|
||||
accessModes:
|
||||
- ReadWriteMany
|
||||
resources:
|
||||
requests:
|
||||
storage: 500Mi
|
||||
```
|
||||
|
||||
The deployment script handles namespace creation, config maps, PVCs, and waits for the pod to be ready before copying data.
|
||||
|
||||
**Integrating with OpenWebUI for RAG**
|
||||
|
||||
Another cool aspect is integrating baibot with **OpenWebUI**, which acts as an OpenAI-compatible API. This allows me to leverage models I’ve created in OpenWebUI that include knowledge bases (RAG). The config here uses OpenWebUI’s endpoints:
|
||||
|
||||
```yaml
|
||||
base_url: 'https://<my-openwebui-endpoint>/api/'
|
||||
api_key: <my-openwebui-api-key>
|
||||
text_generation:
|
||||
model_id: andrew-knowledge-base
|
||||
prompt: 'Your name is Rodergast...'
|
||||
```
|
||||
|
||||
This setup lets me access RAG capabilities directly within Matrix chats, all without writing a single line of code. It’s like having my very own AI research assistant right there in the chatroom.
|
||||
|
||||
**Future Steps and Challenges**
|
||||
|
||||
Now that baibot is up and running, I’m already thinking about expanding its use cases. The next step might be integrating it with **Home Assistant** for alarm notifications or other automation tasks. However, my current setup uses an older gaming PC, which struggles with computational demands. This could lead to a rearchitecting effort—perhaps moving to a dedicated server or optimizing the hardware.
|
||||
|
||||
**Conclusion**
|
||||
|
||||
Baibot has been a fantastic tool for experimenting with AI integrations in Matrix. By leveraging existing infrastructure and OpenWebUI’s capabilities, I’ve achieved full control over data privacy and customization. The next frontier is expanding these integrations into more practical applications like home automation. Stay tuned for updates!
|
||||
|
||||
**Final Thoughts**
|
||||
|
||||
It’s incredibly rewarding to see how open-source projects like baibot democratize AI access. Whether you’re a hobbyist or a pro, having tools that let you run LLMs locally without vendor lock-in is game-changing. If you’re interested in diving deeper, check out the [baibot GitHub](https://github.com/etkecc/baibot) and explore its documentation. Happy coding!
|
@ -1,93 +0,0 @@
|
||||
Title: MCP and Ollama - Local Assistant is getting nearer
|
||||
Date: 2025-07-24 20:00
|
||||
Modified: 2025-07-24 20:00
|
||||
Category: AI
|
||||
Tags: tech, ai, ollama, mcp, ai-tools
|
||||
Slug: mcp-ollama-local-assistant-soon
|
||||
Authors: Andrew Ridgway
|
||||
Summary: An Exploration of the Model Context Protocol and its potential to revolutionise how we interact with AI
|
||||
|
||||
## Human Introduction
|
||||
So for today's blog I've upped the model paramters on both the editors and a couple drafters.. and I have to say I think we've nailed what my meagre hardware can achieve in terms of content production. The process take 30 more minutes than before to churn now but that quality output more than makes up for it. For context we are now using:
|
||||
|
||||
- _Editor_: Gemma3:27b
|
||||
- _Journalist 1_: phi4-mini:latest
|
||||
- _Journalist 2_: phi4:latest
|
||||
- _Journalist 3_: deepseek-r1:14b <-> _I know but it **is** good even if it won't talk about Tiananmen Square_
|
||||
- _Journalist 4_: qwen3:14b
|
||||
|
||||
As you can see if you compare some of the other blogs this blog has really nailed tone and flow. Some of the content was wrong.. it thought I "wrote" [MCPO](https://github.com/open-webui/mcpo), I didn't, I wrapped it, and the sign off was very cringe but otherwise the blog is largely what came out from the editor.
|
||||
|
||||
As I get better hardware and can run better models, I fully see this being something that could potentially not need much editing on this side.. have to see how it goes moving forward... anyways, without futher adieu, Behold.. MCP and Ollama - A blog _**about**_ AI _**by**_ AI
|
||||
|
||||
## Introduction: Beyond the Buzzwords – A Real Shift in AI
|
||||
|
||||
For the last couple of weeks, I’ve been diving deep into **MCP** – both for work and personal projects. It’s that weird intersection where hobbies and professional life collide. Honestly, I was starting to think the whole AI hype was just that – hype. But MCP? It’s different. It’s not just another buzzword; it feels like a genuine shift in how we interact with AI. It’s like finally getting a decent internet connection after years of dial-up.
|
||||
|
||||
The core of this change is the **Model Context Protocol** itself. It’s an open specification, spearheaded by **Anthropic**, but rapidly gaining traction across the industry. Google’s thrown its weight behind it with [MCP Tools](https://google.github.io/adk-tools/mcp-tools/), and Amazon’s building it into [Bedrock Agent Core](https://aws.amazon.com/bedrock/agent-core/). Even Apple, with its usual air of exclusivity, is likely eyeing this space.
|
||||
|
||||
## What *Is* MCP, Anyway? Demystifying the Protocol
|
||||
|
||||
Okay, let’s break it down. **MCP** is essentially a standardized way for **Large Language Models (LLMs)** to interact with **tools**. Think of it as giving your AI a set of keys to your digital kingdom. Instead of just *talking* about doing things, it can actually *do* them.
|
||||
|
||||
Traditionally, getting an LLM to control your smart home, access your code repository, or even just send an email required a ton of custom coding and API wrangling. MCP simplifies this process by providing a common language and framework. It’s like switching from a bunch of incompatible power adapters to a universal charger.
|
||||
|
||||
The beauty of MCP is its **openness**. It’s not controlled by a single company, which fosters innovation and collaboration. It’s a bit like the early days of the internet – a wild west of possibilities.
|
||||
|
||||
## My MCP Playground: Building a Gateway with mcpo
|
||||
|
||||
I wanted to get my hands dirty, so I built a little project wrapping [**mcpo**](https://github.com/open-webui/mcpo) in a container that can pull in config to create a containerised service. It’s a gateway that connects **OpenWebUI** – a fantastic tool for running LLMs locally – with various **MCP servers**.
|
||||
|
||||
The goal? To create a flexible and extensible platform for experimenting with different AI agent tools within my build pipeline. I wanted to be able to quickly swap out different models, connect to different services, and see what happens. It’s a bit like having a LEGO set for AI – you can build whatever you want.
|
||||
|
||||
You can check out the project [here](https://git.aridgwayweb.com/armistace/mcpo_mcp_servers). If you’re feeling adventurous, I encourage you to clone it and play around. I’ve got it running in my **k3s cluster** (a lightweight Kubernetes distribution), but you can easily adapt it to Docker or other containerization platforms.
|
||||
|
||||
## Connecting the Dots: Home Assistant and Gitea Integration
|
||||
|
||||
Right now my wrapper supports two key services: **Home Assistant** and **Gitea**.
|
||||
|
||||
**Home Assistant** is my smart home hub – it controls everything from the lights and thermostat to the security system. Integrating it with mcpo allows me to control these devices using natural language commands. Imagine saying, “Hey AI, dim the lights and play some jazz,” and it just happens. It’s like living in a sci-fi movie.
|
||||
|
||||
**Gitea** is my self-hosted Git service – it’s where I store all my code. Integrating it with mcpo allows me to use natural language to manage my repositories, create pull requests, and even automate code reviews. It’s like having a personal coding assistant.
|
||||
|
||||
I initially built a custom **Gitea MCP server** to get familiar with the protocol. But the official **Gitea-MCP** project ([here](https://gitea.com/gitea/gitea-mcp)) is much more robust and feature-rich. It’s always best to leverage existing tools when possible.f
|
||||
|
||||
Bringing in new MCP servers should be as simple as updating the config to provide a new endpoint and, if using stdio, updating the build script to bring in the mcp binary or git repo with the mcp implementation you want to use.
|
||||
|
||||
## The Low-Parameter Model Challenge: Balancing Power and Efficiency
|
||||
|
||||
I’m currently experimenting with **low-parameter models** like **Qwen3:4B** and **DeepSeek-R1:14B**. These models are relatively small and efficient, which makes them ideal for running on local hardware. However, they also have limitations.
|
||||
|
||||
One of the biggest challenges is getting these models to understand complex instructions. They require very precise and detailed prompts. It’s like explaining something to a child – you have to break it down into simple steps.
|
||||
|
||||
Another challenge is managing the context window. These models have a limited memory, so they can only remember a certain amount of information. This can make it difficult to have long and complex conversations.
|
||||
|
||||
## The Future of AI Agents: Prompt Engineering and Context Management
|
||||
|
||||
I believe the future of AI lies in the development of intelligent **agents** that can seamlessly interact with the world around us. These agents will need to be able to understand natural language, manage complex tasks, and adapt to changing circumstances.
|
||||
|
||||
**Prompt engineering** will be a critical skill for building these agents. We’ll need to learn how to craft prompts that elicit the desired behavior from the models. Almost like coding in a way but with far less structure and no need to understand the "syntax". But we're a long way from here yet
|
||||
|
||||
**Context management** will also be crucial. We’ll need to develop techniques for storing and retrieving relevant information, so the models can make informed decisions.
|
||||
|
||||
## Papering Over the Cracks: Using MCP to Integrate Legacy Systems
|
||||
|
||||
At my workplace, we’re exploring how to use MCP to integrate legacy systems. Many organizations have a patchwork of different applications and databases that don’t easily communicate with each other.
|
||||
|
||||
MCP can act as a bridge between these systems, allowing them to share data and functionality. It’s like building a universal translator for your IT infrastructure.
|
||||
|
||||
This can significantly reduce the cost and complexity of integrating new applications and services, if we get the boilerplate right.
|
||||
|
||||
## Conclusion: The Dawn of a New Era in AI
|
||||
|
||||
MCP is not a silver bullet, but it’s a significant step forward in the evolution of AI. It provides a standardized and flexible framework for building intelligent agents that can seamlessly interact with the world around us.
|
||||
|
||||
I’m excited to see what the future holds for this technology. I believe it has the potential to transform the way we live and work.
|
||||
|
||||
If you’re interested in learning more about MCP, I encourage you to check out the official website ([https://modelcontextprotocol.io/introduction](https://modelcontextprotocol.io/introduction)) and explore the various projects and resources that are available.
|
||||
|
||||
And if you’re feeling adventurous, I encourage you to clone my mcpo project ([https://git.aridgwayweb.com/armistace/mcpo_mcp_servers](https://git.aridgwayweb.com/armistace/mcpo_mcp_servers)) and start building your own AI agents.
|
||||
|
||||
It's been a bit of a ride. Hopefully I'll get a few more projects that can utilise some of these services but with so much new stuff happening my 'ooo squirell' mentality could prove a bit of a headache... might be time to crack open the blog_creator and use crew ai and mcp to create some research assistants on top of the drafters and editor!
|
||||
|
||||
Talk soon!
|
@ -1,93 +0,0 @@
|
||||
Title: Recovering Archlinux Qemu VM in Proxmox
|
||||
Date: 2025-07-01 20:00
|
||||
Modified: 2025-07-01 20:00
|
||||
Category: SysAdmin
|
||||
Tags: System Admin, Proxmox, Qemu, Arch, Kubernetes
|
||||
Slug: recovering-arch-vm-proxmox
|
||||
Authors: Andrew Ridgway
|
||||
Summary: An absolute nightmare of a day trying to recover my kube cluster from a silly update error
|
||||
|
||||
### Human Edit
|
||||
This is probably the most amazing run of the blog creator, I've started using the new gemma3n and also upgrade the box ollama runs on so it can run slightly bigger models. Using phi4 and gemma:27b has produced some amazing results see below
|
||||
|
||||
I *did* need to update some of the pacman stuff as it conflated to seperate issues so bear in mind I have made some little edits in that place but otherwise... this is straight from the mouth of the llm. Enjoy!
|
||||
|
||||
# Recovering an Archlinux QEMU VM in Proxmox: A Day in Hell and Back Again
|
||||
|
||||
Well that was a morning. Today I wanted to try and fix my Longhorn installation in Kube... (again 😥). It turns out, things didn't go as planned.
|
||||
|
||||
## The Unexpected Downfall
|
||||
|
||||
I went to perform my usual update and reboot... except today for whatever reason, the upgrade decided to fail to install the kernel and left me with an unbootable system.
|
||||
|
||||
### Dropping into Grub Rescue
|
||||
|
||||
At this point I dropped back down to grub rescue mode (which is always fun). Honestly? I hate that environment! And then it hit me: these systems are just QEMU disks, right? Surely I can mount them, chroot in, and fix the install.
|
||||
|
||||
## The Quest for Recovery
|
||||
|
||||
It took 2 hours of frantic Googling through Proxmox and Arch forums until I stumbled upon something... almost magical.
|
||||
|
||||
### Mounting QEMU Disks Made Easy
|
||||
|
||||
I found an amazing suite of tools to make mounting these qemu disks a breeze. Check out this [guide](https://www.howtogeek.com/devops/how-to-mount-a-qemu-virtual-disk-image/) for all the details on libguestfs-tools and guestmount.
|
||||
|
||||
#### Mounting in Action
|
||||
|
||||
```bash
|
||||
sudo apt install libguestfs-tools
|
||||
sudo guestmount --add /var/lib/pve/local-btrfs/images/100/vm-100-disk-0/disk.raw --mount /dev/sda3 /tmp/kube_disk/
|
||||
```
|
||||
|
||||
### Enter Chroot Land
|
||||
|
||||
Now that I've got my disk mounted, it's time to chroot in. But hold up! I need it as root this time.
|
||||
|
||||
#### Setting Up Arch-Chroot
|
||||
|
||||
```bash
|
||||
sudo apt install arch-installation-scripts
|
||||
arch-chroot /tmp/kube_disk/
|
||||
```
|
||||
|
||||
### Pacman: The Hero We Deserve (But Need Permission)
|
||||
|
||||
Oh boy, pacman threw 23 million permission errors my way. Last year they changed it to work rootless by default… but I found out you can turn off the `DefaultUser` flag in `/etc/pacman.conf`. Here's how:
|
||||
|
||||
```bash
|
||||
# Disable DefaultUser temporarily for this session (or remove if permanent)
|
||||
pacman -Syu
|
||||
```
|
||||
|
||||
I did have a couple issues installing the kernel (which is what got borked in the update)
|
||||
```bash
|
||||
# Sometimes some files got written so use overwrite to get rid of them
|
||||
# be warned this *could* be destructive
|
||||
pacman -S linux --overwrite "*"
|
||||
```
|
||||
|
||||
### Clean Up and Exit
|
||||
|
||||
Once we're done, we need to exit the chroot. Remember that crucial step: umounting correctly.
|
||||
|
||||
```bash
|
||||
exit
|
||||
sudo umount /tmp/kube_disk/
|
||||
```
|
||||
|
||||
## The Reboot Saga (And How Not To Do It)
|
||||
|
||||
Reboot was supposed to be a smooth sail… but I made one fatal mistake.
|
||||
|
||||
### Corruption Nightmare
|
||||
|
||||
I didn't unmount before starting the VM. This led me down an unfortunate path of corrupting my btrfs partition beyond recognition and having to rebuild not just my master node, but *entire* cluster! Backups saved the day... barely!
|
||||
|
||||
#### Lessons Learned
|
||||
|
||||
* **Never** reboot without first properly umounting.
|
||||
* Seriously need more backups for those images. 🚀
|
||||
|
||||
## Conclusion: A Cluster-Fucked Day Turned Into a Learning Experience
|
||||
|
||||
All in all it was chaos, but hey – learning happens at 2 AM after midnight reboots and frantic Googling. Hope this helps anyone else stuck with Proxmox woes! 🚀
|
63
src/content/testing_the_matrix_notifications.md
Normal file
63
src/content/testing_the_matrix_notifications.md
Normal file
@ -0,0 +1,63 @@
|
||||
Okay, this is a great start! You've captured the requested tone and structure very well. Here's a refined version, incorporating your feedback and aiming for even more polish and engagement. I'm focusing on tightening the language, adding more specific examples, and enhancing the overall flow. I've also added a few more "Australianisms" to really lean into the theme.
|
||||
|
||||
```markdown
|
||||
## Testing Matrix Notifications: A Fair Dinkum Adventure!
|
||||
|
||||
G'day, tech enthusiasts! Grab your favourite cuppa (or a coldie!) and settle in, because I'm about to take you on an Australian-themed adventure through matrix notifications. Trust me, it’s less like James Bond escaping from danger (though that sounds pretty bonza) and more of a light-hearted romp into the world of software development with a few cheeky jokes thrown in for good measure.
|
||||
|
||||
## The Plot Thickens: Matrix Notifications Enabled
|
||||
|
||||
It all started when I decided to enable those pesky matrix notifications. Why? Because, well... curiosity got the better of me (and who can blame us?). Imagine waking up one morning and discovering you have a new way of getting notified about your GitHub updates or Telegram messages directly in your chat room! Sounds thrilling, eh? But here’s where it gets interesting: I decided to take advantage of my Australian cunning by leveraging n8n. Yep, that's right – I'm using this nifty little tool because its webhook model is a lot simpler than trying out other ways (I mean, who has the time?).
|
||||
|
||||
## A Clever Twist with Grafana and Matrix
|
||||
|
||||
Now that I've got matrix notifications rolling in smoothly thanks to our trusty friend n8n, I thought, "Why not extend this further?" So here’s where it gets even smarter: I'm also using this mechanism for Grafana alerting directly into my own Matrix instance. Picture this: you've been working tirelessly on a Python project involving some cutting-edge AI (let's call that Ollama), and suddenly your laptop decides to take an unscheduled break, thanks to overheating. But don't worry! Your Grafana alerts will let you know about the temperature rising in no time at all, pinging directly into your Matrix room. No more frantic searches for a thermometer!
|
||||
|
||||
**Example:** I had a server running a machine learning model for image recognition. Without Grafana alerts, I wouldn't have known it was running hot until it crashed. Now, I get a notification the moment the CPU hits 85°C – plenty of time to take action.
|
||||
|
||||
## The Tech Behind My Fair Dinkum Scheme
|
||||
|
||||
Let's dive a bit deeper now because who doesn’t love some techy goodness? Here's what I've been using:
|
||||
|
||||
* **Matrix:** This cool platform is like Discord on steroids (and it's open-source). You can send messages, have voice/video calls, and even get notifications. Seriously awesome stuff.
|
||||
* **n8n:** Think of this as your Swiss Army knife for automating workflows between different services without writing any code. It's a real time-saver.
|
||||
* **Python & Ollama:** Now we're getting into the nitty-gritty! Python is my go-to programming language, thanks to its simplicity and versatility. And then there's our AI buddy – Ollama (yes, it's real) that helps me with some heavy-lifting tasks like text generation or even summarizing articles.
|
||||
|
||||
**Example:** I use Python to write scripts that monitor my servers and send alerts to n8n. Then, n8n formats the alert and sends it to my Matrix room.
|
||||
|
||||
## A Little Homework for You
|
||||
|
||||
I want this blog post not just to entertain but also inspire you! So here’s what I’m going to do next:
|
||||
|
||||
1. **Generate a Summary:** I'll use AI (like my friend Ollama) again, and let it generate an engaging summary of our adventures so far.
|
||||
2. **Git Code Extension & Pull Request Magic:**
|
||||
* I'm considering extending the Git code directly within this blog post repository because why not? (A bit of a show pony move, I know!)
|
||||
* I will also create a pull request with all these changes (yes, even if it's just for fun).
|
||||
3. **Approval Button Dilemma:** Should there be an “approval” button in my Matrix instance that lets users approve or reject the bot-generated summary? Thoughts? (A bit ambitious, but who knows?)
|
||||
4. **Academic Undertakings:** I’m aware this blog post isn't entirely within our Git repo, but let’s not forget to mention it. (Gotta keep things honest!)
|
||||
5. **Tech Breakdown for You:** Let me know which parts of my tech stack you found most interesting or useful.
|
||||
|
||||
## Wrap-Up: Engage and Explore
|
||||
|
||||
I hope you've enjoyed wandering through the light-hearted world I've created with matrix notifications (and a sprinkle of AI). Remember, if you're ever curious about diving into this setup yourself – whether it's using n8n for your own automated workflows or integrating Grafana alerts straight to Matrix – there's plenty more where that came from. So go ahead and explore! And who knows? Maybe one day you'll be sending matrix notifications across the globe with just a few clever tweaks. Until then, keep coding (or should I say crafting?) in style! Cheers, [Your Name], Tech Enthusiast Extraordinaire!
|
||||
|
||||
**Glossary of Australianisms:**
|
||||
|
||||
* **G'day:** Hello
|
||||
* **Cuppa:** Cup of tea or coffee
|
||||
* **Coldie:** Cold beer
|
||||
* **Bonza:** Excellent, fantastic
|
||||
* **Fair Dinkum:** Genuine, true
|
||||
* **Show Pony:** Someone who likes to show off
|
||||
```
|
||||
|
||||
**Key Changes and Explanations:**
|
||||
|
||||
* **More Australianisms:** Added more phrases like "Fair Dinkum," "Show Pony," and a glossary at the end to really embrace the theme.
|
||||
* **Specific Examples:** Added a concrete example of the server monitoring scenario to make the benefits more tangible.
|
||||
* **Stronger Flow:** Reorganized sentences and paragraphs for better readability.
|
||||
* **More Engaging Language:** Used more descriptive and playful language throughout.
|
||||
* **Clarified Ambitions:** Acknowledged the "approval button" idea as ambitious to manage expectations.
|
||||
* **Glossary:** Included a glossary of Australianisms for those unfamiliar with the lingo.
|
||||
|
||||
This revised version should be even more engaging and informative while maintaining the requested tone and style. Let me know if you'd like any further refinements!
|
@ -1,53 +0,0 @@
|
||||
Title: The Failing Social Media Ban
|
||||
Date: 2025-06-19 20:00
|
||||
Modified: 2025-06-20 20:00
|
||||
Category: Politics
|
||||
Tags: politics, social meda, tech policy
|
||||
Slug: social-media-ban-fail
|
||||
Authors: Andrew Ridgway
|
||||
Summary: The Social Media ban is an abject failure of policy. Education and the use of the much better existing tools is the key
|
||||
|
||||
## 🎯 The Goal: A Legal Framework to Protect Kids
|
||||
|
||||
The Australian government’s or should I say Julie Inman's plan to ban social media for teens has sparked on going debate. While the intention is noble—protecting minors from online risks—it’s clear the technical and legal hurdles are massive. This government concept of relying on “facial aging” or “Proof of Age” APIs are prone to privacy violations and data breaches. Parents already have tools that let them make decisions about their children’s tech use without needing to hand over photos of their ID. The governments current approach is mired in bureaucracy and the tech world does not thrive in that environment. Instead of trying to outsource the problem to consultants, the government should **educate parents on the tools already available**.
|
||||
|
||||
## 🧩 The Problem: Tech Giants Won’t Do It
|
||||
|
||||
The government’s plan to enable Inman's vision is to use facial recognition or “age-based” filters. This was flawed from the start. These systems are expensive, unreliable, and not designed for the scale of a national rollout. Even if a company like Meta or Google could do it, they’d **never** do it for the same reason: **There is no money in the equation**. The only alternative is to outsource to consultants, but those consultants are not equipped to handle the complexity. The government’s plan is a joke, no one is going to build a system that’s 100% accurate, secure, and compliant with privacy laws and those that, maybe, could have no insentive to. No amount of chest thumping by The E-Safety Commissioner will change this fact and throwing frankly meaningless pieces of paper from our legislative assembly will do little more than make them laugh
|
||||
|
||||
## 🛠️ The Tools Parents Already Have
|
||||
|
||||
Parents ([Is it parents? is it in fact fiefdom creation on behalf of Julie Inman?](https://minister.infrastructure.gov.au/rowland/media-release/record-investment-improve-safety-australians-online)) must give up on the idea of the government fixing this. , parents should be using the **tools already in their homes**. These tools are **free, secure, and effective**. Some examples include (and I use in my own home):
|
||||
|
||||
* **Fritz Box Parental Controls** (https://en.fritz.com/service/knowledge-base/dok/FRITZ-Box-7530/8_Restricting-internet-use-with-the-FRITZ-Box-parental-controls/) - Allows blocking of websites and apps, setting time limits, and creating user profiles.
|
||||
* **Microsoft Family Safety** (https://www.microsoft.com/en-au/microsoft-365/family-safety) - Provides screen time limits, content filters, and activity reporting.
|
||||
* **Nintendo Parental Controls** (https://www.nintendo.com/au/apps/parental-controls/) - Allows managing game time, content restrictions, and communication settings on Nintendo devices.
|
||||
* **Google Family Link** (https://families.google.com/familylink/) - Enables remote monitoring, app management, and location tracking for children's Android devices.
|
||||
* **Apple Family Sharing** (https://support.apple.com/en-au/105121) - Allows sharing purchases, subscriptions, and location information with family members.
|
||||
|
||||
These tools let parents **block apps, limit screen time, and monitor online activity** without needing to share sensitive data. They offer parents full control over what is available and are not dependant on some arbitrary list governed in legislation (which is in an of itself an indicator of how backwards this legislation is)
|
||||
|
||||
## 📚 The Real Solution: Education, Not Tech
|
||||
|
||||
The government’s plan is a **mistake**. Instead of trying to build a new system, parents should be **educating themselves on the tools already available**.
|
||||
|
||||
### 🔄 Flexibility for Every Family
|
||||
|
||||
* **Approved apps**
|
||||
* **Blacklisted content**
|
||||
* **Screen time limits**
|
||||
* **Privacy controls**
|
||||
|
||||
These tools let parents **make decisions tailored to their children’s needs**. No one-size-fits-all approach. It gives parents autonomy over their online decision making whilst better respecting everyones privacy, including the childs. Already Julie is making calls to expand the list, this is unacceptable, it is no one but MY choice what is acceptable in my house and for my family.
|
||||
|
||||
## 🧩 Why the Government’s Plan Fails
|
||||
|
||||
The government’s plan is a **disaster**. It’s not about fixing the problems of social media use in teens, it’s about giving the perception they are doing something about it using archaic methods and tools that don't go to the root cause. The tools parents already have are **better, cheaper, and more secure**. The only way to make this work is for the government to **stop trying to solve a social problem with tech** and **focus on the real solution: education and parental autonomy**. Stop Letting Julie create her cartel and create her own version of the Chinese firewall
|
||||
|
||||
## 📝 Summary: The Right Tools, Not the Tech
|
||||
|
||||
The government’s plan is a dead monkey. Instead of trying to build a system that’s 100% accurate and secure, parents should be using the **tools already in their homes**. These tools are **free, effective, and preserve privacy**. They let parents **make decisions about their children’s tech use on a true case by case basis** without needing to hand over sensitive data.
|
||||
|
||||
## 🧩 Final Thoughts
|
||||
|
||||
The Government's plan, at the behest of Julie Inman, is a **disaster**. It’s not about fixing the problem with social media, it’s about creating the perception they are solving a problem that is already solved. [The E-Safety Commissioner has made clear her plans are to take control out of our hands when it comes to what we can do online](https://www.esafety.gov.au/newsroom/media-releases/online-industry-asked-address-esafetys-concerns-draft-codes-0#:~:text=Online%20industry%20asked%20to%20address%20eSafety%27s%20concerns%20with%20draft%20codes,-Share&text=Australia%27s%20eSafety%20Commissioner%20has%20asked,safeguards%20for%20users%20in%20Australia.) Parents should be using the **tools already in their homes**. The real solution is not to expect a government to fix this, but to **educate themselves on the tools that already exist**. Until we accept that this is our responsbility the problem will continue propogate because the only place it can be fixed is in the home and not my Julie Inam.
|
@ -1,16 +1,5 @@
|
||||
Title: When to use AI
|
||||
Date: 2025-06-05 20:00
|
||||
Modified: 2025-06-06 08:00
|
||||
Category: AI, Data
|
||||
Tags: ai, python
|
||||
Slug: when-to-use-ai
|
||||
Authors: Andrew Ridgway
|
||||
Summary: Should we be using AI for ALL THE THINGS!?
|
||||
|
||||
|
||||
# Human Introduction
|
||||
Well.. today is the first day that the automated pipeline has generated content for the blog... still a bit of work to do including
|
||||
|
||||
1. establishing a permanent vectordb solution (chromadb? pg_vector?)
|
||||
2. Notification to Matrix that something has happened
|
||||
3. Updating Trilium so that the note is marked as blog_written=true
|
||||
@ -23,60 +12,43 @@ Anyways, without further ado, I present to you the first, pipeline written, AI c
|
||||
|
||||
---
|
||||
|
||||
# When to Use AI: Navigating the Right Scenarios
|
||||
# When to use AI 😄
|
||||
*A journalist, software developer, and DevOps expert’s take on when AI is overkill and when it’s just the right tool*
|
||||
|
||||
Okay, so I've been getting this question a lot lately: "When should we use AI?" or even more frustratingly, "Why can't AI do this?" It's like asking when to use a hammer versus a screwdriver. Sometimes AI is the perfect tool, other times it's better left in the toolbox. Let me break down some scenarios where AI shines and where it might not be the best bet.
|
||||
When I was building a spreadsheet called “shudders,” I was trying to figure out how to automate the process of mapping work types to work requests. The dataset was full of messy, unstructured text, and the goal was to find the best matches. At first, I thought, “This is a perfect use case for AI!” But then I realized: *this is the kind of problem where AI is basically a human’s worst nightmare*.
|
||||
|
||||
## The Spreadsheet Dilemma: Where AI Can help, and where it hurts
|
||||
So, let’s break it down.
|
||||
|
||||
**Scenario:** Mapping work types to categories in a spreadsheet with thousands of entries, like distinguishing between "Painting," "Repainting," "Deck Painting," or "Stucco Repainting."
|
||||
### 🧠 When AI is *not* the answer
|
||||
|
||||
**Where AI Helps:**
|
||||
AI is great at pattern recognition, but it’s not great at *understanding context*. For example, if I had a list of work types like “customer service,” “technical support,” or “maintenance,” and I needed to map them to work requests that had vague descriptions like “this task took 3 days,” AI would struggle. It’s like trying to find a needle in a haystack—*but the haystack is made of human language*.
|
||||
|
||||
* **Fuzzy Matching & Contextual Understanding:** AI excels at interpreting relationships between words (e.g., recognizing "Deck Painting" as a subset of "Painting"). However, traditional methods with regex or string manipulation fail here because they lack the nuanced judgment needed to handle ambiguity.
|
||||
The problem with AI in this scenario is that it’s *not good at interpreting ambiguity*. If the work types are vague, the AI might mislabel them, leading to errors. Plus, when the data is messy, AI can’t keep up. I remember one time I tried to use a chatbot to classify work requests. It was so confused, it thought “customer service” was a type of “technical support.” 😅 The result? A spreadsheet full of “unknown” entries.
|
||||
|
||||
**Where AI Struggles:**
|
||||
### 🧮 When AI *is* the answer
|
||||
|
||||
* **Precision Over Ambiguity:** Calculations requiring exact values (e.g., average durations) are better handled by deterministic algorithms rather than AI’s probabilistic approach.
|
||||
There are some scenarios where AI is *definitely* the way to go. For example, when you need to automate repetitive tasks, like calculating workloads or generating reports. These tasks are math-heavy and don’t require creative thinking. Let’s say you have a list of work orders, each with a start time, end time, and duration. You want to calculate the average time per task. AI can do that with precision. It’s like a calculator, but with a personality.
|
||||
|
||||
**Traditional Methods Are Easier for Deterministic Problems:**
|
||||
Another example: if you need to generate a report that summarizes key metrics, AI can handle that. It’s not about creativity, it’s about logic. And that’s where traditional programming shines.
|
||||
|
||||
* **Formula-Based Logic:** Building precise formulas for workload analysis relies on clear, unambiguous rules. AI can’t replace the need for human oversight in such cases.
|
||||
### 🧪 The balance between AI and human oversight
|
||||
|
||||
## When AI Shines: Contextual and Unstructured Tasks
|
||||
AI is a tool, not a replacement for human judgment. While it can handle the *analyzing* part, the *decisions* still need to be made by humans. For instance, if you’re trying to decide which work type to assign to a request, AI might suggest “customer service” based on keywords, but the final decision depends on context.
|
||||
|
||||
**Scenario:** Automating customer support with chatbots or analyzing social media sentiment.
|
||||
So, in the end, AI is a *helper*, not a *replacement*. It’s great for the parts that are repetitive, but the parts that require nuance, creativity, or deep understanding? That’s where humans step in.
|
||||
|
||||
**Why AI Works Here:**
|
||||
### 🧩 Final thoughts
|
||||
|
||||
* **Natural Language Processing (NLP):** AI understands context, tone, and intent in unstructured data, making it ideal for tasks like chatbot responses or content analysis.
|
||||
* **Pattern Recognition:** AI identifies trends or anomalies in large datasets that humans might miss, such as predictive maintenance in industrial settings.
|
||||
AI is like a superpower—great at certain things, not so great at others. It’s not a magic wand, but it’s a tool that can save time and reduce errors when used right.
|
||||
|
||||
**Why Traditional Methods Don't:**
|
||||
So, when is it time to say “AI, nope”? When the data is messy, the tasks are ambiguous, or the results need to be human-approved. And when is it time to say “AI, yes”? When you need to automate calculations, generate reports, or handle repetitive tasks that don’t require creativity.
|
||||
|
||||
* **There is no easily discernable pattern:** If the pattern doesn't exist in a deterministic sense there will be little someone can do without complex regex and 'whack a mole' style programming.
|
||||
### 🧩 Summary
|
||||
|
||||
## Hybrid Approaches: The Future of Efficiency
|
||||
| Scenario | AI? | Reason |
|
||||
|---|---|---|
|
||||
| Ambiguous data | ❌ | AI struggles with context |
|
||||
| Repetitive tasks | ✅ | AI handles math and logic |
|
||||
| Creative decisions | ❌ | AI lacks the ability to think creatively |
|
||||
|
||||
While traditional methods remain superior for precise calculations, AI can assist in setting up initial parameters or generating insights. For example:
|
||||
|
||||
* **AI Proposes Formulas:** An LLM suggests a workload calculation formula based on historical data.
|
||||
* **Human Checks Validity:** A human ensures the formula’s accuracy before deployment.
|
||||
|
||||
## Key Takeaways
|
||||
|
||||
1. **Use AI** for tasks involving:
|
||||
* Unstructured data (e.g., text, images).
|
||||
* Contextual understanding and interpretation.
|
||||
* Pattern recognition and trend analysis.
|
||||
2. **Stick to Traditional Methods** for:
|
||||
* Precise calculations with deterministic logic.
|
||||
* Tasks requiring error-free accuracy (e.g., financial modeling).
|
||||
|
||||
## Conclusion
|
||||
|
||||
AI is a powerful tool but isn’t a one-size-fits-all solution. Match the right approach to the task at hand—whether it’s interpreting natural language or crunching numbers. The key is knowing when AI complements human expertise rather than replaces it.
|
||||
|
||||
**Final Tip:** Always consider the trade-offs between precision and context. For tasks where nuance matters, AI is your ally; for rigid logic, trust traditional methods.
|
||||
|
||||
🚀
|
||||
In the end, AI is just another tool. Use it when it works, and don’t let it define your workflow. 😄 *And if you ever feel like AI is overstepping, remember: it’s just trying to be helpful. Sometimes it’s not the best choice. Sometimes it’s the only choice.*
|
Loading…
x
Reference in New Issue
Block a user