Recent ML Projects

Rebooting, again

November 27, 2022 259 words 2 minutes

Contents

2 months ago, I was halfway into publishing a series on my TTS project. Today, the site for that project is defunct, and no new posts have been made about it.

What happened?

Well¹, I’ve been working on a lot of different projects. Some of them already exist on my Github, but only as code, without elaboration or presentation. So I’m going to dedicate the next few weeks of my time to talking about them on this space.

But before I get to writing those longer posts, here’s a quick summary of what I’ve been working on:

Project	Status	Description	Publishing
Stable Diffusion related	✅	Embeddings, Dreambooth, Hypernetworks, etc	❌
Custom object detection model	Dataset done	Applying object detection models like YOLOv* to custom made datasets	❔
voice2img for Stable Diffusion	Prototype published	Using Automatic Speech Recognition (ASR) models like Whisper to generate images by vocal commands	✅
Copilot@home	❌	Open source alternative plugin for Copilot. Cancelled because a much better effort already exists	✅
GPT-J text editor	✅ in progress	A webapp text editor that uses GPT-J for autocomplete	✅

I’ve also had other ideas that I’ve yet to write any code for, like:

Idea	Description	Likelihood
Better TTS models	Disco Narrator but actually good	0.01
More uses of Whisper	Adding to GPT-J text editor (wasm? API?). Using as a phone app	Maybe
Using YOLO at home	Mostly to notify for presence of visitors	Maybe
Research thing	Related to language models	Probably

Also, my computer died. I have a blog post about it I’ve yet to publish too.