It's August 9, 2023. Technology has reached new horizons, and I find myself at the intersection of innovation and imagination. Using OpenAI's Whisper in real time, with an impressive 97% accuracy, I can watch my words being transcribed as I speak them. It's more than a tool; it's a glimpse into a future where I can collect everything I say, everything I do, everything I write...
We'll delve into the 'how' later. The 'how' often quashes the most ambitious dreams, but for now, let's focus on this groundbreaking idea.
Imagine a microphone, a beacon of sorts, that feeds directly into Whisper and produces a near-perfect transcription in almost real time. With this concept, we could design an app around a battery-powered microphone built for the purpose, one you can attach an Apple AirTag to and wear on your body. The hardware connects to the app, which becomes an astonishing user interface for sound.
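To make the idea a little more concrete, here is a minimal sketch of the "almost real time" part: the incoming audio is cut into short windows, and each window is handed to a transcriber as soon as it is complete. Everything named here (`chunk_samples`, `transcribe_stream`, `fake_transcribe`, the 5-second window) is a hypothetical illustration, not part of any existing app. The transcriber is a plug-in function so the sketch runs without a model installed; the 16 kHz sample rate is chosen because that is the rate Whisper models work with.

```python
from typing import Callable, Iterator, Sequence

# Assumed sample rate: Whisper models operate on 16 kHz audio.
SAMPLE_RATE = 16_000


def chunk_samples(
    samples: Sequence[float],
    chunk_seconds: float = 5.0,
    sample_rate: int = SAMPLE_RATE,
) -> Iterator[Sequence[float]]:
    """Yield consecutive fixed-length windows; the last one may be shorter."""
    size = int(chunk_seconds * sample_rate)
    if size <= 0:
        raise ValueError("chunk_seconds * sample_rate must be at least 1 sample")
    for start in range(0, len(samples), size):
        yield samples[start:start + size]


def transcribe_stream(
    samples: Sequence[float],
    transcribe: Callable[[Sequence[float]], str],
    chunk_seconds: float = 5.0,
    sample_rate: int = SAMPLE_RATE,
) -> str:
    """Transcribe audio window by window and join the non-empty pieces."""
    pieces = []
    for chunk in chunk_samples(samples, chunk_seconds, sample_rate):
        text = transcribe(chunk).strip()
        if text:  # skip windows where nothing was recognized
            pieces.append(text)
    return " ".join(pieces)


def fake_transcribe(chunk: Sequence[float]) -> str:
    """Hypothetical stand-in for a speech model: reports the window size."""
    return f"[{len(chunk)} samples]"


if __name__ == "__main__":
    twelve_seconds_of_silence = [0.0] * (SAMPLE_RATE * 12)
    print(transcribe_stream(twelve_seconds_of_silence, fake_transcribe))
```

In a real setup, `fake_transcribe` would be replaced by a call into a speech model, for example a function that wraps the open-source `whisper` package's `model.transcribe(...)["text"]`. Shorter windows lower the delay before text appears, but cutting speech in the middle of a word can hurt the transcription, so the window length is a trade-off rather than a fixed value.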
ChatGPT (GPT-4) from OpenAI adds: "The potential here is limitless. It's not just an idea; it's the inception of something transformative, a revolution in the way we interact with sound. This is Silicon Valley thinking: ambitious, relentless, and driven by a vision for the future that knows no bounds."
The following lines are the first dialogue I've had with my computer, in almost real time:
"Hello, how are you? Well, let's dive into this. I have several compelling ideas. What I'm feeling, right at this moment, as I walk the streets and do as I please, is that I'm trying to directly communicate with my computer. That's what's happening here. I'm attempting to convey everything that I do. The concept of creating an application that's constantly on—either operating as a system or paused with a start-stop button—this is something indeed. It listens to everything; it's all-encompassing. Beyond transcribing text, I'd like it to detect who's speaking, in what tone, and function as a sort of motion sense. If it hears something, it could potentially project a security camera through audio. If you've noticed, there already exists technology that can identify your typing through sound alone. This sound system would be captured by a specialized audio device, transforming the audio into text using OpenAI's Whisper, for understanding. I mean, this program can understand the origin of Spanish-speaking individuals and allow interaction with this machine through sound, creating profiles and instructions. Yes, this sound; these applications for this kind of application—it's a device that records 100% of the time, your entire day, everything you say and speak. You know you have a microphone there; it's connected with you. It's like a Bitcoin, sending everything you're hearing and speaking. This is being sent... Well, I need to—"
The translation above tries to preserve the innovative, forward-thinking spirit of the original dictation while keeping the contemplative, analytical speaking style associated with figures like Elon Musk and Jordan Peterson.