8 Commits

Author SHA1 Message Date
michael.borak
de504fbcb4 feat: improve audio exclusion with smart matching and reliability fixes 2026-01-25 16:37:58 +01:00
michael.borak
69dc6b8fac chore: bump version to 1.2.1 for release 2026-01-24 14:33:35 +01:00
michael.borak
291f4950e8 docs: add 3CX to supported applications list 2026-01-24 14:29:00 +01:00
michael.borak
9a65f42f51 feat: refine meeting auto-start, silence timeout (25s) and improve transcription logging 2026-01-24 14:16:55 +01:00
michael.borak
a3e4fa4ec7 Release 1.2.0: Remove backup encryption and switch to JSON-only backups with history support 2026-01-24 13:10:18 +01:00
michael.borak
897f2ec0c2 fix(recorder): Resolve infinite loop stale closure & reset status text on discard 2026-01-24 01:47:47 +01:00
michael.borak
e24b448c6c docs: Update README for v1.2.0 (Native Audio & Auto-Loop) 2026-01-24 01:40:11 +01:00
michael.borak
4e9a1fd038 feat(v1.2.0): Final Release - Native Audio, Smart VAD, Auto-Loop & Quality Fixes
- Implemented standard 48kHz audio pipeline to fix sample rate mismatch/distortion
- Added Native System Audio (ScreenCaptureKit) support
- Implemented Smart VAD (Voice Activity Detection) with Auto-Start on valid audio
- Added Auto-Loop: Automatically re-arms recording after stop
- Added Empty Guard: Prevents transcribing silent recordings (< 20s empty)
- Increased Pre-Roll buffer to 3.0s to prevent cut-off speech
- Fixed clipping with clamped audio mixing
2026-01-24 01:35:09 +01:00
17 changed files with 3469 additions and 638 deletions

188
README.md
View File

@@ -1,165 +1,94 @@
# Hearbit AI 🦉🎙️ # Hearbit AI 🦉🎙️
**Hearbit AI** is your professional meeting assistant for macOS. It records both your microphone and system audio (e.g., Teams, Zoom), transcribes it with high precision using Infomaniak's Whisper API, and generates intelligent, structured summaries. **Hearbit AI** is your professional meeting assistant for macOS. It records both your microphone and system audio (e.g., Teams, Zoom, 3CX, Talk), transcribes it with high precision using Infomaniak's Whisper API, and generates intelligent, structured summaries.
![App Icon](src-tauri/icons/128x128@2x.png) ![App Icon](src-tauri/icons/128x128@2x.png)
## ✨ Features ## ✨ New in v1.2.x
* **🎙️ Dual-Channel Recording**: seamlessly capture your voice and meeting audio from apps like Microsoft Teams, Zoom, or Google Meet. ### v1.2.3: App Blocking & Selective Recording 🚫🎧
* **📁 Import Audio Files**: Upload existing recordings (MP3, MP4, WAV, M4A, FLAC, OGG, AAC, WMA) for transcription and summarization. * **🔇 App Blocking (Selective Recording)**: You can now exclude specific applications from being captured by the system audio recording.
* **⏱️ Long Meeting Support**: Record meetings up to 2+ hours with automatic MP3 conversion and chunking. * **🧠 Smart Matching**: Uses prefix-based matching (e.g., `com.apple.Safari`) to reliably block main apps AND their background/helper processes (e.g., "Safari Web Content").
* **🎵 Smart Auto-Stop**: * **🚀 Quick Select Presets**: Quickly block common background apps like **Spotify, Apple Music, VLC, Safari, or Chrome** with one click.
* **Universal Auto-Stop**: Automatically stops recording after **20 seconds of silence** in ALL modes (Voice Memo & Meeting). * **🔍 Live App List**: Filter any currently running application directly from the new **Apps** tab in Settings.
* **Noise Filtering**: Enhanced VAD (Voice Activity Detection) ignores background noise and keyboard typing, only triggering on clear speech. * **🔄 Robust Auto-Refresh**: The capture filter automatically updates every 5 seconds to account for newly opened browsers or media players.
* **📅 Microsoft 365 Integration**:
* **Upcoming Meetings**: View your daily schedule and join with **one click**.
* **Meeting Details**: View full agenda and **invited attendee status** (Accepted/Declined).
* **💾 Persistent History**: Automatically saves all transcripts and summaries to disk. Search and review past meetings anytime.
* **✉️ Email Summaries**: Send professional, formatted HTML summaries (with preview) directly to attendees via your own SMTP server.
* **🧠 Powered by Infomaniak AI**:
* **Precision Transcription**: Standard-compliant formatting with **second-by-second timestamps**.
* **Smart Summaries**: Uses **Smart Templates** to automatically select the best format (Business Protocol vs. 1:1) based on meeting content.
* **🔇 Smart VAD**: Automatically filters out silence and stops recording when you stop talking.
* **🎨 White-Labeling**: Upload your **custom company logo** in Settings to brand the application.
* **🔒 Privacy-First**: Data is processed securely via your own Infomaniak API keys.
--- ### v1.2.0: Native Audio & Auto-Loop 🦉
* **🎧 Native System Audio**: No more BlackHole driver needed! Captures Teams, Zoom, 3CX, and Talk audio directly via macOS ScreenCaptureKit.
* **🔁 Auto-Loop (Standby Mode)**: Automatically "re-arms" after a call finishes.
* **⚡ Smart VAD & Pre-Roll**: 3-second pre-roll and noise filtering.
* **🛡️ Empty Audio Guard**: Automatically discards silent recordings.
* **💾 Daily Security Backups**: Automatic JSON history backups every 24 hours.
## 🚀 Getting Started ## 🚀 Key Features
### Required * **🎙️ Dual-Channel Recording**: Seamlessly capture your voice and meeting audio.
* **📁 Import Audio Files**: Upload existing recordings (MP3, WAV, M4A, etc.).
* **macOS** (tested on macOS Monterey and later) * **⏱️ Long Meeting Support**: Handles meetings 2+ hours with automatic chunking.
* **BlackHole 2ch Driver** ([Download here](https://existential.audio/blackhole/)) * **📅 Microsoft 365 Integration**: View upcoming meetings and join with one click.
* **MANDATORY** for system audio capture (MS Teams, Zoom, etc.) * **💾 Persistent History**: Automatically saves all transcripts and summaries locally.
* Without this, you can only record microphone input * **✉️ Email Summaries**: Send formatted HTML summaries via your own SMTP server.
* **ffmpeg** for audio processing * **🎨 White-Labeling**: Upload your custom company logo.
```bash
brew install ffmpeg
```
* **Infomaniak AI Account**: You need an API Key and Product ID from the [Infomaniak Developer Portal](https://manager.infomaniak.com/).
### 2. Installation
1. Download the latest `.dmg` file from the [Releases page](#).
2. Open the `.dmg` and drag **Hearbit AI** to your Applications folder.
3. Launch the app.
---
## 🎧 Recording System Audio (Teams, Zoom, etc.)
We've made this easy! **Note: You must have the BlackHole driver installed.**
1. **Create "Hearbit Audio" Device**:
* Open the app and select **Meeting** mode.
* If you don't have the device yet, click the **"🪄 Create Hearbit Audio Device"** button.
* This creates a specialized "Multi-Output Device" that routes audio to both your headphones/speakers AND the app.
2. **Configure Teams / Zoom / Webex**:
* **Speaker / Output**: Change this to **Hearbit Audio**.
* *Why?* This ensures the audio goes to the recording app *and* your ears.
* **Microphone / Input**: Leave this as your normal microphone (e.g., MacBook Pro Mic).
* *Note:* Do **not** select Hearbit Audio as your microphone in Teams.
3. **Start Recording**:
* In Hearbit AI, ensure **Hearbit Audio** is selected as the input.
--- ---
## 🛠️ Usage Guide ## 🛠️ Usage Guide
1. **Configuration**: ### 1. Installation
* Click the **Settings** (gear icon). 1. Download the latest `.dmg` file from the [Releases page](#).
* Enter your **Infomaniak API Key** and **Product ID**. 2. Open the `.dmg` and drag **Hearbit AI** to your Applications folder.
3. **Permission Check**: On first launch, grant "Screen Recording" permission (required for capturing System Audio).
2. **Connect M365 (Optional)**: ### 2. Configuration
* Copy the **Application (client) ID**. 1. Click **Settings** (gear icon).
* Click the **Meetings** tab. 2. Enter your **Infomaniak API Key** and **Product ID**.
* Enter your **Client ID** and click "Connect". 3. (Optional) Configure **SMTP** for email sending and **Microsoft 365** for calendar integration.
* Proceed with MS login.
* View your upcoming meetings.
3. **Recording**: ### 3. Recording a Meeting
* Choose your **Template** (e.g., "Meeting Protocol"). 1. **Select Mode**: Choose "Meeting" (captures Mic + System) or "Voice Memo" (Mic only).
* Select your **Input Device**. 2. **Auto-Start Logic**:
* Click **Start Recording**. - **Meeting Mode**: Triggers only when the call actually starts (system audio detected).
- **Voice Memo**: Triggers immediately when you start speaking.
3. **Standby**: Click "Standby (Auto-Start)". The app waits silently.
4. **Join Call**: Join your Teams/Zoom call.
5. **Trigger**: Hearbit starts recording automatically based on the selected mode.
6. **Finish**: When the call ends (silence > 25s), Hearbit stops, transcribes, summarizes, and **goes back to Standby** for the next call.
4. **Processing**: ### 4. Optimal Setup (MS Teams / Zoom / 3CX)
* Click **Stop** when finished. For the best experience without changing any software settings:
* The app will transcribe the audio (with timestamps!) and generate a summary based on your selected template. * **Hearbit App**: Select your **real microphone** (e.g., "MacBook Mic" or Headset).
* You will be automatically taken to the **Transcription** tab to review the results. * **Meeting Software**: Use your standard output (Speakers/Headset).
* *How it works*: Hearbit captures your voice via mic and the other side via macOS System Audio Capture automatically.
--- *Note: If you choose "Hearbit Audio" (Aggregate Device) in the app, you MUST set your Teams/Zoom/3CX speaker output to "Hearbit Audio" as well.*
## 🎨 Custom Branding (White-Labeling) ### 5. Customizing Prompts
You can create custom AI templates in Settings -> Prompts. Example:
You can replace the default Livtec logo with your own company branding: * **"Sales Call"**: Focus on budget, timeline, and decision makers.
* **"Daily Standup"**: Extract blockers and next steps.
1. Go to **Settings** (gear icon) → **Branding**. * **"General Protocol"**: Standard meeting minutes.
2. Click **Upload Logo**.
3. Select your file (PNG, JPG, SVG).
4. The content changes immediately across the app.
5. *Tip*: Use a transparent PNG for best results.
---
## 📧 Advanced Email Templates
The email system supports **full HTML & JavaScript** templates. This allows for dynamic dashboards, charts, and interactive reports.
**How to use:**
1. Go to **Settings** → **Email**.
2. Create a new template.
3. Use `{{summary}}` as a placeholder for the raw AI JSON output.
4. In your HTML/Script, parse it:
```javascript
const reportData = {{summary}};
// Now you can use reportData.todos, reportData.updates, etc.
```
5. Use `{{date}}` for the current date and `{{subject}}` for the meeting title.
*Example*: Create a "Daily Standup Dashboard" that visualizes Blocker/Updates/Todos in a grid layout.
--- ---
## ❓ Troubleshooting ## ❓ Troubleshooting
### "Hearbit AI is damaged and can't be opened" ### "Hearbit AI is damaged and can't be opened"
This is a standard macOS warning for apps not signed with an Apple Developer Certificate. To fix it: If macOS blocks the app because it's not notarized:
1. Open **Terminal**. 1. Open **Terminal**.
2. Run the following command: 2. Run: `sudo xattr -cr /Applications/Hearbit\ AI.app`
```bash 3. Enter your password and try again.
sudo xattr -cr /Applications/Hearbit\ AI.app
```
3. Enter your password.
4. Open the app again.
### Long Meetings (> 1 hour) ### Audio cuts off at the start?
v1.2.0 includes a **3-second buffer**. The Meeting mode now uses a more sensitive trigger (0.005 energy) to catch even quiet participants.
**Automatic Handling**: The app automatically handles long recordings: ### "Batch processing failed"
- **MP3 Conversion**: All recordings are converted to MP3 (64kbps) for 10x compression This means the audio was empty or too short. Check the **Logs** tab for detailed error messages from Infomaniak. The most common cause is selecting the wrong input device or a lack of Screen Recording permissions.
- **Chunking**: Files ≥18 MB are automatically split into 10-minute segments
- **Processing**: Each segment is transcribed separately and merged with timestamps
**Example**: A 2-hour meeting:
1. Records as WAV (~120 MB)
2. Converts to MP3 (~12 MB)
3. Stays under limit → No chunking needed!
**Very long meetings** (e.g., all-day workshops):
- Automatically chunks into segments
- Shows progress: "Processing chunk 1/15..."
- Merges all transcriptions seamlessly
### No Audio / Can't Hear Meeting Participants
--- ---
## 👨‍💻 Development ## 👨‍💻 Development
Built with **Tauri**, **React**, and **TypeScript**. Built with **Tauri v2**, **React**, and **TypeScript**.
### Setup ### Setup
```bash ```bash
@@ -175,7 +104,6 @@ npm run tauri dev
```bash ```bash
npm run tauri build npm run tauri build
``` ```
*The build artifact will be located in `src-tauri/target/release/bundle/dmg/*`*
--- ---

22
RELEASE_NOTES_1.2.0.md Normal file
View File

@@ -0,0 +1,22 @@
# Release Notes - Hearbit AI v1.2.0
## 🚀 Neuheiten
### Native System Audio (ScreenCaptureKit)
Wir haben die Audio-Engine komplett erneuert!
- **Keine Treiber mehr:** Sie müssen BlackHole nicht mehr installieren.
- **Funktioniert überall:** Egal ob Teams, Zoom, Webex, Nextcloud Talk oder 3CX die App hört jetzt nativ mit.
- **Berechtigung:** Die App fragt beim ersten Start nach der "Bildschirmaufnahme"-Berechtigung. Dies ist der moderne Apple-Standard für Audio-Capture.
### Smart VAD (Intelligente Spracherkennung)
- **Ignoriert Musik:** Die App unterscheidet jetzt präzise zwischen menschlicher Sprache und Musik.
- **Wartebereich-Filter:** Musik im Teams-Wartebereich wird nicht mehr aufgenommen. Die Aufnahme startet erst, wenn wirklich gesprochen wird.
### UI Verbesserungen
- **Neuer Setup-Flow:** Das komplizierte Audio-Setup wurde entfernt.
- **Freie Wahl:** Nutzen Sie jedes Mikrofon, das Sie möchten.
## 🛠️ Technische Änderungen
- Update auf `screencapturekit` Framework (macOS 12.3+ erforderlich).
- BlackHole-Abhängigkeit entfernt.
- Audio-Mixing direkt in der App.

4
package-lock.json generated
View File

@@ -1,12 +1,12 @@
{ {
"name": "hearbit-ai", "name": "hearbit-ai",
"version": "0.1.0", "version": "1.2.2",
"lockfileVersion": 3, "lockfileVersion": 3,
"requires": true, "requires": true,
"packages": { "packages": {
"": { "": {
"name": "hearbit-ai", "name": "hearbit-ai",
"version": "0.1.0", "version": "1.2.2",
"dependencies": { "dependencies": {
"@tailwindcss/postcss": "^4.1.18", "@tailwindcss/postcss": "^4.1.18",
"@tauri-apps/api": "^2", "@tauri-apps/api": "^2",

View File

@@ -1,7 +1,7 @@
{ {
"name": "hearbit-ai", "name": "hearbit-ai",
"private": true, "private": true,
"version": "1.1.1", "version": "1.2.3",
"type": "module", "type": "module",
"scripts": { "scripts": {
"dev": "vite", "dev": "vite",

113
src-tauri/Cargo.lock generated
View File

@@ -347,6 +347,12 @@ dependencies = [
"wyz", "wyz",
] ]
[[package]]
name = "block"
version = "0.1.6"
source = "registry+https://github.com/rust-lang/crates.io-index"
checksum = "0d8c1fef690941d3e7788d328517591fecc684c084084702d6ff1641e993699a"
[[package]] [[package]]
name = "block-buffer" name = "block-buffer"
version = "0.10.4" version = "0.10.4"
@@ -623,6 +629,35 @@ dependencies = [
"stacker", "stacker",
] ]
[[package]]
name = "cocoa"
version = "0.26.1"
source = "registry+https://github.com/rust-lang/crates.io-index"
checksum = "ad36507aeb7e16159dfe68db81ccc27571c3ccd4b76fb2fb72fc59e7a4b1b64c"
dependencies = [
"bitflags 2.10.0",
"block",
"cocoa-foundation",
"core-foundation 0.10.1",
"core-graphics",
"foreign-types 0.5.0",
"libc",
"objc",
]
[[package]]
name = "cocoa-foundation"
version = "0.2.1"
source = "registry+https://github.com/rust-lang/crates.io-index"
checksum = "81411967c50ee9a1fc11365f8c585f863a22a9697c89239c452292c40ba79b0d"
dependencies = [
"bitflags 2.10.0",
"block",
"core-foundation 0.10.1",
"core-graphics-types",
"objc",
]
[[package]] [[package]]
name = "combine" name = "combine"
version = "4.6.7" version = "4.6.7"
@@ -1739,16 +1774,21 @@ checksum = "841d1cc9bed7f9236f321df977030373f4a4163ae1a7dbfe1a51a2c1a51d9100"
[[package]] [[package]]
name = "hearbit-ai" name = "hearbit-ai"
version = "0.1.2" version = "1.2.1"
dependencies = [ dependencies = [
"base64 0.22.1", "base64 0.22.1",
"chrono", "chrono",
"cocoa",
"core-foundation 0.10.1",
"cpal", "cpal",
"hound", "hound",
"lettre", "lettre",
"oauth2", "oauth2",
"objc",
"reqwest 0.11.27", "reqwest 0.11.27",
"rubato", "rubato",
"screencapturekit",
"screencapturekit-sys",
"serde", "serde",
"serde_json", "serde_json",
"tauri", "tauri",
@@ -2425,6 +2465,15 @@ dependencies = [
"libc", "libc",
] ]
[[package]]
name = "malloc_buf"
version = "0.0.6"
source = "registry+https://github.com/rust-lang/crates.io-index"
checksum = "62bb907fe88d54d8d9ce32a3cceab4218ed2f6b7d35617cafe9adf84e43919cb"
dependencies = [
"libc",
]
[[package]] [[package]]
name = "markup5ever" name = "markup5ever"
version = "0.14.1" version = "0.14.1"
@@ -2717,6 +2766,27 @@ dependencies = [
"url", "url",
] ]
[[package]]
name = "objc"
version = "0.2.7"
source = "registry+https://github.com/rust-lang/crates.io-index"
checksum = "915b1b472bc21c53464d6c8461c9d3af805ba1ef837e1cac254428f4a77177b1"
dependencies = [
"malloc_buf",
"objc_exception",
]
[[package]]
name = "objc-foundation"
version = "0.1.1"
source = "registry+https://github.com/rust-lang/crates.io-index"
checksum = "1add1b659e36c9607c7aab864a76c7a4c2760cd0cd2e120f3fb8b952c7e22bf9"
dependencies = [
"block",
"objc",
"objc_id",
]
[[package]] [[package]]
name = "objc2" name = "objc2"
version = "0.6.3" version = "0.6.3"
@@ -2979,6 +3049,24 @@ dependencies = [
"objc2-security", "objc2-security",
] ]
[[package]]
name = "objc_exception"
version = "0.1.2"
source = "registry+https://github.com/rust-lang/crates.io-index"
checksum = "ad970fb455818ad6cba4c122ad012fae53ae8b4795f86378bce65e4f6bab2ca4"
dependencies = [
"cc",
]
[[package]]
name = "objc_id"
version = "0.1.1"
source = "registry+https://github.com/rust-lang/crates.io-index"
checksum = "c92d4ddb4bd7b50d730c215ff871754d0da6b2178849f8a2a2ab69712d0c073b"
dependencies = [
"objc",
]
[[package]] [[package]]
name = "object" name = "object"
version = "0.32.2" version = "0.32.2"
@@ -4114,6 +4202,29 @@ version = "1.2.0"
source = "registry+https://github.com/rust-lang/crates.io-index" source = "registry+https://github.com/rust-lang/crates.io-index"
checksum = "94143f37725109f92c262ed2cf5e59bce7498c01bcc1502d7b9afe439a4e9f49" checksum = "94143f37725109f92c262ed2cf5e59bce7498c01bcc1502d7b9afe439a4e9f49"
[[package]]
name = "screencapturekit"
version = "0.2.8"
source = "registry+https://github.com/rust-lang/crates.io-index"
checksum = "1a5eeeb57ac94960cfe5ff4c402be6585ae4c8d29a2cf41b276048c2e849d64e"
dependencies = [
"screencapturekit-sys",
]
[[package]]
name = "screencapturekit-sys"
version = "0.2.8"
source = "registry+https://github.com/rust-lang/crates.io-index"
checksum = "22411b57f7d49e7fe08025198813ee6fd65e1ee5eff4ebc7880c12c82bde4c60"
dependencies = [
"block",
"dispatch",
"objc",
"objc-foundation",
"objc_id",
"once_cell",
]
[[package]] [[package]]
name = "sct" name = "sct"
version = "0.7.1" version = "0.7.1"

View File

@@ -1,6 +1,6 @@
[package] [package]
name = "hearbit-ai" name = "hearbit-ai"
version = "0.1.2" version = "1.2.1"
description = "A Tauri App" description = "A Tauri App"
authors = ["you"] authors = ["you"]
edition = "2021" edition = "2021"
@@ -38,3 +38,8 @@ lettre = { version = "0.11", features = ["tokio1", "tokio1-native-tls", "builder
tauri-plugin-log = "2.0.0" tauri-plugin-log = "2.0.0"
tauri-plugin-shell = "2.3.4" tauri-plugin-shell = "2.3.4"
base64 = "0.22" base64 = "0.22"
screencapturekit = "0.2.0"
screencapturekit-sys = "0.2.8"
cocoa = "0.26.1"
objc = "0.2.7"
core-foundation = "0.10.1"

65
src-tauri/src/apps.rs Normal file
View File

@@ -0,0 +1,65 @@
use cocoa::base::{id, nil};
use cocoa::foundation::{NSArray, NSAutoreleasePool, NSString};
use objc::{msg_send, sel, sel_impl};
use serde::Serialize;
#[derive(Serialize, Debug)]
pub struct RunningApp {
pub name: String,
pub bundle_id: String,
}
pub fn get_running_applications() -> Vec<RunningApp> {
let mut apps = Vec::new();
unsafe {
let pool = NSAutoreleasePool::new(nil);
// [NSWorkspace sharedWorkspace]
let workspace_class = objc::runtime::Class::get("NSWorkspace").unwrap();
let shared_workspace: id = msg_send![workspace_class, sharedWorkspace];
// [sharedWorkspace runningApplications]
let running_apps: id = msg_send![shared_workspace, runningApplications];
let count: usize = msg_send![running_apps, count];
for i in 0..count {
let app: id = msg_send![running_apps, objectAtIndex: i];
// Check if it's a regular GUI application
// activationPolicy: 0 = regular, 1 = accessory, 2 = prohibited
let policy: isize = msg_send![app, activationPolicy];
if policy != 0 {
continue;
}
let bundle_id_ns: id = msg_send![app, bundleIdentifier];
let name_ns: id = msg_send![app, localizedName];
if bundle_id_ns != nil && name_ns != nil {
let bundle_id = nsstring_to_string(bundle_id_ns);
let name = nsstring_to_string(name_ns);
apps.push(RunningApp { name, bundle_id });
}
}
// We don't drain the pool here as it might be handled by Tauri's main loop if we are called frequently,
// but for a one-off command it's safer.
let _: () = msg_send![pool, release];
}
// Sort by name for better UI experience
apps.sort_by(|a, b| a.name.to_lowercase().cmp(&b.name.to_lowercase()));
apps
}
unsafe fn nsstring_to_string(ns_string: id) -> String {
let char_ptr: *const std::os::raw::c_char = msg_send![ns_string, UTF8String];
if char_ptr.is_null() {
return String::new();
}
std::ffi::CStr::from_ptr(char_ptr)
.to_string_lossy()
.into_owned()
}

View File

@@ -1,5 +1,6 @@
use std::sync::{Arc, Mutex}; use std::sync::{Arc, Mutex};
use tauri::{AppHandle, Emitter}; use tauri::{AppHandle, Emitter};
use crate::emit_log;
use cpal::Sample; use cpal::Sample;
use hound::WavWriter; use hound::WavWriter;
use rubato::{Resampler, FastFixedIn, PolynomialDegree}; use rubato::{Resampler, FastFixedIn, PolynomialDegree};
@@ -39,6 +40,12 @@ pub struct AudioProcessor {
// Event Emission // Event Emission
app_handle: Option<AppHandle>, app_handle: Option<AppHandle>,
last_event_time: std::time::Instant, last_event_time: std::time::Instant,
// System Audio Queue for Mixing
pub system_queue: Arc<Mutex<std::collections::VecDeque<f32>>>,
// Recording Mode (voice or meeting)
recording_mode: String,
} }
impl AudioProcessor { impl AudioProcessor {
@@ -47,7 +54,8 @@ impl AudioProcessor {
channel_count: u16, channel_count: u16,
writer: Arc<Mutex<WavWriter<std::io::BufWriter<std::fs::File>>>>, writer: Arc<Mutex<WavWriter<std::io::BufWriter<std::fs::File>>>>,
app_handle: AppHandle, app_handle: AppHandle,
wait_for_speech: bool wait_for_speech: bool,
recording_mode: String,
) -> Result<Self, String> { ) -> Result<Self, String> {
let vad_sample_rate = 16000; let vad_sample_rate = 16000;
let vad_chunk_size = 512; let vad_chunk_size = 512;
@@ -68,8 +76,8 @@ impl AudioProcessor {
1 1
).map_err(|e| format!("Failed to init Resampler: {:?}", e))?; ).map_err(|e| format!("Failed to init Resampler: {:?}", e))?;
// Pre-roll buffer (1.0 seconds) * Channels (interleaved store) // Pre-roll buffer (3.0 seconds) * Channels (interleaved store)
let ring_curr_seconds = 1.0; let ring_curr_seconds = 3.0;
// WavWriter writes interleaved, so we store interleaved. // WavWriter writes interleaved, so we store interleaved.
let ring_size = (sample_rate as f32 * ring_curr_seconds) as usize * channel_count as usize; let ring_size = (sample_rate as f32 * ring_curr_seconds) as usize * channel_count as usize;
@@ -96,10 +104,56 @@ impl AudioProcessor {
total_processed_samples: 0, total_processed_samples: 0,
app_handle: Some(app_handle), app_handle: Some(app_handle),
last_event_time: std::time::Instant::now(), last_event_time: std::time::Instant::now(),
system_queue: Arc::new(Mutex::new(std::collections::VecDeque::new())),
recording_mode,
}) })
} }
pub fn process(&mut self, data: &[f32]) { pub fn process(&mut self, input_data: &[f32]) {
// MIXING LOGIC:
// We have `input_data` (Microphone). We check `system_queue` for System Audio.
// System Audio is hardcoded to 2 channels (Stereo) in sc_audio.rs.
// Microphone `self.channel_count` can be 1 (Mono) or 2 (Stereo).
let mic_channels = self.channel_count as usize;
let mut mixed_data = input_data.to_vec();
let mut max_system_energy = 0.0;
let gain_mic = 1.0;
let gain_sys = 0.8; // Slightly lower system audio to prioritize speaker
if let Ok(mut queue) = self.system_queue.lock() {
let frames = mixed_data.len() / mic_channels;
for f in 0..frames {
// system_queue is always stereo (L, R, L, R...)
if let (Some(l), Some(r)) = (queue.pop_front(), queue.pop_front()) {
let abs_l = l.abs();
let abs_r = r.abs();
let current_sys_max = if abs_l > abs_r { abs_l } else { abs_r };
if current_sys_max > max_system_energy {
max_system_energy = current_sys_max;
}
if mic_channels == 1 {
// Mic is Mono: Mix System L+R down to Mono
let sys_mono = (l + r) / 2.0;
let mixed = (mixed_data[f] * gain_mic) + (sys_mono * gain_sys);
mixed_data[f] = mixed.max(-1.0).min(1.0);
} else {
// Mic is Stereo: Mix L-to-L and R-to-R
let f_start = f * 2;
let mixed_l = (mixed_data[f_start] * gain_mic) + (l * gain_sys);
let mixed_r = (mixed_data[f_start + 1] * gain_mic) + (r * gain_sys);
mixed_data[f_start] = mixed_l.max(-1.0).min(1.0);
mixed_data[f_start + 1] = mixed_r.max(-1.0).min(1.0);
}
}
}
}
let data = &mixed_data;
// 1. Add to Ring Buffer (Interleaved data - Record EVERYTHING) // 1. Add to Ring Buffer (Interleaved data - Record EVERYTHING)
for &sample in data { for &sample in data {
self.ring_buffer[self.ring_pos] = sample; self.ring_buffer[self.ring_pos] = sample;
@@ -108,8 +162,7 @@ impl AudioProcessor {
// 2. Prepare VAD Signal (Mono Mixdown) // 2. Prepare VAD Signal (Mono Mixdown)
// FRESH START LOGIC (v0.2.0): // FRESH START LOGIC (v0.2.0):
// We expect standard Stereo Input (BlackHole 2ch). // We expect standard Stereo Input.
// No magic 3-channel aggregate.
let channels = self.channel_count as usize; let channels = self.channel_count as usize;
let frame_count = data.len() / channels; let frame_count = data.len() / channels;
@@ -146,7 +199,6 @@ impl AudioProcessor {
self.vad_buffer.extend_from_slice(&waves_out[0][0..out_len]); self.vad_buffer.extend_from_slice(&waves_out[0][0..out_len]);
} }
} }
// Update output buffer usage... logic is tricky with drain.
} }
// 4. Process VAD // 4. Process VAD
@@ -155,21 +207,15 @@ impl AudioProcessor {
// Run Detection // Run Detection
let probability = self.vad.predict(vad_chunk.clone()); let probability = self.vad.predict(vad_chunk.clone());
// Calculate RMS for this chunk to use as fallback/hybrid detection let system_is_active = max_system_energy > 0.005; // Lowered to match trigger
let sq_sum: f32 = vad_chunk.iter().map(|x| x * x).sum(); let is_speech = probability > 0.9;
let rms = (sq_sum / vad_chunk.len() as f32).sqrt();
// Hybrid VAD: Probability > 0.9 OR RMS > 0.025 if is_speech || system_is_active {
// INCREASED THRESHOLDS (v1.1.1):
// Reduced sensitivity to avoid background noise triggering recording.
let is_speech = probability > 0.9 || rms > 0.025;
if is_speech {
self.is_speech_active = true; self.is_speech_active = true;
self.last_speech_time = self.total_processed_samples; self.last_speech_time = self.total_processed_samples;
} }
// Emit VAD event periodically (every 500ms is enough for non-diagnostic mode) // Emit VAD event periodically
if self.last_event_time.elapsed().as_millis() > 500 { if self.last_event_time.elapsed().as_millis() > 500 {
if let Some(app) = &self.app_handle { if let Some(app) = &self.app_handle {
#[derive(Clone, serde::Serialize)] #[derive(Clone, serde::Serialize)]
@@ -183,11 +229,6 @@ impl AudioProcessor {
}); });
} }
self.last_event_time = std::time::Instant::now(); self.last_event_time = std::time::Instant::now();
// IMPORTANT: We reset is_speech_active after emitting,
// so we don't latch it forever if the user stops talking.
// However, the main loop sets it to true if current chunk is speech.
// This logic is a bit of a "latch for X ms".
self.is_speech_active = false; self.is_speech_active = false;
} }
} }
@@ -195,9 +236,32 @@ impl AudioProcessor {
// 4. Update Hangover and Check Write condition // 4. Update Hangover and Check Write condition
if self.waiting_for_speech { if self.waiting_for_speech {
if self.is_speech_active { // TRIGGER CONDITION:
// 1. VAD says speech (Someone is talking)
// 2. AND System Audio has energy (Meaning audio is coming from the PC, i.e., Call started)
// Threshold 0.01 is roughly -40dB, should cover ringtones/speech easily but ignore silence/hiss.
let system_active = max_system_energy > 0.005;
// Periodically log energy to help debug why meeting mode might not start
if self.last_event_time.elapsed().as_millis() > 2000 && self.recording_mode == "meeting" {
if let Some(app) = &self.app_handle {
emit_log(app, "DEBUG", &format!("Waiting for Meeting... Current System Energy: {:.4} (Threshold: 0.005)", max_system_energy));
}
}
// MODE-SPECIFIC TRIGGER LOGIC:
// "voice" -> Trigger if user speaks (is_speech_active)
// "meeting" -> Trigger ONLY if system audio energy detected (Call starting)
let trigger = if self.recording_mode == "voice" {
self.is_speech_active
} else {
system_active
};
if trigger {
// Trigger Detected! // Trigger Detected!
println!("Auto-Start: Speech detected. Flushing pre-roll..."); println!("Auto-Start: Call detected (SysEnergy: {}). Flushing pre-roll...", max_system_energy);
self.waiting_for_speech = false; self.waiting_for_speech = false;
// Flush Ring Buffer (Orderly: from ring_pos to end, then 0 to ring_pos) // Flush Ring Buffer (Orderly: from ring_pos to end, then 0 to ring_pos)
@@ -229,7 +293,13 @@ impl AudioProcessor {
// Standard Recording Logic (Active or Hangover) // Standard Recording Logic (Active or Hangover)
let time_since_speech = self.total_processed_samples.saturating_sub(self.last_speech_time); let time_since_speech = self.total_processed_samples.saturating_sub(self.last_speech_time);
if self.is_speech_active || time_since_speech < self.hangover_samples { // We write to file if:
// 1. VAD thinks someone is speaking (Mic or System)
// 2. OR System audio energy is currently above threshold (Ensures calls are captured)
// 3. OR we are within the hangover period
let system_is_active = max_system_energy > 0.005;
if self.is_speech_active || system_is_active || time_since_speech < self.hangover_samples {
let mut guard = self.writer.lock().unwrap(); let mut guard = self.writer.lock().unwrap();
for &sample in data { for &sample in data {
let amplitude = i16::MAX as f32; let amplitude = i16::MAX as f32;

File diff suppressed because it is too large Load Diff

984
src-tauri/src/lib.rs.bak Normal file
View File

@@ -0,0 +1,984 @@
use tauri::{
AppHandle, Manager, State, Emitter,
menu::{Menu, MenuItem},
tray::{TrayIconBuilder, TrayIconEvent},
WindowEvent
};
use std::sync::{Arc, Mutex};
use std::process::Command;
use cpal::traits::{DeviceTrait, HostTrait, StreamTrait};
use std::time::Duration;
use tokio::time::sleep;
use base64::Engine;
mod audio_processor;
use audio_processor::AudioProcessor;
mod auth;
mod email;
mod sc_audio;
// State to hold the active recording stream
struct AppState {
recording_stream: Mutex<Option<cpal::Stream>>,
recording_file_path: Mutex<Option<String>>,
system_capture: Mutex<Option<sc_audio::SystemAudioCapture>>,
}
#[derive(serde::Serialize)]
struct AudioDevice {
id: String,
name: String,
}
#[derive(serde::Serialize, Clone)]
struct LogEvent {
level: String,
message: String,
timestamp: String,
}
pub(crate) fn emit_log(app: &AppHandle, level: &str, message: &str) {
let log = LogEvent {
level: level.to_string(),
message: message.to_string(),
timestamp: chrono::Local::now().format("%H:%M:%S").to_string(),
};
let _ = app.emit("log-event", log);
}
#[tauri::command]
fn greet(name: &str) -> String {
format!("Hello, {}! You've been greeted from Rust!", name)
}
#[tauri::command]
fn get_input_devices() -> Result<Vec<AudioDevice>, String> {
let host = cpal::default_host();
let devices = host.input_devices().map_err(|e| e.to_string())?;
let mut result = Vec::new();
for device in devices {
#[allow(deprecated)]
if let Ok(name) = device.name() {
// macOS often produces weird names, but let's just use what we get
result.push(AudioDevice {
id: name.clone(), // Using name as ID for simplicity in this MVP
name,
});
}
}
Ok(result)
}
#[tauri::command]
async fn start_recording(app: AppHandle, state: State<'_, AppState>, device_id: String, save_path: Option<String>, custom_filename: Option<String>, wait_for_speech: Option<bool>, mode: String) -> Result<(), String> {
emit_log(&app, "INFO", &format!("Starting recording [Mode: {}] on device: {}", mode, device_id));
let host = cpal::default_host();
// Find device by name (using name as ID)
#[allow(deprecated)]
let device = host.input_devices()
.map_err(|e| e.to_string())?
.find(|d| d.name().map(|n| n == device_id).unwrap_or(false))
.or_else(|| host.default_input_device())
.ok_or("No input device found")?;
// Select the configuration with the MAXIMUM number of channels
// This is crucial for "Hearbit Audio" (Aggregate) which lists 3 channels but might default to 2.
// Select Audio Configuration
// We prioritize 48kHz because System Audio (ScreenCaptureKit) acts best at 48k.
let supported_configs: Vec<_> = device.supported_input_configs().map_err(|e| e.to_string())?.collect();
// Try to find 48kHz specifically
// Note: cpal::SampleRate is likely a type alias for u32 here, so we pass 48000 directly.
let config = supported_configs.iter()
.find(|c| c.min_sample_rate() <= 48000 && c.max_sample_rate() >= 48000)
.map(|c| c.with_sample_rate(48000))
.or_else(|| {
// Fallback: Max sample rate
supported_configs.iter()
.max_by_key(|c| c.channels())
.map(|c| c.with_max_sample_rate())
})
.ok_or("No supported input configurations found")?;
emit_log(&app, "INFO", &format!("Selected Audio Config: {} Channels, {} Hz", config.channels(), config.sample_rate()));
let spec = hound::WavSpec {
channels: config.channels(),
sample_rate: config.sample_rate(),
bits_per_sample: 16,
sample_format: hound::SampleFormat::Int,
};
// Determine file path: User provided or Temp
let filename = if let Some(name) = custom_filename {
// Sanitize filename
let safe_name: String = name.chars().map(|x| if x.is_alphanumeric() || x == ' ' || x == '-' || x == '_' { x } else { '_' }).collect();
format!("{}.wav", safe_name)
} else {
format!("recording_{}.wav", std::time::SystemTime::now().duration_since(std::time::UNIX_EPOCH).unwrap().as_secs())
};
let file_path = if let Some(path) = save_path {
if path.trim().is_empty() {
std::env::temp_dir().join(&filename)
} else {
std::path::PathBuf::from(path).join(&filename)
}
} else {
std::env::temp_dir().join(&filename)
};
let file_path_str = file_path.to_string_lossy().to_string();
emit_log(&app, "INFO", &format!("Saving recording to: {}", file_path_str));
let writer = hound::WavWriter::create(&file_path, spec).map_err(|e| e.to_string())?;
let writer = Arc::new(Mutex::new(writer));
let writer_clone = writer.clone();
// Initialize AudioProcessor (VAD)
// We pass the writer to it.
let should_wait = wait_for_speech.unwrap_or(false);
if should_wait {
emit_log(&app, "INFO", &format!("Recording started in WAITING mode (Trigger: {}).", if mode == "voice" { "Speech" } else { "System Audio" }));
}
let processor = AudioProcessor::new(config.sample_rate(), config.channels(), writer.clone(), app.clone(), should_wait, mode)
.map_err(|e| format!("Failed to create AudioProcessor: {}", e))?;
// Wrap processor in Arc<Mutex> so we can share/move it into callback
// Actually, cpal callback takes ownership of its closure state usually if 'move'.
// Since stream is on another thread, we need Send. AudioProcessor should be Send.
// However, the callback is called repeatedly. We need to keep state.
// The workaround is to wrap it in a Mutex.
let processor = Arc::new(Mutex::new(processor));
let processor_clone = processor.clone();
// --- SYSTEM AUDIO CAPTURE START ---
// Prevent Doubling: If user selected an aggregate device (Hearbit Audio/BlackHole),
// it ALREADY contains system audio. In that case, we don't need internal SCK capture.
let is_aggregate = device_id.contains("Hearbit") || device_id.contains("BlackHole");
if is_aggregate {
emit_log(&app, "INFO", "Aggregate device detected. Disabling internal System Audio Capture to prevent doubling.");
} else {
let mut sys_capture = sc_audio::SystemAudioCapture::new(config.sample_rate());
// Get the queue to share with the capture callback
let queue_clone = {
let p = processor.lock().unwrap();
p.system_queue.clone() // Access the pub field we added
};
let sys_callback = move |data: &[f32]| {
// Push to queue
if let Ok(mut q) = queue_clone.lock() {
q.extend(data.iter());
// Limit queue size to avoid memory leaks if main process loop is slow
while q.len() > 48000 * 5 { // 5 seconds buffer
q.pop_front();
}
}
};
match sys_capture.start(sys_callback).await {
Ok(_) => emit_log(&app, "INFO", "System Audio Capture started."),
Err(e) => emit_log(&app, "WARN", &format!("System Audio Capture failed (Permissions?): {}", e)),
}
*state.system_capture.lock().unwrap() = Some(sys_capture);
}
// --- SYSTEM AUDIO CAPTURE END ---
let app_handle = app.clone();
let err_fn = move |err| {
eprintln!("an error occurred on stream: {}", err);
emit_log(&app_handle, "ERROR", &format!("Stream error: {}", err));
};
let stream = match config.sample_format() {
cpal::SampleFormat::F32 => device.build_input_stream(
&config.into(),
move |data: &[f32], _: &_| {
if let Ok(mut p) = processor_clone.lock() {
p.process(data);
}
},
err_fn,
None
),
// For I16 and U16 we need to convert to F32 for our processor
cpal::SampleFormat::I16 => device.build_input_stream(
&config.into(),
move |data: &[i16], _: &_| {
// Convert i16 to f32
let f32_data: Vec<f32> = data.iter().map(|&s| s as f32 / i16::MAX as f32).collect();
if let Ok(mut p) = processor_clone.lock() {
p.process(&f32_data);
}
},
err_fn,
None
),
cpal::SampleFormat::U16 => device.build_input_stream(
&config.into(),
move |data: &[u16], _: &_| {
// Convert u16 to f32
let f32_data: Vec<f32> = data.iter().map(|&s| (s as i32 - 32768) as f32 / 32768.0).collect();
if let Ok(mut p) = processor_clone.lock() {
p.process(&f32_data);
}
},
err_fn,
None
),
_ => return Err("Unsupported sample format".to_string()),
}.map_err(|e| e.to_string())?;
stream.play().map_err(|e| e.to_string())?;
// Store state
*state.recording_stream.lock().unwrap() = Some(stream);
*state.recording_file_path.lock().unwrap() = Some(file_path_str.clone());
emit_log(&app, "SUCCESS", &format!("Recording started. File: {}", file_path_str));
Ok(())
}
#[tauri::command]
fn stop_recording(app: AppHandle, state: State<'_, AppState>) -> Result<String, String> {
emit_log(&app, "INFO", "Stopping recording...");
// Drop stream to stop recording
{
let mut stream_guard = state.recording_stream.lock().unwrap();
// Also stop System Capture
let mut sys_guard = state.system_capture.lock().unwrap();
if let Some(sys) = sys_guard.as_mut() {
sys.stop();
}
*sys_guard = None;
if stream_guard.is_none() {
return Err("Not recording".to_string());
}
*stream_guard = None; // This drops the stream and stops recording
}
// Return file path
let mut path_guard = state.recording_file_path.lock().unwrap();
let path = path_guard.take().ok_or("No recording path found".to_string())?;
emit_log(&app, "SUCCESS", &format!("Recording stopped. Saved to: {}", path));
Ok(path)
}
#[tauri::command]
fn pause_recording(app: AppHandle, state: State<'_, AppState>) -> Result<(), String> {
emit_log(&app, "INFO", "Pausing recording...");
let stream_guard = state.recording_stream.lock().unwrap();
if let Some(stream) = stream_guard.as_ref() {
stream.pause().map_err(|e| e.to_string())?;
emit_log(&app, "SUCCESS", "Recording paused.");
Ok(())
} else {
Err("Not recording".to_string())
}
}
#[tauri::command]
fn resume_recording(app: AppHandle, state: State<'_, AppState>) -> Result<(), String> {
emit_log(&app, "INFO", "Resuming recording...");
let stream_guard = state.recording_stream.lock().unwrap();
if let Some(stream) = stream_guard.as_ref() {
stream.play().map_err(|e| e.to_string())?;
emit_log(&app, "SUCCESS", "Recording resumed.");
Ok(())
} else {
Err("Not recording".to_string())
}
}
#[derive(serde::Deserialize)]
struct ModelListResponse {
data: Vec<ModelData>,
}
#[derive(serde::Deserialize)]
struct ModelData {
id: String,
#[allow(dead_code)]
owned_by: Option<String>,
}
// Structs for Infomaniak API responses
#[derive(serde::Deserialize)]
struct WhisperResponse {
text: Option<String>,
batch_id: Option<String>,
}
#[derive(serde::Deserialize)]
struct ChatCompletionResponse {
choices: Vec<Choice>,
}
#[derive(serde::Deserialize)]
struct Choice {
message: Message,
}
#[derive(serde::Deserialize)]
struct Message {
#[allow(dead_code)]
content: String,
}
#[derive(serde::Serialize)]
struct ModelInfo {
id: String,
name: String,
}
#[tauri::command]
async fn get_available_models(app: AppHandle, api_key: String, product_id: String) -> Result<Vec<ModelInfo>, String> {
emit_log(&app, "INFO", "Fetching available models from Infomaniak...");
let client = reqwest::Client::new();
// Use the v2/openai compliant endpoint as per docs
let url = format!("https://api.infomaniak.com/2/ai/{}/openai/v1/models", product_id);
emit_log(&app, "DEBUG", &format!("GET {}", url));
let res = client.get(&url)
.header("Authorization", format!("Bearer {}", api_key))
.send()
.await
.map_err(|e| {
let msg = format!("Network error fetching models: {}", e);
emit_log(&app, "ERROR", &msg);
msg
})?;
if res.status().is_success() {
let raw_body = res.text().await.map_err(|e| e.to_string())?;
// println!("Models Raw Response: {}", raw_body);
let list: ModelListResponse = serde_json::from_str(&raw_body)
.map_err(|e| format!("Failed to parse models: {}. Body: {}", e, raw_body))?;
let models = list.data.into_iter()
.filter(|m| !m.id.to_lowercase().contains("mini_lm") && !m.id.to_lowercase().contains("bert") && !m.id.to_lowercase().contains("embedding"))
.map(|m| ModelInfo {
id: m.id.clone(),
name: m.id, // Use ID as name for now, or fetch more details if available
}).collect::<Vec<ModelInfo>>();
emit_log(&app, "SUCCESS", &format!("Loaded {} models.", models.len()));
Ok(models)
} else {
let err = res.text().await.unwrap_or_default();
emit_log(&app, "ERROR", &format!("Failed to fetch models: {}", err));
Err(format!("Failed to fetch models: {}", err))
}
}
#[derive(serde::Deserialize)]
struct WhisperVerboseResponse {
text: Option<String>,
segments: Option<Vec<Segment>>,
}
#[derive(serde::Deserialize)]
struct Segment {
start: f64,
end: f64,
text: String,
}
#[tauri::command]
async fn transcribe_audio(app: AppHandle, file_path: String, api_key: String, product_id: String) -> Result<String, String> {
emit_log(&app, "INFO", "Starting transcription with timestamps...");
let client = reqwest::Client::new();
// Prepare file part
let file_bytes = std::fs::read(&file_path).map_err(|e| e.to_string())?;
// We must use a known file name for the part, Infomaniak might care, or not.
let file_part = reqwest::multipart::Part::bytes(file_bytes)
.file_name("recording.wav")
.mime_str("audio/wav")
.map_err(|e| e.to_string())?;
let form = reqwest::multipart::Form::new()
.part("file", file_part)
.text("model", "whisper")
.text("response_format", "verbose_json")
.text("timestamp_granularities[]", "segment"); // Crucial for accurate segments
let url = format!("https://api.infomaniak.com/1/ai/{}/openai/audio/transcriptions", product_id);
emit_log(&app, "DEBUG", &format!("POST {}", url));
let res = client.post(&url)
.header("Authorization", format!("Bearer {}", api_key))
.multipart(form)
.send()
.await
.map_err(|e| {
let msg = format!("Network error during transcription: {}", e);
emit_log(&app, "ERROR", &msg);
msg
})?;
if res.status().is_success() {
let raw_body = res.text().await.map_err(|e| e.to_string())?;
// Check if we got a batch ID
#[derive(serde::Deserialize)]
struct BatchResponse {
batch_id: Option<String>,
}
// Try parsing as batch response first (Infomaniak specific behavior)
if let Ok(batch_res) = serde_json::from_str::<BatchResponse>(&raw_body) {
if let Some(batch_id) = batch_res.batch_id {
emit_log(&app, "INFO", &format!("Transcription queued. Batch ID: {}", batch_id));
return poll_transcription(&app, &client, &api_key, &product_id, &batch_id).await;
}
}
// If not batch, try parsing verbose response directly
// Log the raw body so we can see why it fails
emit_log(&app, "DEBUG", &format!("Direct Response (first 500 chars): {:.500}", raw_body));
let response: WhisperVerboseResponse = serde_json::from_str(&raw_body)
.map_err(|e| format!("Failed to decode JSON: {}. Body: {}", e, raw_body))?;
if let Some(segments) = response.segments {
emit_log(&app, "INFO", &format!("Found {} segments (Direct).", segments.len()));
for (i, seg) in segments.iter().take(3).enumerate() {
emit_log(&app, "DEBUG", &format!("Seg {}: start={}", i, seg.start));
}
// Format timestamps: [MM:SS] Text
let mut formatted_transcript = String::new();
for segment in segments {
let start_mins = (segment.start / 60.0).floor() as u64;
let start_secs = (segment.start % 60.0).floor() as u64;
formatted_transcript.push_str(&format!("[{:02}:{:02}] {}\n", start_mins, start_secs, segment.text.trim()));
}
// Fallback to raw text if segments empty
if formatted_transcript.trim().is_empty() {
if let Some(text) = response.text {
emit_log(&app, "SUCCESS", "Segments missing, using raw text.");
return Ok(text);
}
} else {
emit_log(&app, "SUCCESS", "Transcription received with timestamps.");
return Ok(formatted_transcript);
}
} else if let Some(text) = response.text {
emit_log(&app, "SUCCESS", "Segments missing, using raw text.");
return Ok(text);
}
emit_log(&app, "ERROR", "Response contained no recognized content.");
Err(format!("Response contained no recognized content. Body: {}", raw_body))
} else {
let error_text = res.text().await.unwrap_or_default();
emit_log(&app, "ERROR", &format!("Transcription failed: {}", error_text));
Err(format!("Transcription failed: {}", error_text))
}
}
async fn poll_transcription(app: &AppHandle, client: &reqwest::Client, api_key: &str, product_id: &str, batch_id: &str) -> Result<String, String> {
let status_url = format!("https://api.infomaniak.com/1/ai/{}/results/{}", product_id, batch_id);
let mut attempts = 0;
while attempts < 40 { // 40 * 2s = 80s timeout
attempts += 1;
sleep(Duration::from_secs(2)).await;
emit_log(app, "DEBUG", &format!("Polling status... Attempt {}", attempts));
let res = client.get(&status_url)
.header("Authorization", format!("Bearer {}", api_key))
.send()
.await
.map_err(|e| format!("Polling error: {}", e))?;
if res.status().is_success() {
let json: serde_json::Value = res.json().await.map_err(|e| e.to_string())?;
// Check 'status'
if let Some(status) = json.get("status").and_then(|s| s.as_str()) {
if status == "success" {
// Download the result
let download_url = format!("https://api.infomaniak.com/1/ai/{}/results/{}/download", product_id, batch_id);
let dl_res = client.get(&download_url)
.header("Authorization", format!("Bearer {}", api_key))
.send()
.await
.map_err(|e| e.to_string())?;
if dl_res.status().is_success() {
let content = dl_res.text().await.map_err(|e| e.to_string())?;
emit_log(app, "DEBUG", &format!("Poll Raw Content (first 500 chars): {:.500}", content));
// Try to parse as Verbose JSON to get timestamps
if let Ok(response) = serde_json::from_str::<WhisperVerboseResponse>(&content) {
if let Some(segments) = response.segments {
emit_log(app, "INFO", &format!("Found {} segments.", segments.len()));
// Log first 3 segments start times
for (i, seg) in segments.iter().take(3).enumerate() {
emit_log(app, "DEBUG", &format!("Seg {}: start={}", i, seg.start));
}
let mut formatted_transcript = String::new();
for segment in segments {
let start_mins = (segment.start / 60.0).floor() as u64;
let start_secs = (segment.start % 60.0).floor() as u64;
formatted_transcript.push_str(&format!("[{:02}:{:02}] {}\n", start_mins, start_secs, segment.text.trim()));
}
if !formatted_transcript.trim().is_empty() {
emit_log(app, "SUCCESS", "Transcription completed (async) with timestamps.");
return Ok(formatted_transcript);
} else {
emit_log(app, "WARN", "Segments found but empty content.");
}
} else {
emit_log(app, "WARN", "Verbose parsed but no segments found.");
}
if let Some(text) = response.text {
emit_log(app, "SUCCESS", "Transcription completed (async) - raw text (segments missing).");
return Ok(text);
}
} else {
emit_log(app, "WARN", "Failed to parse poll content as WhisperVerboseResponse");
}
emit_log(app, "SUCCESS", "Transcription completed - returning raw content.");
// If not JSON or no text field, return raw content
return Ok(content);
} else {
emit_log(app, "ERROR", "Failed to download transcription results.");
return Err(format!("Download failed: {}", dl_res.status()));
}
} else if status == "failed" || status == "error" {
let err_msg = format!("Batch processing failed [Status: {}]. Full Response: {:?}", status, json);
emit_log(app, "ERROR", &err_msg);
return Err(err_msg);
}
// If 'processing' or 'pending', continue loop
}
}
}
emit_log(app, "ERROR", "Transcription timed out after 80s.");
Err("Transcription timed out".to_string())
}
#[tauri::command]
async fn summarize_text(app: AppHandle, text: String, api_key: String, product_id: String, prompt: String, model: String) -> Result<String, String> {
emit_log(&app, "INFO", "Starting summarization...");
let client = reqwest::Client::new();
let url = format!("https://api.infomaniak.com/2/ai/{}/openai/v1/chat/completions", product_id);
let messages = serde_json::json!([
{ "role": "system", "content": prompt },
{ "role": "user", "content": text }
]);
let model_to_use = if model.is_empty() { "mixtral".to_string() } else { model };
let body = serde_json::json!({
"model": model_to_use,
"messages": messages
});
emit_log(&app, "DEBUG", &format!("POST {}", url));
let res = client.post(&url)
.header("Authorization", format!("Bearer {}", api_key))
.header("Content-Type", "application/json")
.json(&body)
.send()
.await
.map_err(|e| {
let msg = format!("Network error during summarization: {}", e);
emit_log(&app, "ERROR", &msg);
msg
})?;
if res.status().is_success() {
let raw_body = res.text().await.map_err(|e| e.to_string())?;
// println!("Summarization Raw Response: {}", raw_body);
let response_body: ChatCompletionResponse = serde_json::from_str(&raw_body)
.map_err(|e| format!("Failed to decode JSON: {}. Body: {}", e, raw_body))?;
if let Some(choice) = response_body.choices.first() {
emit_log(&app, "SUCCESS", "Summarization received.");
Ok(choice.message.content.clone())
} else {
emit_log(&app, "WARN", "No summary generated in response.");
Err("No summary generated".to_string())
}
} else {
let error_text = res.text().await.unwrap_or_default();
emit_log(&app, "ERROR", &format!("Summarization failed: {}", error_text));
Err(format!("Summarization failed: {}", error_text))
}
}
#[derive(serde::Serialize)]
struct AudioMetadata {
duration: f64,
size: u64,
format: String,
}
// Helper to find ffmpeg/ffprobe in common paths
fn resolve_binary_path(binary_name: &str) -> String {
let common_paths = [
format!("/opt/homebrew/bin/{}", binary_name),
format!("/usr/local/bin/{}", binary_name),
format!("/usr/bin/{}", binary_name),
];
for path in common_paths.iter() {
if std::path::Path::new(path).exists() {
return path.clone();
}
}
// Fallback to expecting it in PATH
binary_name.to_string()
}
#[tauri::command]
fn get_audio_metadata(app: AppHandle, file_path: String) -> Result<AudioMetadata, String> {
emit_log(&app, "INFO", &format!("Getting metadata for: {}", file_path));
let path = std::path::Path::new(&file_path);
if !path.exists() {
return Err(format!("File not found: {}", file_path));
}
let size = std::fs::metadata(&file_path)
.map_err(|e| e.to_string())?
.len();
// Use ffprobe to get duration
// Try resolved path first
let ffprobe_cmd = resolve_binary_path("ffprobe");
let output = Command::new(&ffprobe_cmd)
.args([
"-v", "error",
"-show_entries", "format=duration",
"-of", "default=noprint_wrappers=1:nokey=1",
&file_path
])
.output()
.map_err(|e| format!("Failed to execute ffprobe at '{}': {}", ffprobe_cmd, e))?;
let duration_str = String::from_utf8_lossy(&output.stdout);
let duration: f64 = duration_str.trim().parse().unwrap_or(0.0);
// Extension as format
let format = path.extension()
.and_then(|e| e.to_str())
.unwrap_or("unknown")
.to_string();
Ok(AudioMetadata {
duration,
size,
format,
})
}
#[tauri::command]
fn convert_to_mp3(app: AppHandle, wav_path: String) -> Result<String, String> {
emit_log(&app, "INFO", &format!("Converting to MP3: {}", wav_path));
let mp3_path = wav_path.replace(".wav", ".mp3");
let ffmpeg_cmd = resolve_binary_path("ffmpeg");
let output = Command::new(&ffmpeg_cmd)
.args([
"-i", &wav_path,
"-codec:a", "libmp3lame",
"-b:a", "64k",
"-y", // overwrite
&mp3_path
])
.output()
.map_err(|e| format!("Failed to execute ffmpeg at '{}': {}", ffmpeg_cmd, e))?;
if output.status.success() {
emit_log(&app, "SUCCESS", &format!("MP3 created: {}", mp3_path));
Ok(mp3_path)
} else {
let error = String::from_utf8_lossy(&output.stderr);
emit_log(&app, "ERROR", &format!("MP3 conversion failed: {}", error));
Err(format!("MP3 conversion failed: {}", error))
}
}
#[tauri::command]
fn chunk_audio(app: AppHandle, file_path: String, chunk_minutes: u32) -> Result<Vec<String>, String> {
emit_log(&app, "INFO", &format!("Chunking audio: {} ({}min chunks)", file_path, chunk_minutes));
let chunk_seconds = chunk_minutes * 60;
let ffprobe_cmd = resolve_binary_path("ffprobe");
let ffmpeg_cmd = resolve_binary_path("ffmpeg");
// Get total duration using ffprobe
let duration_output = Command::new(&ffprobe_cmd)
.args([
"-v", "error",
"-show_entries", "format=duration",
"-of", "default=noprint_wrappers=1:nokey=1",
&file_path
])
.output()
.map_err(|e| format!("Failed to get duration with '{}': {}", ffprobe_cmd, e))?;
let duration_str = String::from_utf8_lossy(&duration_output.stdout);
let duration: f64 = duration_str.trim().parse()
.map_err(|_| "Failed to parse duration".to_string())?;
let num_chunks = (duration / chunk_seconds as f64).ceil() as usize;
emit_log(&app, "INFO", &format!("Total duration: {}s, creating {} chunks", duration, num_chunks));
let mut chunk_paths = Vec::new();
let base_path = file_path.replace(".mp3", "");
for i in 0..num_chunks {
let start_time = i as u32 * chunk_seconds;
let chunk_path = format!("{}_chunk_{}.mp3", base_path, i);
let output = Command::new(&ffmpeg_cmd)
.args([
"-i", &file_path,
"-ss", &start_time.to_string(),
"-t", &chunk_seconds.to_string(),
"-c", "copy",
"-y",
&chunk_path
])
.output()
.map_err(|e| format!("Failed to create chunk {} with '{}': {}", i, ffmpeg_cmd, e))?;
if !output.status.success() {
let error = String::from_utf8_lossy(&output.stderr);
return Err(format!("Chunk {} failed: {}", i, error));
}
chunk_paths.push(chunk_path);
}
emit_log(&app, "SUCCESS", &format!("Created {} chunks", chunk_paths.len()));
Ok(chunk_paths)
}
#[tauri::command]
fn read_image_as_base64(app: AppHandle, file_path: String) -> Result<String, String> {
emit_log(&app, "INFO", &format!("Reading image as base64: {}", file_path));
let bytes = std::fs::read(&file_path)
.map_err(|e| format!("Failed to read file: {}", e))?;
// Detect image type from extension
let extension = std::path::Path::new(&file_path)
.extension()
.and_then(|e| e.to_str())
.unwrap_or("png")
.to_lowercase();
let mime_type = match extension.as_str() {
"jpg" | "jpeg" => "image/jpeg",
"png" => "image/png",
"svg" => "image/svg+xml",
"gif" => "image/gif",
_ => "image/png"
};
// Use base64 encoding
let base64_str = base64::prelude::BASE64_STANDARD.encode(&bytes);
let data_url = format!("data:{};base64,{}", mime_type, base64_str);
emit_log(&app, "SUCCESS", &format!("Image converted to base64 ({} bytes)", base64_str.len()));
Ok(data_url)
}
#[tauri::command]
fn open_audio_midi_setup() -> Result<(), String> {
Command::new("open")
.arg("-a")
.arg("Audio MIDI Setup")
.spawn()
.map_err(|e| e.to_string())?;
Ok(())
}
#[tauri::command]
fn create_hearbit_audio_device(app: AppHandle) -> Result<String, String> {
emit_log(&app, "INFO", "Attempting to create Hearbit Audio device...");
// Resolve resource path
let resource_path = app.path().resource_dir()
.map_err(|e| e.to_string())?
.join("resources/create_hearbit_audio.swift");
if !resource_path.exists() {
// Fallback for dev environment where resources might not be bundled yet or different path
emit_log(&app, "WARN", &format!("Resource script not found at {:?}. Trying local src-tauri path.", resource_path));
}
// For now, in dev mode, we might need to point to the source location if bundle isn't active
// But let's try running it.
let output = Command::new("swift")
.arg(resource_path)
.output()
.map_err(|e| e.to_string())?;
let stdout = String::from_utf8_lossy(&output.stdout).to_string();
let stderr = String::from_utf8_lossy(&output.stderr).to_string();
emit_log(&app, "DEBUG", &format!("Script Output: {}", stdout));
if !stderr.is_empty() {
emit_log(&app, "WARN", &format!("Script Stderr: {}", stderr));
}
if output.status.success() {
emit_log(&app, "SUCCESS", "Hearbit Audio device created successfully.");
Ok("Device created successfully".to_string())
} else {
emit_log(&app, "ERROR", "Failed to create device.");
Err(format!("Failed to create device: {} {}", stdout, stderr))
}
}
#[tauri::command]
async fn check_screen_recording_permission() -> bool {
sc_audio::check_permissions().await
}
#[tauri::command]
async fn save_text_file(app: AppHandle, path: String, content: String) -> Result<(), String> {
emit_log(&app, "INFO", &format!("Saving text file to: {}", path));
match std::fs::write(&path, content) {
Ok(_) => {
emit_log(&app, "SUCCESS", "File saved successfully.");
Ok(())
},
Err(e) => {
emit_log(&app, "ERROR", &format!("Failed to save file: {}", e));
Err(e.to_string())
}
}
}
#[tauri::command]
async fn read_log_file(app: AppHandle) -> Result<String, String> {
let log_path = app.path().app_log_dir().map_err(|e| e.to_string())?.join("hearbit-ai.log");
if log_path.exists() {
let content = std::fs::read_to_string(&log_path).map_err(|e| e.to_string())?;
Ok(content)
} else {
Ok("No log file found yet.".to_string())
}
}
#[cfg_attr(mobile, tauri::mobile_entry_point)]
pub fn run() {
tauri::Builder::default()
.setup(|app| {
// Setup Tray Icon
let quit_i = MenuItem::with_id(app, "quit", "Quit Hearbit AI", true, None::<&str>).unwrap();
let show_i = MenuItem::with_id(app, "show", "Show Window", true, None::<&str>).unwrap();
let menu = Menu::with_items(app, &[&show_i, &quit_i]).unwrap();
let _tray = TrayIconBuilder::new()
.icon(app.default_window_icon().unwrap().clone())
.menu(&menu)
.show_menu_on_left_click(true)
.on_menu_event(|app, event| {
match event.id.as_ref() {
"quit" => app.exit(0),
"show" => {
if let Some(window) = app.get_webview_window("main") {
let _ = window.show();
let _ = window.set_focus();
}
}
_ => {}
}
})
.on_tray_icon_event(|tray, event| {
if let TrayIconEvent::Click { .. } = event {
let app = tray.app_handle();
if let Some(window) = app.get_webview_window("main") {
let _ = window.show();
let _ = window.set_focus();
}
}
})
.build(app)?;
Ok(())
})
.on_window_event(|window, event| {
if let WindowEvent::CloseRequested { api, .. } = event {
// Prevent window from closing, just hide it
window.hide().unwrap();
api.prevent_close();
}
})
.plugin(tauri_plugin_shell::init())
.plugin(tauri_plugin_log::Builder::default()
.targets([
tauri_plugin_log::Target::new(tauri_plugin_log::TargetKind::Stdout),
tauri_plugin_log::Target::new(tauri_plugin_log::TargetKind::LogDir { file_name: Some("hearbit-ai.log".to_string()) }),
])
.build())
.plugin(tauri_plugin_opener::init())
.plugin(tauri_plugin_dialog::init())
.plugin(tauri_plugin_fs::init())
.plugin(tauri_plugin_oauth::init())
.manage(AppState {
recording_stream: Mutex::new(None),
recording_file_path: Mutex::new(None),
system_capture: Mutex::new(None),
})
.invoke_handler(tauri::generate_handler![
greet,
get_input_devices,
start_recording,
stop_recording,
pause_recording,
resume_recording,
transcribe_audio,
summarize_text,
get_available_models,
open_audio_midi_setup,
create_hearbit_audio_device,
check_screen_recording_permission,
auth::start_auth_flow,
auth::get_calendar_events,
save_text_file,
read_log_file,
get_audio_metadata,
convert_to_mp3,
chunk_audio,
read_image_as_base64,
email::send_smtp_email
])
.run(tauri::generate_context!())
.expect("error while running tauri application");
}

171
src-tauri/src/sc_audio.rs Normal file
View File

@@ -0,0 +1,171 @@
use cocoa::base::nil;
use objc::{msg_send, sel, sel_impl};
use screencapturekit_sys::{
cm_sample_buffer_ref::CMSampleBufferRef,
content_filter::{UnsafeContentFilter, UnsafeInitParams},
os_types::rc::Id,
shareable_content::{UnsafeSCRunningApplication, UnsafeSCShareableContent},
stream::UnsafeSCStream,
stream_configuration::UnsafeStreamConfiguration,
stream_error_handler::UnsafeSCStreamError,
stream_output_handler::UnsafeSCStreamOutput,
};
pub struct SystemAudioCapture {
stream: Option<Id<UnsafeSCStream>>,
sample_rate: u32,
excluded_apps: Vec<String>,
}
struct AudioOutputWrapper {
callback: Box<dyn Fn(&[f32]) + Send + Sync>,
}
impl UnsafeSCStreamOutput for AudioOutputWrapper {
fn did_output_sample_buffer(&self, sample: Id<CMSampleBufferRef>, of_type: u8) {
if of_type == 1 {
// Audio
let buffers = sample.get_av_audio_buffer_list();
for buffer in buffers {
let data_u8 = buffer.data;
let data_f32: &[f32] = unsafe {
std::slice::from_raw_parts(data_u8.as_ptr() as *const f32, data_u8.len() / 4)
};
(self.callback)(data_f32);
}
}
}
}
struct ErrorHandler;
impl UnsafeSCStreamError for ErrorHandler {
fn handle_error(&self) {
// eprintln!("Stream Error");
}
}
pub async fn check_permissions() -> bool {
UnsafeSCShareableContent::get().is_ok()
}
impl SystemAudioCapture {
pub fn new(sample_rate: u32, excluded_apps: Vec<String>) -> Self {
Self {
stream: None,
sample_rate,
excluded_apps,
}
}
async fn build_filter(&self) -> Result<Id<UnsafeContentFilter>, String> {
let content =
UnsafeSCShareableContent::get().map_err(|_| format!("Failed to get content"))?;
let displays = content.displays();
let display = displays.first().ok_or("No display found")?;
if self.excluded_apps.is_empty() {
return Ok(UnsafeContentFilter::init(UnsafeInitParams::Display(
display.clone(),
)));
}
let mut apps_to_exclude = Vec::new();
let all_apps = content.applications();
// Prepare lowercase excluded list for case-insensitive matching
let excluded_lower: Vec<String> = self
.excluded_apps
.iter()
.map(|s| s.to_lowercase())
.collect();
for app in all_apps {
if let Some(bid) = app.get_bundle_identifier() {
let bid_lower = bid.to_lowercase();
// Smart match: check if the running app's ID starts with any blocked ID
// e.g., "com.apple.Safari.WebContent" starts with "com.apple.Safari"
if excluded_lower
.iter()
.any(|excluded| bid_lower.starts_with(excluded))
{
apps_to_exclude.push(app);
}
}
}
// NOTE: In screencapturekit-sys 0.2.8, DisplayIncludingApplicationsExceptingWindows
// actually calls initWithDisplay:excludingApplications: (it's a bug in the crate)
let filter_init = UnsafeInitParams::DisplayIncludingApplicationsExceptingWindows(
display.clone(),
apps_to_exclude,
Vec::new(),
);
Ok(UnsafeContentFilter::init(filter_init))
}
pub async fn start<F>(&mut self, callback: F) -> Result<(), String>
where
F: Fn(&[f32]) + Send + Sync + 'static,
{
let filter = self.build_filter().await?;
let mut config = UnsafeStreamConfiguration::default();
config.width = 100;
config.height = 100;
config.captures_audio = 1;
config.sample_rate = self.sample_rate;
config.channel_count = 2;
config.excludes_current_process_audio = 1;
let output_wrapper = AudioOutputWrapper {
callback: Box::new(callback),
};
let stream = UnsafeSCStream::init(filter, config.into(), ErrorHandler);
stream.add_stream_output(output_wrapper, 1); // 1 = Audio
stream
.start_capture()
.map_err(|_| "Failed to start capture".to_string())?;
self.stream = Some(stream);
Ok(())
}
pub fn stop(&mut self) {
if let Some(stream) = &self.stream {
let _ = stream.stop_capture();
}
self.stream = None;
}
pub async fn refresh_filter(&mut self) -> Result<(), String> {
let stream = match &self.stream {
Some(s) => s,
None => return Ok(()),
};
let filter = self.build_filter().await?;
// Call updateContentFilter:completionHandler:
// screencapturekit-sys 0.2.8 does not have this method exposed yet in UnsafeSCStream.
// We use msg_send! to call it directly on the underlying object.
unsafe {
// Get raw pointer to the underlying Objective-C objects.
// Since we don't have easy access to the inner pointer of Id<T> via methods,
// we cast the pointer to the Id wrapper itself to a pointer to a pointer.
// This assumes Id<T> is a transparent wrapper around a pointer.
let stream_ptr =
*(stream as *const Id<UnsafeSCStream> as *const *mut objc::runtime::Object);
let filter_ptr =
*(&filter as *const Id<UnsafeContentFilter> as *const *mut objc::runtime::Object);
let _: () =
msg_send![stream_ptr, updateContentFilter: filter_ptr completionHandler: nil];
}
Ok(())
}
}

View File

@@ -0,0 +1,103 @@
use screencapturekit_sys::{
os_types::rc::Id,
shareable_content::UnsafeSCShareableContent,
content_filter::{UnsafeContentFilter, UnsafeInitParams},
stream_configuration::UnsafeStreamConfiguration,
stream::UnsafeSCStream,
stream_error_handler::UnsafeSCStreamError,
stream_output_handler::UnsafeSCStreamOutput,
cm_sample_buffer_ref::CMSampleBufferRef,
};
pub struct SystemAudioCapture {
stream: Option<Id<UnsafeSCStream>>,
sample_rate: u32,
}
struct AudioOutputWrapper {
callback: Box<dyn Fn(&[f32]) + Send + Sync>,
}
impl UnsafeSCStreamOutput for AudioOutputWrapper {
fn did_output_sample_buffer(&self, sample: Id<CMSampleBufferRef>, of_type: u8) {
if of_type == 1 { // Audio
let buffers = sample.get_av_audio_buffer_list();
for buffer in buffers {
// Buffer data is u8, we usually get F32 from SCK if configured.
// Assuming f32 (Floating Point) based on our config.
// We need to convert [u8] to [f32].
let data_u8 = buffer.data;
let data_f32: &[f32] = unsafe {
std::slice::from_raw_parts(
data_u8.as_ptr() as *const f32,
data_u8.len() / 4,
)
};
(self.callback)(data_f32);
}
}
}
}
struct ErrorHandler;
impl UnsafeSCStreamError for ErrorHandler {
fn handle_error(&self) {
// eprintln!("Stream Error");
}
}
pub async fn check_permissions() -> bool {
UnsafeSCShareableContent::get().is_ok()
}
impl SystemAudioCapture {
pub fn new(sample_rate: u32) -> Self {
Self { stream: None, sample_rate }
}
pub async fn start<F>(&mut self, callback: F) -> Result<(), String>
where F: Fn(&[f32]) + Send + Sync + 'static {
let content = UnsafeSCShareableContent::get().map_err(|e| format!("Failed to get content"))?;
let displays = content.displays();
let display = displays.first().ok_or("No display found")?;
let filter_init = UnsafeInitParams::Display(display.clone());
let filter = UnsafeContentFilter::init(filter_init);
// Wait, 'pixel_format' is OSType. b"BGRA" is &[u8;4].
// FourCharCode::from_chars exists in crate::os_types::four_char_code but we didn't import it.
// Actually, we can just use the Default and overwrite fields.
// But better: use Default and only set what we need.
let mut config = UnsafeStreamConfiguration::default();
config.width = 100;
config.height = 100;
config.captures_audio = 1;
config.sample_rate = self.sample_rate;
config.channel_count = 2;
config.excludes_current_process_audio = 0;
let output_wrapper = AudioOutputWrapper {
callback: Box::new(callback),
};
// Convert config to Id<UnsafeStreamConfigurationRef> using Into
let stream = UnsafeSCStream::init(filter, config.into(), ErrorHandler);
stream.add_stream_output(output_wrapper, 1); // 1 = Audio
stream.start_capture().map_err(|e| "Failed to start capture".to_string())?;
self.stream = Some(stream);
Ok(())
}
pub fn stop(&mut self) {
if let Some(stream) = &self.stream {
stream.stop_capture();
}
self.stream = None;
}
}

View File

@@ -1,7 +1,7 @@
{ {
"$schema": "https://schema.tauri.app/config/2", "$schema": "https://schema.tauri.app/config/2",
"productName": "Hearbit AI", "productName": "Hearbit AI",
"version": "1.1.1", "version": "1.2.3",
"identifier": "com.hearbit-ai.desktop", "identifier": "com.hearbit-ai.desktop",
"build": { "build": {
"beforeDevCommand": "npm run dev", "beforeDevCommand": "npm run dev",

View File

@@ -1,4 +1,5 @@
import { useState } from 'react'; import { useState, useEffect, useCallback } from 'react';
import { invoke } from '@tauri-apps/api/core';
import { Settings as SettingsIcon } from "lucide-react"; import { Settings as SettingsIcon } from "lucide-react";
import Settings, { SmtpConfig, AzureConfig } from "./components/Settings"; import Settings, { SmtpConfig, AzureConfig } from "./components/Settings";
import Recorder from "./components/Recorder"; import Recorder from "./components/Recorder";
@@ -60,6 +61,15 @@ function App() {
return localStorage.getItem('hearbit_selected_model') || 'mixtral'; return localStorage.getItem('hearbit_selected_model') || 'mixtral';
}); });
// Daily Backup State
const [dailyBackupEnabled, setDailyBackupEnabled] = useState(() => localStorage.getItem('hearbit_daily_backup_enabled') === 'true');
const [dailyBackupPath, setDailyBackupPath] = useState(() => localStorage.getItem('hearbit_daily_backup_path') || '');
const [lastBackupDate, setLastBackupDate] = useState(() => localStorage.getItem('hearbit_last_backup_date') || '');
const [blockedApps, setBlockedApps] = useState<string[]>(() => {
const saved = localStorage.getItem('hearbit_blocked_apps');
return saved ? JSON.parse(saved) : [];
});
const handleModelChange = (model: string) => { const handleModelChange = (model: string) => {
setSelectedModel(model); setSelectedModel(model);
localStorage.setItem('hearbit_selected_model', model); localStorage.setItem('hearbit_selected_model', model);
@@ -234,7 +244,10 @@ Thanks!`
newSavePath: string, newSavePath: string,
newSmtp: SmtpConfig, newSmtp: SmtpConfig,
newAzure: AzureConfig, newAzure: AzureConfig,
newEmailTemplates: EmailTemplate[] newEmailTemplates: EmailTemplate[],
newDailyBackupEnabled: boolean,
newDailyBackupPath: string,
newBlockedApps: string[]
) => { ) => {
setApiKey(newApiKey); setApiKey(newApiKey);
setProductId(newProductId); setProductId(newProductId);
@@ -244,14 +257,22 @@ Thanks!`
setAzureConfig(newAzure); setAzureConfig(newAzure);
setEmailTemplates(newEmailTemplates); setEmailTemplates(newEmailTemplates);
localStorage.setItem('infomaniak_api_key', newApiKey); setDailyBackupEnabled(newDailyBackupEnabled);
localStorage.setItem('infomaniak_product_id', newProductId); setDailyBackupPath(newDailyBackupPath);
localStorage.setItem('infomaniak_prompts', JSON.stringify(newPrompts)); setBlockedApps(newBlockedApps);
localStorage.setItem('infomaniak_save_path', newSavePath);
localStorage.setItem('hearbit_api_key', newApiKey);
localStorage.setItem('hearbit_product_id', newProductId);
localStorage.setItem('hearbit_prompts', JSON.stringify(newPrompts));
localStorage.setItem('hearbit_save_path', newSavePath);
localStorage.setItem('hearbit_smtp_config', JSON.stringify(newSmtp)); localStorage.setItem('hearbit_smtp_config', JSON.stringify(newSmtp));
localStorage.setItem('hearbit_azure_config', JSON.stringify(newAzure)); localStorage.setItem('hearbit_azure_config', JSON.stringify(newAzure));
localStorage.setItem('hearbit_email_templates', JSON.stringify(newEmailTemplates)); localStorage.setItem('hearbit_email_templates', JSON.stringify(newEmailTemplates));
localStorage.setItem('hearbit_daily_backup_enabled', String(newDailyBackupEnabled));
localStorage.setItem('hearbit_daily_backup_path', newDailyBackupPath);
localStorage.setItem('hearbit_blocked_apps', JSON.stringify(newBlockedApps));
setView(lastTab); setView(lastTab);
}; };
@@ -332,6 +353,80 @@ Thanks!`
setView('transcription'); // Switch to Transcription view to see content setView('transcription'); // Switch to Transcription view to see content
}; };
const performBackup = useCallback(async (isAuto = false) => {
try {
if (isAuto && !dailyBackupEnabled) return;
const dataToBackup = {
apiKey,
productId,
prompts,
savePath,
smtp: smtpConfig,
azure: azureConfig,
emailTemplates,
history, // Including history!
// Also include daily backup settings so they persist on restore
dailyBackup: {
enabled: dailyBackupEnabled,
path: dailyBackupPath,
}
};
// Always save as JSON (no encryption)
const content = JSON.stringify(dataToBackup, null, 2);
const dateStr = new Date().toISOString().slice(0, 10);
const fileName = `hearbit_backup_${isAuto ? 'auto_' : ''}${dateStr}.json`;
// Determine path: use specific daily backup path, or general savePath
const targetDir = (isAuto ? dailyBackupPath : savePath) || savePath;
if (!targetDir) {
if (!isAuto) addToast('No backup path configured.', 'error');
return;
}
const fullPath = `${targetDir}/${fileName}`;
await invoke('save_text_file', { path: fullPath, content });
if (isAuto) {
const now = new Date().toISOString();
setLastBackupDate(now);
localStorage.setItem('hearbit_last_backup_date', now);
console.log("Auto-backup completed:", fullPath);
} else {
addToast(`Backup saved to ${fullPath}`, 'success');
}
} catch (e) {
console.error("Backup failed:", e);
if (!isAuto) addToast(`Backup failed: ${e}`, 'error');
}
}, [apiKey, productId, prompts, savePath, smtpConfig, azureConfig, emailTemplates, history, dailyBackupEnabled, dailyBackupPath]);
// Check for Daily Backup on Mount / State Change
useEffect(() => {
if (!dailyBackupEnabled) return;
const check = async () => {
const today = new Date().toISOString().slice(0, 10);
const last = lastBackupDate ? lastBackupDate.slice(0, 10) : '';
if (last !== today) {
// Perform backup
await performBackup(true);
}
};
const timer = setTimeout(() => {
check();
}, 5000); // Check 5s after load to allow state to settle
return () => clearTimeout(timer);
}, [dailyBackupEnabled, lastBackupDate, performBackup]);
return ( return (
@@ -388,6 +483,7 @@ Thanks!`
selectedModel={selectedModel} selectedModel={selectedModel}
onModelChange={handleModelChange} onModelChange={handleModelChange}
isVisible={view === 'recorder'} isVisible={view === 'recorder'}
blockedApps={blockedApps}
/> />
</div> </div>
@@ -474,6 +570,19 @@ Thanks!`
smtpConfig={smtpConfig} smtpConfig={smtpConfig}
azureConfig={azureConfig} azureConfig={azureConfig}
emailTemplates={emailTemplates} emailTemplates={emailTemplates}
// Pass new backup props
dailyBackupEnabled={dailyBackupEnabled}
dailyBackupPath={dailyBackupPath}
lastBackupDate={lastBackupDate}
// Pass history and update callback
history={history}
onHistoryUpdate={(newHistory) => {
setHistory(newHistory);
localStorage.setItem('infomaniak_history', JSON.stringify(newHistory));
}}
blockedApps={blockedApps}
/> />
)} )}
</div> </div>

View File

@@ -43,6 +43,7 @@ interface RecorderProps {
selectedModel: string; selectedModel: string;
onModelChange: (model: string) => void; onModelChange: (model: string) => void;
isVisible: boolean; isVisible: boolean;
blockedApps: string[];
} }
interface AudioDevice { interface AudioDevice {
@@ -54,15 +55,15 @@ const Recorder: React.FC<RecorderProps> = ({
apiKey, productId, prompts, apiKey, productId, prompts,
setTranscription, setSummary, setTranscription, setSummary,
onSaveToHistory, savePath, onRecordingComplete, onSaveToHistory, savePath, onRecordingComplete,
onOpenSettings, addToast, selectedModel, onModelChange, ...props onOpenSettings, addToast, selectedModel, onModelChange, blockedApps, ...props
}) => { }) => {
const [isRecording, setIsRecording] = useState(false); const [isRecording, setIsRecording] = useState(false);
const [isStopping, setIsStopping] = useState(false); // New lock state const [isStopping, setIsStopping] = useState(false); // New lock state
const [isPaused, setIsPaused] = useState(false); const [isPaused, setIsPaused] = useState(false);
const [isWaiting, setIsWaiting] = useState(false); // New state for Auto-Start const [isWaiting, setIsWaiting] = useState(false); // New state for Auto-Start
const [hasSpeechDetected, setHasSpeechDetected] = useState(false); // New tracking state
const [autoStartEnabled, setAutoStartEnabled] = useState(false); // Toggle state const [autoStartEnabled, setAutoStartEnabled] = useState(false); // Toggle state
const [status, setStatus] = useState<string>('Ready to record'); const [status, setStatus] = useState<string>('Ready to record');
const [selectedDevice, setSelectedDevice] = useState<string>(''); const [selectedDevice, setSelectedDevice] = useState<string>('');
const [selectedPromptId, setSelectedPromptId] = useState<string>(''); const [selectedPromptId, setSelectedPromptId] = useState<string>('');
@@ -73,11 +74,8 @@ const Recorder: React.FC<RecorderProps> = ({
const [lastSpeechTime, setLastSpeechTime] = useState<number>(Date.now()); const [lastSpeechTime, setLastSpeechTime] = useState<number>(Date.now());
const [silenceDuration, setSilenceDuration] = useState(0); const [silenceDuration, setSilenceDuration] = useState(0);
// Filtered devices based on mode // Show all devices for both modes now (System Audio is captured natively)
const filteredDevices = devices.filter(d => { const filteredDevices = devices;
const isVirtual = d.name.toLowerCase().includes('hearbit') || d.name.toLowerCase().includes('blackhole');
return recordingMode === 'meeting' ? isVirtual : !isVirtual;
});
useEffect(() => { useEffect(() => {
loadDevices(); loadDevices();
@@ -126,15 +124,18 @@ const Recorder: React.FC<RecorderProps> = ({
const aggregateDev = aliasedDevs.find(d => d.name === 'Hearbit Audio'); const aggregateDev = aliasedDevs.find(d => d.name === 'Hearbit Audio');
const virtualDev = aliasedDevs.find(d => d.name.includes('Hearbit Virtual')); const virtualDev = aliasedDevs.find(d => d.name.includes('Hearbit Virtual'));
if (recordingMode === 'meeting') {
if (aggregateDev) { if (aggregateDev) {
setRecordingMode('meeting');
setSelectedDevice(aggregateDev.id); setSelectedDevice(aggregateDev.id);
} else if (virtualDev) { } else if (virtualDev) {
setRecordingMode('meeting');
setSelectedDevice(virtualDev.id); setSelectedDevice(virtualDev.id);
} else { } else if (aliasedDevs.length > 0) {
setRecordingMode('voice'); setSelectedDevice(aliasedDevs[0].id);
if (aliasedDevs.length > 0) setSelectedDevice(aliasedDevs[0].id); }
} else if (aliasedDevs.length > 0) {
// Voice mode: just pick first non-virtual if possible, otherwise first
const physicalMic = aliasedDevs.find(d => !d.name.includes('Hearbit') && !d.name.includes('BlackHole'));
setSelectedDevice(physicalMic ? physicalMic.id : aliasedDevs[0].id);
} }
} }
} catch (e) { } catch (e) {
@@ -163,13 +164,16 @@ const Recorder: React.FC<RecorderProps> = ({
deviceId: targetDeviceId, deviceId: targetDeviceId,
savePath: savePath || null, savePath: savePath || null,
customFilename: props.recordingSubject || null, customFilename: props.recordingSubject || null,
waitForSpeech: autoStartEnabled // Pass the toggle state waitForSpeech: autoStartEnabled, // Pass the toggle state
mode: recordingMode,
excludedApps: blockedApps
}); });
setIsRecording(true); setIsRecording(true);
setIsPaused(false); setIsPaused(false);
setTranscription(''); setTranscription('');
setSummary(''); setSummary('');
setHasSpeechDetected(false); // Reset check for new session
if (autoStartEnabled) { if (autoStartEnabled) {
setIsWaiting(true); setIsWaiting(true);
@@ -189,9 +193,10 @@ const Recorder: React.FC<RecorderProps> = ({
} }
}; };
// Refs for interval access to avoid dependency cycles // Refs for interval access to avoid dependency cycles and stale closures
const lastSpeechTimeRef = useRef<number>(Date.now()); const lastSpeechTimeRef = useRef<number>(Date.now());
const isStoppingRef = useRef(false); const isStoppingRef = useRef(false);
const autoStartEnabledRef = useRef(autoStartEnabled);
// Update refs when state changes // Update refs when state changes
useEffect(() => { useEffect(() => {
@@ -202,6 +207,10 @@ const Recorder: React.FC<RecorderProps> = ({
isStoppingRef.current = isStopping; isStoppingRef.current = isStopping;
}, [isStopping]); }, [isStopping]);
useEffect(() => {
autoStartEnabledRef.current = autoStartEnabled;
}, [autoStartEnabled]);
// 1. Event Listeners Effect (Run ONCE when recording starts) // 1. Event Listeners Effect (Run ONCE when recording starts)
useEffect(() => { useEffect(() => {
let unlistenVAD: () => void; let unlistenVAD: () => void;
@@ -215,15 +224,16 @@ const Recorder: React.FC<RecorderProps> = ({
unlistenVAD = await listen<{ is_speech: boolean, probability: number }>('vad-event', (event) => { unlistenVAD = await listen<{ is_speech: boolean, probability: number }>('vad-event', (event) => {
if (event.payload.is_speech) { if (event.payload.is_speech) {
setLastSpeechTime(Date.now()); setLastSpeechTime(Date.now());
lastSpeechTimeRef.current = Date.now(); // Update ref immediately lastSpeechTimeRef.current = Date.now();
setSilenceDuration(0); setSilenceDuration(0);
setHasSpeechDetected(true); // Track positive speech
} }
}); });
// Auto-Start Trigger Listener // Auto-Start Trigger Listener
unlistenTrigger = await listen('auto-recording-triggered', () => { unlistenTrigger = await listen('auto-recording-triggered', () => {
console.log("Auto-Start Triggered from Backend!"); console.log("Auto-Start Triggered from Backend!");
// Only trigger if we are actually waiting setHasSpeechDetected(true); // Trigger counts as speech
setIsWaiting((prev) => { setIsWaiting((prev) => {
if (prev) { if (prev) {
addToast("Audio detected! Recording started.", 'success', 4000); addToast("Audio detected! Recording started.", 'success', 4000);
@@ -264,7 +274,7 @@ const Recorder: React.FC<RecorderProps> = ({
// AUTO STOP Logic // AUTO STOP Logic
// Use Ref to get LATEST visibility instantly // Use Ref to get LATEST visibility instantly
if (isVisibleRef.current && timeSinceSpeech > 20 && !isStoppingRef.current) { if (isVisibleRef.current && timeSinceSpeech > 25 && !isStoppingRef.current) {
console.log("Auto-stopping due to silence..."); console.log("Auto-stopping due to silence...");
isStoppingRef.current = true; isStoppingRef.current = true;
addToast('Auto-stopped due to silence', 'info'); addToast('Auto-stopped due to silence', 'info');
@@ -341,9 +351,24 @@ const Recorder: React.FC<RecorderProps> = ({
setIsRecording(false); setIsRecording(false);
setIsPaused(false); setIsPaused(false);
setIsWaiting(false); // Reset waiting state setIsWaiting(false); // Reset waiting state
setTranscription('');
setSummary('');
setHasSpeechDetected(false); // Reset checkiting state
setStatus('Saving recording...'); setStatus('Saving recording...');
const filePath = await invoke<string>('stop_recording'); const filePath = await invoke<string>('stop_recording');
// NEW: Check if speech was actually detected during the session
// If we recorded 20s of silence (Auto-Stop), we shouldn't transcribe.
// IMPORTANT: Check applies to BOTH 'voice' and 'meeting' modes to prevent "Batch Null" errors on false triggers.
if (!hasSpeechDetected) {
console.log("No speech detected during recording. Skipping transcription.");
addToast("Recording discarded (No speech/audio detected)", 'info');
setStatus('Ready to record'); // Reset status text
// If auto-start is on, we just loop back (in finally block).
// But we skip the expensive/failing API call.
} else {
// Wait a moment for file flush (safety) // Wait a moment for file flush (safety)
await new Promise(r => setTimeout(r, 500)); await new Promise(r => setTimeout(r, 500));
@@ -407,8 +432,9 @@ const Recorder: React.FC<RecorderProps> = ({
setStatus('Done (No speech detected)'); setStatus('Done (No speech detected)');
setTranscription('(No speech detected. Check your microphone settings.)'); setTranscription('(No speech detected. Check your microphone settings.)');
setTimeout(() => setStatus('Ready to record'), 3000); setTimeout(() => setStatus('Ready to record'), 3000);
return; // allow finally block to restart loop
} } else {
// Logic continues...
// Find selected prompt content - SMART SELECTION // Find selected prompt content - SMART SELECTION
let activePrompt = prompts.find(p => p.id === selectedPromptId); let activePrompt = prompts.find(p => p.id === selectedPromptId);
@@ -439,8 +465,6 @@ const Recorder: React.FC<RecorderProps> = ({
setStatus(`Smart Select: Using "${newPrompt.name}"...`); setStatus(`Smart Select: Using "${newPrompt.name}"...`);
addToast(`Smart Select: Switched to "${newPrompt.name}"`, 'success', 4000); addToast(`Smart Select: Switched to "${newPrompt.name}"`, 'success', 4000);
activePrompt = newPrompt; activePrompt = newPrompt;
// Optional: Update UI selection? setSelectedPromptId(bestMatchId);
// Let's verify with user preference? For now, we override as "Magic".
} }
} }
@@ -463,12 +487,27 @@ const Recorder: React.FC<RecorderProps> = ({
addToast('Transcription & Summary complete!', 'success', 4000); addToast('Transcription & Summary complete!', 'success', 4000);
onRecordingComplete(); // Auto-switch tab onRecordingComplete(); // Auto-switch tab
setTimeout(() => setStatus('Ready to record'), 3000); setTimeout(() => setStatus('Ready to record'), 3000);
}
}
} catch (e) { } catch (e) {
console.error(e); console.error(e);
setStatus(`Error: ${e}`); setStatus(`Error: ${e}`);
addToast(`Error processing: ${e}`, 'error'); addToast(`Error processing: ${e}`, 'error');
} finally { } finally {
setIsStopping(false); setIsStopping(false);
// AUTO-RESTART LOGIC
// Use REF to get the latest state (fix for "starts again even if I uncheck")
if (autoStartEnabledRef.current) {
console.log("Auto-Start enabled: Restarting listener loop...");
// Short delay to ensure backend cleanup
setTimeout(() => {
// Double check ref before restarting
if (autoStartEnabledRef.current) {
startRecording();
}
}, 1000);
}
} }
}; };
@@ -634,12 +673,20 @@ const Recorder: React.FC<RecorderProps> = ({
</div> </div>
<div className="flex flex-col gap-2 mt-2 w-full"> <div className="flex flex-col gap-2 mt-2 w-full">
{recordingMode === 'meeting' && filteredDevices.length === 0 && ( {recordingMode === 'meeting' && (
<button <button
onClick={onOpenSettings} onClick={async () => {
const allowed = await invoke<boolean>('check_screen_recording_permission');
if (allowed) {
addToast('System Audio Permission: GRANTED ✅', 'success');
} else {
addToast('System Audio Permission: MISSING ❌. Please enable in System Settings -> Privacy -> Screen Recording', 'error', 5000);
// Open Settings?
}
}}
className="text-xs bg-primary/10 text-primary hover:bg-primary/20 w-full text-center border border-primary/20 rounded p-2 mb-2 font-semibold" className="text-xs bg-primary/10 text-primary hover:bg-primary/20 w-full text-center border border-primary/20 rounded p-2 mb-2 font-semibold"
> >
🪄 Create "Hearbit Audio" Device 🔒 Check Audio Permission
</button> </button>
)} )}
<button <button

View File

@@ -1,9 +1,7 @@
import React, { useState, useEffect } from 'react'; import React, { useState, useEffect } from 'react';
import { Save, FolderOpen, Lock, Upload, Download, Eye, EyeOff, Mail, FileText, ScrollText } from 'lucide-react'; import { Save, FolderOpen, Lock, Upload, Download, Mail, FileText, ScrollText, Headphones } from 'lucide-react';
import { save, open } from '@tauri-apps/plugin-dialog'; import { save, open } from '@tauri-apps/plugin-dialog';
// Removed writeTextFile as we use invoke 'save_text_file'
import { invoke } from '@tauri-apps/api/core'; import { invoke } from '@tauri-apps/api/core';
import { encryptData, decryptData } from '../utils/backup';
import EmailTemplateEditor from './EmailTemplateEditor'; import EmailTemplateEditor from './EmailTemplateEditor';
import logo from '../assets/logo.png'; import logo from '../assets/logo.png';
@@ -17,6 +15,10 @@ interface SettingsProps {
emailTemplates: EmailTemplate[]; emailTemplates: EmailTemplate[];
smtpConfig: SmtpConfig; smtpConfig: SmtpConfig;
azureConfig: AzureConfig; azureConfig: AzureConfig;
dailyBackupEnabled: boolean;
dailyBackupPath: string;
lastBackupDate: string;
history: any[];
onSave: ( onSave: (
apiKey: string, apiKey: string,
productId: string, productId: string,
@@ -24,9 +26,14 @@ interface SettingsProps {
savePath: string, savePath: string,
smtp: SmtpConfig, smtp: SmtpConfig,
azure: AzureConfig, azure: AzureConfig,
emailTemplates: EmailTemplate[] emailTemplates: EmailTemplate[],
dailyBackupEnabled: boolean,
dailyBackupPath: string,
blockedApps: string[]
) => void; ) => void;
onHistoryUpdate: (history: any[]) => void;
onClose: () => void; onClose: () => void;
blockedApps: string[];
} }
export interface SmtpConfig { export interface SmtpConfig {
@@ -43,6 +50,30 @@ export interface AzureConfig {
tenantId: string; tenantId: string;
} }
const APP_PRESETS = [
{
category: 'Browsers',
apps: [
{ name: 'Safari', id: 'com.apple.Safari' },
{ name: 'Chrome', id: 'com.google.Chrome' },
{ name: 'Firefox', id: 'org.mozilla.firefox' },
{ name: 'Arc', id: 'company.thebrowser.Browser' },
{ name: 'Edge', id: 'com.microsoft.edgemac' },
{ name: 'Brave', id: 'com.brave.Browser' },
]
},
{
category: 'Music & Media',
apps: [
{ name: 'Apple Music', id: 'com.apple.Music' },
{ name: 'Spotify', id: 'com.spotify.client' },
{ name: 'Tidal', id: 'com.aspiro.Tidal' },
{ name: 'VLC', id: 'org.videolan.vlc' },
{ name: 'Podcasts', id: 'com.apple.podcasts' },
]
}
];
const Settings: React.FC<SettingsProps> = ({ apiKey, productId, prompts, savePath, onSave, onClose, ...props }) => { const Settings: React.FC<SettingsProps> = ({ apiKey, productId, prompts, savePath, onSave, onClose, ...props }) => {
const [localApiKey, setLocalApiKey] = useState(apiKey); const [localApiKey, setLocalApiKey] = useState(apiKey);
const [localProductId, setLocalProductId] = useState(productId); const [localProductId, setLocalProductId] = useState(productId);
@@ -52,27 +83,36 @@ const Settings: React.FC<SettingsProps> = ({ apiKey, productId, prompts, savePat
const [localEmailTemplates, setLocalEmailTemplates] = useState<EmailTemplate[]>(props.emailTemplates); // New state const [localEmailTemplates, setLocalEmailTemplates] = useState<EmailTemplate[]>(props.emailTemplates); // New state
const [localSmtp, setLocalSmtp] = useState<SmtpConfig>(props.smtpConfig); const [localSmtp, setLocalSmtp] = useState<SmtpConfig>(props.smtpConfig);
const [localAzure, setLocalAzure] = useState<AzureConfig>(props.azureConfig); const [localAzure, setLocalAzure] = useState<AzureConfig>(props.azureConfig);
const [localDailyBackupEnabled, setLocalDailyBackupEnabled] = useState(props.dailyBackupEnabled);
const [localDailyBackupPath, setLocalDailyBackupPath] = useState(props.dailyBackupPath);
const [localBlockedApps, setLocalBlockedApps] = useState<string[]>(props.blockedApps);
const [runningApps, setRunningApps] = useState<{ name: string, bundle_id: string }[]>([]);
const [statusIdx, setStatusIdx] = useState<string | null>(null); const [statusIdx, setStatusIdx] = useState<string | null>(null);
// Backup & Restore State
const [backupPassword, setBackupPassword] = useState('');
const [showPassword, setShowPassword] = useState(false);
const [isImportModalOpen, setIsImportModalOpen] = useState(false);
const [importFileContent, setImportFileContent] = useState<string | null>(null);
// Email Template Editor State // Email Template Editor State
const [editingTemplate, setEditingTemplate] = useState<EmailTemplate | null>(null); const [editingTemplate, setEditingTemplate] = useState<EmailTemplate | null>(null);
const [isEmailEditorOpen, setIsEmailEditorOpen] = useState(false); const [isEmailEditorOpen, setIsEmailEditorOpen] = useState(false);
const [activeTab, setActiveTab] = useState<'general' | 'prompts' | 'email' | 'backup' | 'logs'>('general'); const [activeTab, setActiveTab] = useState<'general' | 'prompts' | 'email' | 'backup' | 'logs' | 'apps'>('general');
const [logs, setLogs] = useState<string>('Loading logs...'); const [logs, setLogs] = useState<string>('Loading logs...');
useEffect(() => { useEffect(() => {
if (activeTab === 'logs') { if (activeTab === 'logs') {
loadLogs(); loadLogs();
} else if (activeTab === 'general') {
loadRunningApps();
} }
}, [activeTab]); }, [activeTab]);
const loadRunningApps = async () => {
try {
const apps = await invoke<{ name: string, bundle_id: string }[]>('get_running_apps');
setRunningApps(apps);
} catch (e) {
console.error("Failed to load running apps:", e);
}
};
const loadLogs = async () => { const loadLogs = async () => {
try { try {
const content = await invoke<string>('read_log_file'); const content = await invoke<string>('read_log_file');
@@ -133,7 +173,18 @@ const Settings: React.FC<SettingsProps> = ({ apiKey, productId, prompts, savePat
}; };
const handleSave = () => { const handleSave = () => {
onSave(localApiKey, localProductId, localPrompts, localSavePath, localSmtp, localAzure, localEmailTemplates); onSave(
localApiKey,
localProductId,
localPrompts,
localSavePath,
localSmtp,
localAzure,
localEmailTemplates,
localDailyBackupEnabled,
localDailyBackupPath,
localBlockedApps
);
onClose(); onClose();
}; };
@@ -154,10 +205,6 @@ const Settings: React.FC<SettingsProps> = ({ apiKey, productId, prompts, savePat
}; };
const handleExport = async () => { const handleExport = async () => {
if (!backupPassword) {
setStatusIdx('Error: Password required to encrypt backup.');
return;
}
try { try {
const data = { const data = {
apiKey: localApiKey, apiKey: localApiKey,
@@ -165,21 +212,28 @@ const Settings: React.FC<SettingsProps> = ({ apiKey, productId, prompts, savePat
prompts: localPrompts, prompts: localPrompts,
savePath: localSavePath, savePath: localSavePath,
smtp: localSmtp, smtp: localSmtp,
azure: localAzure azure: localAzure,
emailTemplates: localEmailTemplates,
history: props.history,
dailyBackup: {
enabled: localDailyBackupEnabled,
path: localDailyBackupPath,
}
}; };
const encrypted = await encryptData(data, backupPassword);
// Always save as JSON (no encryption)
const content = JSON.stringify(data, null, 2);
const filePath = await save({ const filePath = await save({
defaultPath: `hearbit_backup_${new Date().toISOString().slice(0, 10)}.conf`, defaultPath: `hearbit_backup_${new Date().toISOString().slice(0, 10)}.json`,
filters: [{ filters: [{
name: 'Hearbit Config', name: 'Hearbit Config',
extensions: ['conf'] extensions: ['json']
}] }]
}); });
if (filePath) { if (filePath) {
// Use backend invoke to write file (bypasses fs scope issues) await invoke('save_text_file', { path: filePath, content });
await invoke('save_text_file', { path: filePath, content: encrypted });
setStatusIdx(`Configuration exported to: ${filePath}`); setStatusIdx(`Configuration exported to: ${filePath}`);
} }
} catch (e) { } catch (e) {
@@ -199,41 +253,40 @@ const Settings: React.FC<SettingsProps> = ({ apiKey, productId, prompts, savePat
const reader = new FileReader(); const reader = new FileReader();
reader.onload = (event) => { reader.onload = (event) => {
if (event.target?.result) { if (event.target?.result) {
setImportFileContent(event.target.result as string); const content = event.target.result as string;
setIsImportModalOpen(true); // Directly import without password modal since we don't use encryption
setBackupPassword(''); try {
const data = JSON.parse(content);
if (data.apiKey) setLocalApiKey(data.apiKey);
if (data.productId) setLocalProductId(data.productId);
if (data.prompts) setLocalPrompts(data.prompts);
if (data.emailTemplates) setLocalEmailTemplates(data.emailTemplates);
if (data.savePath) setLocalSavePath(data.savePath);
if (data.smtp) setLocalSmtp(data.smtp);
if (data.azure) setLocalAzure(data.azure);
if (data.dailyBackup) {
if (data.dailyBackup.enabled !== undefined) setLocalDailyBackupEnabled(data.dailyBackup.enabled);
if (data.dailyBackup.path) setLocalDailyBackupPath(data.dailyBackup.path);
}
if (data.blockedApps) setLocalBlockedApps(data.blockedApps);
// Import history!
if (data.history && Array.isArray(data.history)) {
props.onHistoryUpdate(data.history);
}
setStatusIdx('Configuration imported! Click Save to apply.');
} catch (error) {
console.error(error);
setStatusIdx(`Import failed: ${error}`);
}
} }
}; };
reader.readAsText(file); reader.readAsText(file);
e.target.value = ''; e.target.value = '';
}; };
const confirmImport = async () => {
if (!backupPassword) {
setStatusIdx('Error: Password required to decrypt.');
return;
}
if (!importFileContent) return;
try {
const data = await decryptData(importFileContent, backupPassword);
if (data.apiKey) setLocalApiKey(data.apiKey);
if (data.productId) setLocalProductId(data.productId);
if (data.prompts) setLocalPrompts(data.prompts);
if (data.emailTemplates) setLocalEmailTemplates(data.emailTemplates);
if (data.savePath) setLocalSavePath(data.savePath);
if (data.smtp) setLocalSmtp(data.smtp);
if (data.azure) setLocalAzure(data.azure);
setStatusIdx('Configuration imported! Click Save to apply.');
setIsImportModalOpen(false);
setImportFileContent(null);
} catch (e) {
console.error(e);
setStatusIdx('Import failed: Wrong password or corrupted file.');
}
};
const handleCreateDevice = async () => { const handleCreateDevice = async () => {
try { try {
@@ -251,55 +304,13 @@ const Settings: React.FC<SettingsProps> = ({ apiKey, productId, prompts, savePat
{ id: 'prompts', label: 'Prompts', icon: <FileText size={14} /> }, { id: 'prompts', label: 'Prompts', icon: <FileText size={14} /> },
{ id: 'email', label: 'Email', icon: <Mail size={14} /> }, { id: 'email', label: 'Email', icon: <Mail size={14} /> },
{ id: 'backup', label: 'Backup', icon: <Lock size={14} /> }, { id: 'backup', label: 'Backup', icon: <Lock size={14} /> },
{ id: 'apps', label: 'Apps', icon: <Headphones size={14} /> },
{ id: 'logs', label: 'Logs', icon: <ScrollText size={14} /> }, { id: 'logs', label: 'Logs', icon: <ScrollText size={14} /> },
] as const; ] as const;
return ( return (
<div className="flex flex-col h-full bg-background font-mono text-sm relative"> <div className="flex flex-col h-full bg-background font-mono text-sm relative">
{/* Import Password Modal */} {/* Import Password Modal */}
{isImportModalOpen && (
<div className="absolute inset-0 z-50 bg-black/50 flex items-center justify-center p-4">
<div className="bg-background border border-border rounded-lg shadow-xl p-6 w-full max-w-sm space-y-4">
<div className="flex items-center gap-2 text-foreground font-semibold">
<Lock size={16} /> Import Configuration
</div>
<p className="text-muted-foreground text-xs">
Enter the password used to encrypt this backup file.
</p>
<div className="relative">
<input
type={showPassword ? "text" : "password"}
value={backupPassword}
onChange={(e) => setBackupPassword(e.target.value)}
placeholder="Backup Password"
className="w-full p-2 pr-8 rounded border border-border bg-secondary text-foreground focus:ring-2 focus:ring-primary outline-none"
/>
<button
onClick={() => setShowPassword(!showPassword)}
className="absolute right-2 top-2.5 text-muted-foreground hover:text-foreground"
>
{showPassword ? <EyeOff size={14} /> : <Eye size={14} />}
</button>
</div>
<div className="flex justify-end gap-2 pt-2">
<button
onClick={() => setIsImportModalOpen(false)}
className="px-3 py-1.5 text-xs font-medium rounded border border-border hover:bg-secondary text-foreground transition-colors"
>
Cancel
</button>
<button
onClick={confirmImport}
disabled={!backupPassword}
className="px-3 py-1.5 text-xs font-medium rounded bg-primary text-primary-foreground hover:bg-primary/90 transition-colors disabled:opacity-50"
>
Decrypt & Import
</button>
</div>
</div>
</div>
)}
{/* Email Template Editor Modal */} {/* Email Template Editor Modal */}
<EmailTemplateEditor <EmailTemplateEditor
isOpen={isEmailEditorOpen} isOpen={isEmailEditorOpen}
@@ -471,7 +482,7 @@ const Settings: React.FC<SettingsProps> = ({ apiKey, productId, prompts, savePat
onClick={() => removePrompt(prompt.id)} onClick={() => removePrompt(prompt.id)}
className="absolute top-2 right-2 text-muted-foreground hover:text-destructive opacity-0 group-hover:opacity-100 transition-opacity text-xs flex items-center gap-1" className="absolute top-2 right-2 text-muted-foreground hover:text-destructive opacity-0 group-hover:opacity-100 transition-opacity text-xs flex items-center gap-1"
> >
<EyeOff size={14} /> Remove Remove
</button> </button>
<input <input
type="text" type="text"
@@ -619,31 +630,13 @@ const Settings: React.FC<SettingsProps> = ({ apiKey, productId, prompts, savePat
{activeTab === 'backup' && ( {activeTab === 'backup' && (
<div className="space-y-6 max-w-xl"> <div className="space-y-6 max-w-xl">
{/* Manual Configuration Backup */}
<div className="space-y-4"> <div className="space-y-4">
<h3 className="text-foreground font-semibold border-b border-border pb-2">Configuration Backup</h3> <h3 className="text-foreground font-semibold border-b border-border pb-2">Manual Configuration Backup</h3>
<p className="text-xs text-muted-foreground"> <p className="text-xs text-muted-foreground">
Securely export your settings, including API keys and prompts. You must set a password to encrypt the backup file. Export all your settings, including API keys, prompts, email templates, and history as JSON files.
</p> </p>
<div className="relative">
<label className="block text-xs font-semibold text-muted-foreground mb-1 uppercase tracking-wide">
Encryption Password
</label>
<input
type={showPassword ? "text" : "password"}
value={backupPassword}
onChange={(e) => setBackupPassword(e.target.value)}
placeholder="Enter a strong password"
className="w-full p-2 pr-8 rounded border border-border bg-secondary text-foreground focus:ring-2 focus:ring-primary outline-none text-sm"
/>
<button
onClick={() => setShowPassword(!showPassword)}
className="absolute right-2 top-8 text-muted-foreground hover:text-foreground"
>
{showPassword ? <EyeOff size={14} /> : <Eye size={14} />}
</button>
</div>
<div className="flex gap-4 pt-2"> <div className="flex gap-4 pt-2">
<button <button
onClick={handleExport} onClick={handleExport}
@@ -660,12 +653,148 @@ const Settings: React.FC<SettingsProps> = ({ apiKey, productId, prompts, savePat
<input <input
type="file" type="file"
id="import-file-input" id="import-file-input"
accept=".conf" accept=".json"
className="hidden" className="hidden"
onChange={handleFileSelect} onChange={handleFileSelect}
/> />
</div> </div>
</div> </div>
{/* Daily Automated Backup */}
<div className="space-y-4 border-t border-border pt-6">
<h3 className="text-foreground font-semibold border-b border-border pb-2">Daily Automated Backup</h3>
<p className="text-xs text-muted-foreground">
Automatically backup your configuration once per day to prevent data loss. Backups are saved as JSON files.
</p>
<div className="flex items-center gap-2">
<input
type="checkbox"
id="enable-daily-backup"
checked={localDailyBackupEnabled}
onChange={(e) => setLocalDailyBackupEnabled(e.target.checked)}
className="w-4 h-4"
/>
<label htmlFor="enable-daily-backup" className="text-sm text-foreground cursor-pointer">
Enable daily automated backup
</label>
</div>
{localDailyBackupEnabled && (
<div>
<label className="block text-xs font-semibold text-muted-foreground mb-1 uppercase tracking-wide">
Backup Location
</label>
<div className="flex gap-2">
<input
type="text"
value={localDailyBackupPath}
onChange={(e) => setLocalDailyBackupPath(e.target.value)}
placeholder="Leave empty to use recordings folder"
className="flex-1 p-2 rounded border border-border bg-secondary text-foreground focus:ring-2 focus:ring-primary outline-none text-sm"
/>
<button
onClick={async () => {
try {
const selected = await open({ directory: true, multiple: false });
if (selected && typeof selected === 'string') {
setLocalDailyBackupPath(selected);
}
} catch (e) {
console.error(e);
}
}}
className="p-2 aspect-square flex items-center justify-center bg-secondary hover:bg-secondary/80 border border-border rounded text-foreground transition-all active:scale-95"
>
<FolderOpen size={16} />
</button>
</div>
<p className="text-[10px] text-muted-foreground mt-1">
Last backup: {props.lastBackupDate || 'Never'}
</p>
</div>
)}
</div>
</div>
)}
{activeTab === 'apps' && (
<div className="space-y-6 max-w-2xl h-full flex flex-col overflow-hidden">
<div className="space-y-4 flex-none">
<h3 className="text-foreground font-semibold border-b border-border pb-2 flex items-center gap-2">
<Lock size={16} /> Recording Filters
</h3>
<p className="text-xs text-muted-foreground">
Block specific apps to keep their audio out of your recordings (e.g., music or browser sounds).
</p>
</div>
<div className="flex-1 overflow-y-auto pr-2 space-y-6 custom-scrollbar">
{/* App Presets */}
<div className="space-y-4">
<h4 className="text-[10px] font-bold text-muted-foreground uppercase tracking-wider">Quick Select (Classic Apps)</h4>
<div className="grid grid-cols-2 gap-4">
{APP_PRESETS.map(group => (
<div key={group.category} className="space-y-2">
<h5 className="text-[11px] font-medium text-foreground/80">{group.category}</h5>
<div className="flex flex-wrap gap-1.5">
{group.apps.map(app => (
<button
key={app.id}
onClick={() => {
if (localBlockedApps.includes(app.id)) {
setLocalBlockedApps(localBlockedApps.filter(id => id !== app.id));
} else {
setLocalBlockedApps([...localBlockedApps, app.id]);
}
}}
className={`text-[10px] px-2 py-1 rounded border transition-all active:scale-95 ${localBlockedApps.includes(app.id)
? 'bg-primary/20 border-primary text-primary font-medium'
: 'bg-secondary/50 border-border/50 text-muted-foreground hover:border-border hover:text-foreground'
}`}
>
{app.name}
</button>
))}
</div>
</div>
))}
</div>
</div>
{/* Running Apps */}
<div className="space-y-4">
<h4 className="text-[10px] font-bold text-muted-foreground uppercase tracking-wider">Currently Running</h4>
<div className="grid gap-2">
{runningApps.length > 0 ? (
runningApps.map(app => (
<label key={app.bundle_id} className="flex items-center justify-between p-3 bg-secondary/10 rounded border border-border/50 hover:bg-secondary/20 transition-colors cursor-pointer capitalize">
<div className="flex flex-col">
<span className="font-medium text-sm">{app.name}</span>
<span className="text-[10px] text-muted-foreground font-mono">{app.bundle_id}</span>
</div>
<input
type="checkbox"
checked={localBlockedApps.includes(app.bundle_id)}
onChange={(e) => {
if (e.target.checked) {
setLocalBlockedApps([...localBlockedApps, app.bundle_id]);
} else {
setLocalBlockedApps(localBlockedApps.filter(id => id !== app.bundle_id));
}
}}
className="w-4 h-4 rounded border-border bg-secondary text-primary focus:ring-primary"
/>
</label>
))
) : (
<div className="text-center py-8 text-muted-foreground text-xs italic border border-dashed border-border rounded">
No running applications detected.
</div>
)}
</div>
</div>
</div>
</div> </div>
)} )}

View File

@@ -0,0 +1,716 @@
import React, { useState, useEffect } from 'react';
import { Save, FolderOpen, Lock, Upload, Download, Mail, FileText, ScrollText } from 'lucide-react';
import { save, open } from '@tauri-apps/plugin-dialog';
import { invoke } from '@tauri-apps/api/core';
import EmailTemplateEditor from './EmailTemplateEditor';
import logo from '../assets/logo.png';
import { PromptTemplate, EmailTemplate } from '../App';
interface SettingsProps {
apiKey: string;
productId: string;
savePath: string;
prompts: PromptTemplate[];
emailTemplates: EmailTemplate[];
smtpConfig: SmtpConfig;
azureConfig: AzureConfig;
dailyBackupEnabled: boolean;
dailyBackupPath: string;
lastBackupDate: string;
history: any[];
onSave: (
apiKey: string,
productId: string,
prompts: PromptTemplate[],
savePath: string,
smtp: SmtpConfig,
azure: AzureConfig,
emailTemplates: EmailTemplate[],
dailyBackupEnabled: boolean,
dailyBackupPath: string
) => void;
onHistoryUpdate: (history: any[]) => void;
onClose: () => void;
}
export interface SmtpConfig {
host: string;
port: string;
user: string;
pass: string;
sender: string;
senderName: string;
}
export interface AzureConfig {
clientId: string;
tenantId: string;
}
const Settings: React.FC<SettingsProps> = ({ apiKey, productId, prompts, savePath, onSave, onClose, ...props }) => {
const [localApiKey, setLocalApiKey] = useState(apiKey);
const [localProductId, setLocalProductId] = useState(productId);
const [localSavePath, setLocalSavePath] = useState(savePath);
const [localPrompts, setLocalPrompts] = useState<PromptTemplate[]>(prompts);
const [localEmailTemplates, setLocalEmailTemplates] = useState<EmailTemplate[]>(props.emailTemplates); // New state
const [localSmtp, setLocalSmtp] = useState<SmtpConfig>(props.smtpConfig);
const [localAzure, setLocalAzure] = useState<AzureConfig>(props.azureConfig);
const [localDailyBackupEnabled, setLocalDailyBackupEnabled] = useState(props.dailyBackupEnabled);
const [localDailyBackupPath, setLocalDailyBackupPath] = useState(props.dailyBackupPath);
const [statusIdx, setStatusIdx] = useState<string | null>(null);
// Email Template Editor State
const [editingTemplate, setEditingTemplate] = useState<EmailTemplate | null>(null);
const [isEmailEditorOpen, setIsEmailEditorOpen] = useState(false);
const [activeTab, setActiveTab] = useState<'general' | 'prompts' | 'email' | 'backup' | 'logs'>('general');
const [logs, setLogs] = useState<string>('Loading logs...');
useEffect(() => {
if (activeTab === 'logs') {
loadLogs();
}
}, [activeTab]);
const loadLogs = async () => {
try {
const content = await invoke<string>('read_log_file');
setLogs(content);
} catch (e) {
setLogs(`Failed to load logs: ${e}`);
}
};
const handleSaveLogs = async () => {
try {
const filePath = await save({
defaultPath: `hearbit_logs_${new Date().toISOString().slice(0, 10)}.log`,
filters: [{
name: 'Log File',
extensions: ['log', 'txt']
}]
});
if (filePath) {
await invoke('save_text_file', { path: filePath, content: logs });
setStatusIdx(`Logs exported to: ${filePath}`);
}
} catch (e) {
console.error(e);
setStatusIdx(`Failed to export logs: ${e}`);
}
};
const handlePromptChange = (id: string, field: 'name' | 'content', value: string) => {
setLocalPrompts(localPrompts.map(p => p.id === id ? { ...p, [field]: value } : p));
};
const addPrompt = () => {
setLocalPrompts([...localPrompts, { id: Date.now().toString(), name: 'New Prompt', content: '' }]);
};
const removePrompt = (id: string) => {
setLocalPrompts(localPrompts.filter(p => p.id !== id));
};
const handleSaveEmailTemplate = (template: EmailTemplate) => {
const exists = localEmailTemplates.find(t => t.id === template.id);
if (exists) {
setLocalEmailTemplates(localEmailTemplates.map(t => t.id === template.id ? template : t));
} else {
setLocalEmailTemplates([...localEmailTemplates, template]);
}
};
const openEmailEditor = (template: EmailTemplate | null) => {
setEditingTemplate(template);
setIsEmailEditorOpen(true);
};
const removeEmailTemplate = (id: string) => {
setLocalEmailTemplates(localEmailTemplates.filter(t => t.id !== id));
};
const handleSave = () => {
onSave(
localApiKey,
localProductId,
localPrompts,
localSavePath,
localSmtp,
localAzure,
localEmailTemplates,
localDailyBackupEnabled,
localDailyBackupPath
);
onClose();
};
const handleSelectFolder = async () => {
try {
const selected = await open({
directory: true,
multiple: false,
defaultPath: localSavePath || undefined,
});
if (selected && typeof selected === 'string') {
setLocalSavePath(selected);
}
} catch (e) {
console.error("Failed to open directory picker", e);
setStatusIdx('Error: Failed to open directory picker.');
}
};
const handleExport = async () => {
try {
const data = {
apiKey: localApiKey,
productId: localProductId,
prompts: localPrompts,
savePath: localSavePath,
smtp: localSmtp,
azure: localAzure,
emailTemplates: localEmailTemplates,
history: props.history,
dailyBackup: {
enabled: localDailyBackupEnabled,
path: localDailyBackupPath,
}
};
// Always save as JSON (no encryption)
const content = JSON.stringify(data, null, 2);
const filePath = await save({
defaultPath: `hearbit_backup_${new Date().toISOString().slice(0, 10)}.json`,
filters: [{
name: 'Hearbit Config',
extensions: ['json']
}]
});
if (filePath) {
await invoke('save_text_file', { path: filePath, content });
setStatusIdx(`Configuration exported to: ${filePath}`);
}
} catch (e) {
console.error(e);
setStatusIdx(`Export failed: ${e}`);
}
};
const triggerImport = () => {
document.getElementById('import-file-input')?.click();
};
const handleFileSelect = (e: React.ChangeEvent<HTMLInputElement>) => {
const file = e.target.files?.[0];
if (!file) return;
const reader = new FileReader();
reader.onload = (event) => {
if (event.target?.result) {
const content = event.target.result as string;
// Directly import without password modal since we don't use encryption
try {
const data = JSON.parse(content);
if (data.apiKey) setLocalApiKey(data.apiKey);
if (data.productId) setLocalProductId(data.productId);
if (data.prompts) setLocalPrompts(data.prompts);
if (data.emailTemplates) setLocalEmailTemplates(data.emailTemplates);
if (data.savePath) setLocalSavePath(data.savePath);
if (data.smtp) setLocalSmtp(data.smtp);
if (data.azure) setLocalAzure(data.azure);
if (data.dailyBackup) {
if (data.dailyBackup.enabled !== undefined) setLocalDailyBackupEnabled(data.dailyBackup.enabled);
if (data.dailyBackup.path) setLocalDailyBackupPath(data.dailyBackup.path);
}
// Import history!
if (data.history && Array.isArray(data.history)) {
props.onHistoryUpdate(data.history);
}
setStatusIdx('Configuration imported! Click Save to apply.');
} catch (error) {
console.error(error);
setStatusIdx(`Import failed: ${error}`);
}
}
};
reader.readAsText(file);
e.target.value = '';
};
const handleCreateDevice = async () => {
try {
setStatusIdx('Creating Hearbit Audio device...');
await invoke('create_hearbit_audio_device');
setStatusIdx('Success! "Hearbit Audio" device created.');
} catch (e) {
console.error(e);
setStatusIdx(`Error: ${e}`);
}
};
const tabs = [
{ id: 'general', label: 'General', icon: <Save size={14} /> },
{ id: 'prompts', label: 'Prompts', icon: <FileText size={14} /> },
{ id: 'email', label: 'Email', icon: <Mail size={14} /> },
{ id: 'backup', label: 'Backup', icon: <Lock size={14} /> },
{ id: 'logs', label: 'Logs', icon: <ScrollText size={14} /> },
] as const;
return (
<div className="flex flex-col h-full bg-background font-mono text-sm relative">
{/* Import Password Modal */}
{/* Email Template Editor Modal */}
<EmailTemplateEditor
isOpen={isEmailEditorOpen}
onClose={() => setIsEmailEditorOpen(false)}
template={editingTemplate}
onSave={handleSaveEmailTemplate}
smtpConfig={localSmtp}
addToast={(msg, type) => setStatusIdx(`${type === 'error' ? 'Error' : 'Success'}: ${msg}`)}
/>
<div className="flex flex-col border-b border-border/40 bg-secondary/10">
<div className="p-4 flex justify-between items-center">
<h2 className="text-lg font-semibold tracking-tight">Settings</h2>
<button onClick={handleSave} className="flex items-center gap-2 bg-primary text-primary-foreground px-4 py-2 rounded font-semibold hover:bg-primary/90 transition-all active:scale-95 text-xs">
<Save size={16} /> Save Changes
</button>
</div>
{/* Tabs */}
<div className="flex px-4 gap-2">
{tabs.map(tab => (
<button
key={tab.id}
onClick={() => setActiveTab(tab.id)}
className={`px-4 py-2 text-xs font-medium border-b-2 transition-colors flex items-center gap-2 ${activeTab === tab.id ? 'border-primary text-primary' : 'border-transparent text-muted-foreground hover:text-foreground'}`}
>
{tab.icon} {tab.label}
</button>
))}
</div>
</div>
<div className="flex-1 overflow-y-auto p-6">
{activeTab === 'general' && (
<div className="space-y-6 max-w-2xl">
<div className="space-y-4">
<h3 className="text-foreground font-semibold border-b border-border pb-2">Application Keys</h3>
<div>
<label htmlFor="apiKey" className="block text-sm font-medium mb-1 text-foreground">API Key</label>
<input
id="apiKey"
type="password"
value={localApiKey}
onChange={(e) => setLocalApiKey(e.target.value)}
className="w-full p-2 rounded border border-border bg-secondary text-foreground focus:ring-2 focus:ring-primary outline-none font-mono text-sm"
/>
</div>
<div>
<label htmlFor="productId" className="block text-sm font-medium mb-1 text-foreground">Product ID</label>
<input
id="productId"
type="text"
value={localProductId}
onChange={(e) => setLocalProductId(e.target.value)}
className="w-full p-2 rounded border border-border bg-secondary text-foreground focus:ring-2 focus:ring-primary outline-none font-mono text-sm"
/>
</div>
</div>
<div className="space-y-4">
<h3 className="text-foreground font-semibold border-b border-border pb-2">Storage</h3>
<div>
<label htmlFor="savePath" className="block text-sm font-medium mb-1 text-foreground">Custom Recordings Folder</label>
<div className="flex gap-2">
<input
id="savePath"
type="text"
value={localSavePath}
onChange={(e) => setLocalSavePath(e.target.value)}
placeholder="/Users/username/Desktop/Recordings"
className="flex-1 p-2 rounded border border-border bg-secondary text-foreground focus:ring-2 focus:ring-primary outline-none font-mono text-sm"
/>
<button
onClick={handleSelectFolder}
className="p-2 aspect-square flex items-center justify-center bg-secondary hover:bg-secondary/80 border border-border rounded text-foreground transition-all active:scale-95"
title="Pick Folder"
>
<FolderOpen size={16} />
</button>
</div>
</div>
</div>
<div className="space-y-4">
<h3 className="text-foreground font-semibold border-b border-border pb-2">📸 Branding</h3>
<div className="p-4 bg-secondary/20 rounded border border-border/50">
<div className="mb-3">
<div className="font-medium text-sm mb-2">Custom Logo</div>
<div className="text-xs text-muted-foreground mb-3">Upload your company logo to replace the default Livtec branding throughout the app.</div>
</div>
{/* Logo Preview */}
<div className="flex items-center gap-4 mb-3">
<div className="w-20 h-20 bg-background border border-border rounded flex items-center justify-center overflow-hidden">
<img
src={localStorage.getItem('customLogo') || logo}
alt="Logo Preview"
className="max-w-full max-h-full object-contain"
/>
</div>
<div className="flex-1">
<button
onClick={async () => {
try {
const selected = await open({
filters: [{ name: 'Images', extensions: ['png', 'jpg', 'jpeg', 'svg'] }]
});
if (selected && typeof selected === 'string') {
const dataUrl = await invoke<string>('read_image_as_base64', { filePath: selected });
localStorage.setItem('customLogo', dataUrl);
setStatusIdx('Logo uploaded! Save settings to apply.');
// Force re-render
window.dispatchEvent(new Event('storage'));
}
} catch (e) {
setStatusIdx(`Logo upload failed: ${e}`);
}
}}
className="bg-secondary hover:bg-secondary/80 text-xs px-3 py-2 rounded border border-border transition-all flex items-center gap-2"
>
<Upload size={14} /> Upload Logo
</button>
<button
onClick={() => {
localStorage.removeItem('customLogo');
setStatusIdx('Logo reset to default. Save to apply.');
window.dispatchEvent(new Event('storage'));
}}
className="mt-2 bg-secondary hover:bg-secondary/80 text-xs px-3 py-2 rounded border border-border transition-all text-muted-foreground"
>
Reset to Default
</button>
</div>
</div>
<p className="text-[10px] text-muted-foreground">Supported: PNG, JPG, SVG. Recommended: Square format, transparent background.</p>
</div>
</div>
<div className="space-y-4">
<h3 className="text-foreground font-semibold border-b border-border pb-2">System Intergration</h3>
<div className="flex items-center justify-between p-4 bg-secondary/20 rounded border border-border/50">
<div>
<div className="font-medium text-sm">Hearbit Audio Device</div>
<div className="text-xs text-muted-foreground">Required for recording system audio (Teams, Zoom, etc.)</div>
</div>
<button
onClick={handleCreateDevice}
className="bg-secondary hover:bg-secondary/80 text-xs px-3 py-2 rounded border border-border transition-all active:scale-95 flex items-center gap-2"
>
<span>🪄</span> Create / Repair
</button>
</div>
</div>
</div>
)}
{activeTab === 'prompts' && (
<div className="space-y-4 max-w-3xl">
<div className="flex justify-between items-center border-b border-border pb-2">
<h3 className="text-foreground font-semibold">AI Prompts</h3>
<button onClick={addPrompt} className="text-xs bg-primary text-primary-foreground px-3 py-1.5 rounded hover:bg-primary/90 transition-all active:scale-95">
+ Add Prompt
</button>
</div>
<div className="grid gap-4">
{localPrompts.map((prompt) => (
<div key={prompt.id} className="space-y-2 p-4 bg-secondary/10 rounded border border-border/50 relative group">
<button
onClick={() => removePrompt(prompt.id)}
className="absolute top-2 right-2 text-muted-foreground hover:text-destructive opacity-0 group-hover:opacity-100 transition-opacity text-xs flex items-center gap-1"
>
Remove
</button>
<input
type="text"
value={prompt.name}
onChange={(e) => handlePromptChange(prompt.id, 'name', e.target.value)}
className="w-full p-1 bg-transparent border-b border-border/50 focus:border-primary outline-none font-semibold text-sm"
placeholder="Prompt Name"
/>
<textarea
value={prompt.content}
onChange={(e) => handlePromptChange(prompt.id, 'content', e.target.value)}
className="w-full p-2 bg-secondary/50 rounded border border-border/30 focus:border-primary outline-none text-xs resize-y min-h-[100px] font-mono"
placeholder="Prompt Content"
/>
</div>
))}
</div>
</div>
)}
{activeTab === 'email' && (
<div className="space-y-8 max-w-2xl">
<div className="space-y-4">
<div className="flex justify-between items-center border-b border-border pb-2">
<h3 className="text-foreground font-semibold">Email Templates</h3>
<button
onClick={() => openEmailEditor(null)}
className="text-xs bg-primary text-primary-foreground px-3 py-1.5 rounded hover:bg-primary/90 transition-all active:scale-95"
>
+ Add Template
</button>
</div>
<div className="space-y-2">
{localEmailTemplates.map((template) => (
<div key={template.id} className="flex justify-between items-center p-4 bg-secondary/10 rounded border border-border/50 group hover:border-border/80 transition-colors">
<div className="flex-1 min-w-0 pr-4">
<div className="font-semibold text-sm truncate">{template.name}</div>
<div className="text-xs text-muted-foreground truncate">{template.subject}</div>
</div>
<div className="flex gap-2 shrink-0">
<button
onClick={() => openEmailEditor(template)}
className="px-2 py-1 text-xs font-medium text-primary hover:bg-primary/10 rounded transition-colors"
>
Edit
</button>
<button
onClick={() => removeEmailTemplate(template.id)}
className="px-2 py-1 text-xs font-medium text-muted-foreground hover:text-destructive hover:bg-destructive/10 rounded transition-colors"
>
Remove
</button>
</div>
</div>
))}
{localEmailTemplates.length === 0 && (
<div className="text-center p-8 text-muted-foreground text-xs italic border border-dashed border-border rounded">
No templates created yet.
</div>
)}
</div>
</div>
<div className="space-y-4">
<h3 className="text-foreground font-semibold border-b border-border pb-2">SMTP Configuration</h3>
<div className="grid grid-cols-2 gap-4">
<div>
<label className="block text-sm font-medium mb-1 text-foreground">SMTP Host</label>
<input
type="text"
value={localSmtp.host}
onChange={(e) => setLocalSmtp({ ...localSmtp, host: e.target.value })}
className="w-full p-2 rounded border border-border bg-secondary text-foreground focus:ring-2 focus:ring-primary outline-none font-mono text-sm"
placeholder="smtp.office365.com"
/>
</div>
<div>
<label className="block text-sm font-medium mb-1 text-foreground">Port</label>
<input
type="text"
value={localSmtp.port}
onChange={(e) => setLocalSmtp({ ...localSmtp, port: e.target.value })}
className="w-full p-2 rounded border border-border bg-secondary text-foreground focus:ring-2 focus:ring-primary outline-none font-mono text-sm"
placeholder="587"
/>
</div>
<div className="col-span-2">
<label className="block text-sm font-medium mb-1 text-foreground">Sender Email</label>
<input
type="text"
value={localSmtp.sender}
onChange={(e) => setLocalSmtp({ ...localSmtp, sender: e.target.value })}
className="w-full p-2 rounded border border-border bg-secondary text-foreground focus:ring-2 focus:ring-primary outline-none font-mono text-sm"
placeholder="you@company.com"
/>
</div>
<div className="col-span-2">
<label className="block text-sm font-medium mb-1 text-foreground">Sender Name (Optional)</label>
<input
type="text"
value={localSmtp.senderName}
onChange={(e) => setLocalSmtp({ ...localSmtp, senderName: e.target.value })}
className="w-full p-2 rounded border border-border bg-secondary text-foreground focus:ring-2 focus:ring-primary outline-none font-mono text-sm"
placeholder="Hearbit Assistant"
/>
</div>
<div>
<label className="block text-sm font-medium mb-1 text-foreground">Username</label>
<input
type="text"
value={localSmtp.user}
onChange={(e) => setLocalSmtp({ ...localSmtp, user: e.target.value })}
className="w-full p-2 rounded border border-border bg-secondary text-foreground focus:ring-2 focus:ring-primary outline-none font-mono text-sm"
/>
</div>
<div>
<label className="block text-sm font-medium mb-1 text-foreground">Password</label>
<input
type="password"
value={localSmtp.pass}
onChange={(e) => setLocalSmtp({ ...localSmtp, pass: e.target.value })}
className="w-full p-2 rounded border border-border bg-secondary text-foreground focus:ring-2 focus:ring-primary outline-none font-mono text-sm"
/>
</div>
</div>
</div>
<div className="space-y-4">
<h3 className="text-foreground font-semibold border-b border-border pb-2">Microsoft 365 (Azure AD)</h3>
<p className="text-xs text-muted-foreground">Optional configuration for advanced MS Graph integrations.</p>
<div>
<label className="block text-sm font-medium mb-1 text-foreground">Client ID</label>
<input
type="text"
value={localAzure.clientId}
onChange={(e) => setLocalAzure({ ...localAzure, clientId: e.target.value })}
className="w-full p-2 rounded border border-border bg-secondary text-foreground focus:ring-2 focus:ring-primary outline-none font-mono text-sm"
placeholder="Application (client) ID"
/>
</div>
</div>
</div>
)}
{activeTab === 'backup' && (
<div className="space-y-6 max-w-xl">
{/* Manual Configuration Backup */}
<div className="space-y-4">
<h3 className="text-foreground font-semibold border-b border-border pb-2">Manual Configuration Backup</h3>
<p className="text-xs text-muted-foreground">
Export all your settings, including API keys, prompts, email templates, and history as JSON files.
</p>
<div className="flex gap-4 pt-2">
<button
onClick={handleExport}
className="flex-1 flex items-center justify-center gap-2 py-3 px-4 rounded-lg bg-secondary hover:bg-secondary/80 border border-border text-foreground transition-all font-semibold active:scale-95"
>
<Download size={18} /> Export Config
</button>
<button
onClick={triggerImport}
className="flex-1 flex items-center justify-center gap-2 py-3 px-4 rounded-lg bg-secondary hover:bg-secondary/80 border border-border text-foreground transition-all font-semibold active:scale-95"
>
<Upload size={18} /> Import Config
</button>
<input
type="file"
id="import-file-input"
accept=".json"
className="hidden"
onChange={handleFileSelect}
/>
</div>
</div>
{/* Daily Automated Backup */}
<div className="space-y-4 border-t border-border pt-6">
<h3 className="text-foreground font-semibold border-b border-border pb-2">Daily Automated Backup</h3>
<p className="text-xs text-muted-foreground">
Automatically backup your configuration once per day to prevent data loss. Backups are saved as JSON files.
</p>
<div className="flex items-center gap-2">
<input
type="checkbox"
id="enable-daily-backup"
checked={localDailyBackupEnabled}
onChange={(e) => setLocalDailyBackupEnabled(e.target.checked)}
className="w-4 h-4"
/>
<label htmlFor="enable-daily-backup" className="text-sm text-foreground cursor-pointer">
Enable daily automated backup
</label>
</div>
{localDailyBackupEnabled && (
<div>
<label className="block text-xs font-semibold text-muted-foreground mb-1 uppercase tracking-wide">
Backup Location
</label>
<div className="flex gap-2">
<input
type="text"
value={localDailyBackupPath}
onChange={(e) => setLocalDailyBackupPath(e.target.value)}
placeholder="Leave empty to use recordings folder"
className="flex-1 p-2 rounded border border-border bg-secondary text-foreground focus:ring-2 focus:ring-primary outline-none text-sm"
/>
<button
onClick={async () => {
try {
const selected = await open({ directory: true, multiple: false });
if (selected && typeof selected === 'string') {
setLocalDailyBackupPath(selected);
}
} catch (e) {
console.error(e);
}
}}
className="p-2 aspect-square flex items-center justify-center bg-secondary hover:bg-secondary/80 border border-border rounded text-foreground transition-all active:scale-95"
>
<FolderOpen size={16} />
</button>
</div>
<p className="text-[10px] text-muted-foreground mt-1">
Last backup: {props.lastBackupDate || 'Never'}
</p>
</div>
)}
</div>
</div>
)}
{activeTab === 'logs' && (
<div className="space-y-4 max-w-4xl h-full flex flex-col">
<div className="flex justify-between items-center border-b border-border pb-2">
<h3 className="text-foreground font-semibold">Application Logs</h3>
<div className="flex gap-2">
<button
onClick={handleSaveLogs}
className="text-xs bg-secondary hover:bg-secondary/80 px-2 py-1 rounded border border-border transition-all active:scale-95 flex items-center gap-1"
>
<Download size={12} /> Export Logs
</button>
<button
onClick={loadLogs}
className="text-xs bg-secondary hover:bg-secondary/80 px-2 py-1 rounded border border-border transition-all active:scale-95"
>
Refresh Logs
</button>
</div>
</div>
<div className="flex-1 bg-black text-white p-4 rounded font-mono text-xs overflow-auto whitespace-pre-wrap leading-relaxed shadow-inner">
{logs}
</div>
</div>
)}
</div>
{
statusIdx && (
<div className={`p-2 text-xs text-center border-t ${statusIdx.includes('Error') || statusIdx.includes('failed') ? 'bg-destructive/10 border-destructive text-destructive' : 'bg-green-500/10 border-green-500 text-green-500'}`}>
{statusIdx}
</div>
)
}
</div >
);
};
export default Settings;