vad 0.0.2 vad: ^0.0.2 copied to clipboard
VAD is a cross-platform Dart binding for the VAD JavaScript library. This package provides access to a Voice Activity Detection (VAD) system, allowing Flutter applications to start and stop VAD-based [...]
VAD Package for Flutter #
VAD Package is a versatile Dart binding for the VAD JavaScript library , enabling Voice Activity Detection (VAD) in Flutter applications across iOS , Android , and Web platforms. This package allows Flutter applications to start and stop VAD-based listening and handle various VAD events seamlessly on multiple platforms.
Under the hood, the VAD Package uses dart:js_interop
for Web and flutter_inappwebview
in headless mode to run the VAD JavaScript library. The package provides a simple API to start and stop VAD listening, configure VAD parameters, and handle VAD events such as speech start, speech end, errors, and misfires.
Table of Contents #
Live Demo #
Check out the VAD Package Example App to see the VAD Package in action on the Web platform.
Features #
-
Cross-Platform Support: Works seamlessly on iOS, Android, and Web.
-
Event Streams: Listen to events such as speech start, speech end, errors, and misfires.
-
Configurable Parameters: Customize VAD behavior with various parameters.
-
Easy Integration: Simple setup and usage within Flutter applications.
Getting Started #
Prerequisites #
Before integrating the VAD Package into your Flutter application, ensure that you have the necessary configurations for each target platform.
Web
To use VAD on the web, include the following scripts in your web/index.html
file to load the necessary VAD libraries:
<!-- VAD Dependencies -->
<script src="https://cdn.jsdelivr.net/npm/onnxruntime-web/dist/ort.js"></script>
<script src="https://cdn.jsdelivr.net/npm/@ricky0123/vad-web@0.0.19/dist/bundle.min.js"></script>
<script src="https://cdn.jsdelivr.net/gh/keyur2maru/vad_dart@latest/dist/vad_web.js"></script>
iOS
For iOS, you need to configure microphone permissions and other settings in your Info.plist
file.
- Add Microphone Usage Description: Open
ios/Runner/Info.plist
and add the following entries to request microphone access:
<key>NSMicrophoneUsageDescription</key>
<string>This app needs access to the microphone for Voice Activity Detection.</string>
- Configure Build Settings: Ensure that your
Podfile
includes the necessary build settings for microphone permissions:
post_install do |installer|
installer.pods_project.targets.each do |target|
flutter_additional_ios_build_settings(target)
target.build_configurations.each do |config|
config.build_settings['GCC_PREPROCESSOR_DEFINITIONS'] ||= [
'$(inherited)',
'PERMISSION_MICROPHONE=1',
]
end
end
end
Android
For Android, configure the required permissions and build settings in your AndroidManifest.xml
and build.gradle
files.
- Add Permissions: Open
android/app/src/main/AndroidManifest.xml
and add the following permissions:
<uses-permission android:name="android.permission.RECORD_AUDIO" />
<uses-permission android:name="android.permission.INTERNET"/>
<uses-permission android:name="android.permission.MODIFY_AUDIO_SETTINGS"/>
- Configure Build Settings: Open
android/app/build.gradle
and add the following settings:
android {
compileSdkVersion 34
...
}
Installation #
Add the VAD Package to your pubspec.yaml
dependencies:
Note: audioplayers
is used for audio playback in the example below. You can replace it with any other audio player package of your choice.
dependencies:
flutter:
sdk: flutter
vad: ^0.0.1
permission_handler: ^11.3.1
audioplayers: ^6.1.0
Then, run flutter pub get
to fetch the packages.
Usage #
Example #
Below is a comprehensive example demonstrating how to integrate and use the VAD Package in a Flutter application supporting iOS, Android, and Web.
import 'package:flutter/material.dart';
import 'package:permission_handler/permission_handler.dart';
import 'package:audioplayers/audioplayers.dart' as audioplayers;
import 'package:vad/vad.dart';
import 'audio_utils.dart'; // Ensure this utility exists in your project, you can find the implementation in the example app.
void main() async {
WidgetsFlutterBinding.ensureInitialized();
runApp(const MyApp());
}
class MyApp extends StatelessWidget {
const MyApp({super.key});
@override
Widget build(BuildContext context) {
return MaterialApp(
title: 'VAD Example',
theme: ThemeData(
primarySwatch: Colors.blue,
),
home: const MyHomePage(title: 'VAD Example Home Page'),
);
}
}
class MyHomePage extends StatefulWidget {
const MyHomePage({super.key, required this.title});
final String title;
@override
State<MyHomePage> createState() => _MyHomePageState();
}
class Recording {
final List<double> samples;
Recording(this.samples);
}
class _MyHomePageState extends State<MyHomePage> {
List<Recording> recordings = [];
final audioplayers.AudioPlayer _audioPlayer = audioplayers.AudioPlayer();
final VadHandler _vadHandler = VadHandler.create(isDebug: true);
bool isListening = false;
int preSpeechPadFrames = 10;
int redemptionFrames = 8;
@override
void initState() {
super.initState();
_vadHandler.onSpeechStart.listen((_) {
debugPrint('Speech detected.');
});
_vadHandler.onSpeechEnd.listen((audioData) {
debugPrint('Speech ended with audio data: ${audioData.length} samples.');
recordings.add(Recording(audioData));
});
_vadHandler.onError.listen((error) {
debugPrint('VAD Error: $error');
});
_vadHandler.onVADMisfire.listen((_) {
debugPrint('VAD Misfire detected.');
});
}
@override
void dispose() {
_vadHandler.dispose();
_audioPlayer.dispose();
super.dispose();
}
Future<void> _playRecording(Recording recording) async {
try {
String uri = AudioUtils.createWavUrl(recording.samples);
await _audioPlayer.play(audioplayers.UrlSource(uri));
} catch (e) {
debugPrint('Error playing audio: $e');
}
}
@override
Widget build(BuildContext context) {
return Scaffold(
appBar: AppBar(
title: Text(widget.title),
),
// UI Elements
body: Center(
child: Column(
mainAxisAlignment: MainAxisAlignment.center,
children: <Widget>[
ElevatedButton(
onPressed: () async {
await _audioPlayer.setAudioContext(audioplayers.AudioContext(
iOS: audioplayers.AudioContextIOS(
options: {audioplayers.AVAudioSessionOptions.mixWithOthers},
),
));
setState(() {
if (isListening) {
_vadHandler.stopListening();
} else {
_vadHandler.startListening(
submitUserSpeechOnPause: true,
);
}
isListening = !isListening;
});
},
child: Text(isListening ? 'Stop Listening' : 'Start Listening'),
),
ElevatedButton(
onPressed: () async {
final status = await Permission.microphone.request();
debugPrint("Microphone permission status: $status");
// Optionally handle the permission status
},
child: const Text("Request Microphone Permission"),
),
Expanded(
child: ListView.builder(
itemCount: recordings.length,
itemBuilder: (context, index) {
return ListTile(
title: Text('Recording ${index + 1}'),
onTap: () => _playRecording(recordings[index]),
);
},
),
),
],
),
),
);
}
}
Explanation of the Example
- Initialization:
-
Initializes the
VadHandler
with debugging enabled. -
Sets up listeners for various VAD events (
onSpeechStart
,onSpeechEnd
,onError
,onVADMisfire
).
- Permissions:
- Requests microphone permission when the "Request Microphone Permission" button is pressed.
- Listening Controls:
-
Toggles listening on and off with the "Start Listening"/"Stop Listening" button.
-
Configures the audio player to mix with other audio sources on iOS.
- Playback:
-
Displays a list of recordings.
-
Allows playback of recordings by tapping on them. Note: Ensure that you have an
audio_utils.dart
file with thecreateWavUrl
method implemented to handle audio data conversion as shown in the example app.
VADHandler API #
Methods #
- create
Creates a new instance of the VadHandler
with optional debugging enabled with the isDebug
parameter.
- startListening
Starts the VAD with configurable parameters.
void startListening({
double positiveSpeechThreshold = 0.5,
double negativeSpeechThreshold = 0.35,
int preSpeechPadFrames = 1,
int redemptionFrames = 8,
int frameSamples = 1536,
int minSpeechFrames = 3,
bool submitUserSpeechOnPause = false,
});
- stopListening
Stops the VAD session.
void stopListening();
- dispose
Disposes the VADHandler and closes all streams.
void dispose();
Event Streams #
- onSpeechEnd
Triggered when speech ends with optional audio data.
Stream<List<double>> get onSpeechEnd;
- onSpeechStart
Triggered when speech starts.
Stream<void> get onSpeechStart;
- onVADMisfire
Triggered when the VAD misfires.
Stream<void> get onVADMisfire;
- onError
Triggered when an error occurs.
Stream<String> get onError;
Permissions #
Proper handling of microphone permissions is crucial for the VAD Package to function correctly on all platforms.
iOS #
-
Configuration: Ensure that
NSMicrophoneUsageDescription
is added to yourInfo.plist
with a descriptive message explaining why the app requires microphone access. -
Runtime Permission: Request microphone permission at runtime using the
permission_handler
package.
Android #
-
Configuration: Add the
RECORD_AUDIO
,MODIFY_AUDIO_SETTINGS
, andINTERNET
permissions to yourAndroidManifest.xml
. -
Runtime Permission: Request microphone permission at runtime using the
permission_handler
package.
Web #
- Browser Permissions: Microphone access is managed by the browser. Users will be prompted to grant microphone access when the VAD starts listening.
Cleaning Up #
To prevent memory leaks and ensure that all resources are properly released, always call the dispose
method on the VadHandler
instance when it's no longer needed.
vadHandler.dispose();
Tested Platforms #
The VAD Package has been tested on the following platforms:
- iOS: Tested on iPhone 15 Pro Max running iOS 18.1.
- Android: Tested on Lenovo Tab M8 Running Android 10.
- Web: Tested on Chrome Mac/Windows/Android/iOS, Safari Mac/iOS.
Contributing #
Contributions are welcome! Please feel free to submit a pull request or open an issue if you encounter any problems or have suggestions for improvements.
Acknowledgements #
Special thanks to Ricky0123 for creating the VAD JavaScript library that powers the VAD Package and Silero Team for the VAD model used in the library.
License #
This project is licensed under the MIT License . See the LICENSE file for details.
For any issues or contributions, please visit the GitHub repository .