This library provides common speech features for ASR including MFCCs and filterbank energies for Android and iOS.

Merlyn Mind

Last update: Oct 7, 2022

Kotlin Speech Features

📒 Introduction

This library is a complete port of python_speech_features in pure Kotlin, available for Android and iOS projects.

It provides common speech features for automatic speech recognition (ASR), including MFCCs (mel-frequency cepstral coefficients) and filterbank energies.
To learn more about MFCCs, read more.

🙋 How to use

We support multiple platforms using Kotlin Multiplatform.

Android

Integration

Add jitpack.io to your project's repositories:

allprojects {
  repositories {
    google()
    maven { url 'https://jitpack.io' }
  }
}

Add the dependency:

dependencies {
    implementation "com.github.MerlynMind:kotlin_speech_features:${version}"
}
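
If your build uses the Gradle Kotlin DSL (`build.gradle.kts`) instead of Groovy, the same setup might look like the sketch below. It mirrors the Groovy snippets above; `speechFeaturesVersion` is a hypothetical placeholder, so replace its value with a release tag published for this repo.

```kotlin
// build.gradle.kts (project level): add the JitPack repository
allprojects {
    repositories {
        google()
        maven { url = uri("https://jitpack.io") }
    }
}

// build.gradle.kts (module level): add the dependency
// speechFeaturesVersion is a hypothetical placeholder, not a real version string.
val speechFeaturesVersion = "<version>"

dependencies {
    implementation("com.github.MerlynMind:kotlin_speech_features:$speechFeaturesVersion")
}
```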

Example implementation

A sample app is included in this repo to help understand the implementation.

Convert your audio signal into a float array (a demo is provided in the sample app).
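
As a rough illustration of this step, the sketch below decodes raw audio bytes into samples. It assumes the recording is raw little-endian 16-bit PCM, which is a common capture format but not something this README specifies. The function names `pcm16ToSamples` and `samplesToFloats` are hypothetical and are not part of this library; check the sample app for the input type the library actually expects.

```kotlin
import java.nio.ByteBuffer
import java.nio.ByteOrder

/** Decodes raw little-endian 16-bit PCM bytes into one Int per sample. */
fun pcm16ToSamples(pcm: ByteArray): IntArray {
    // View the byte array as 16-bit values; a trailing odd byte is ignored.
    val shorts = ByteBuffer.wrap(pcm).order(ByteOrder.LITTLE_ENDIAN).asShortBuffer()
    return IntArray(shorts.remaining()) { shorts.get(it).toInt() }
}

/** Scales 16-bit samples to floats in [-1, 1) by dividing by 32768. */
fun samplesToFloats(samples: IntArray): FloatArray =
    FloatArray(samples.size) { samples[it] / 32768f }
```

The scaling in `samplesToFloats` is only one possible convention; the library's own `MathUtils.normalize` may scale differently.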

Initialize speech features

private val speechFeatures = SpeechFeatures()

Perform any of the 4 operations:

// Mel-frequency cepstral coefficients
val result = speechFeatures.mfcc(MathUtils.normalize(wav), nFilt = 64)
// Mel filterbank energies
val result = speechFeatures.fbank(MathUtils.normalize(wav), nFilt = 64)
// Log mel filterbank energies
val result = speechFeatures.logfbank(MathUtils.normalize(wav), nFilt = 64)
// Spectral subband centroids
val result = speechFeatures.ssc(MathUtils.normalize(wav), nFilt = 64)

The result will contain matrices with the expected features. Pass these features on for further processing (e.g. classification, speech recognition).
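
To sanity-check the size of a result, it can help to estimate how many frames a signal produces. The sketch below follows the framing rule of the upstream python_speech_features library (window length and step are converted to samples and rounded, and the frame count is rounded up). It assumes the Kotlin port frames the signal the same way, which this README does not confirm; `expectedFrameCount` is an illustrative helper, not part of the library's API, and the default values are the upstream defaults.

```kotlin
import kotlin.math.ceil
import kotlin.math.roundToInt

/**
 * Estimates the number of analysis frames for a signal, using the framing
 * rule from upstream python_speech_features (assumed, not confirmed, for this port).
 */
fun expectedFrameCount(
    numSamples: Int,
    sampleRate: Int = 16000,
    winLen: Double = 0.025, // window length in seconds
    winStep: Double = 0.01  // step between windows in seconds
): Int {
    val frameLen = (winLen * sampleRate).roundToInt()
    val frameStep = (winStep * sampleRate).roundToInt()
    // A signal shorter than one window still yields a single frame.
    if (numSamples <= frameLen) return 1
    return 1 + ceil((numSamples - frameLen).toDouble() / frameStep).toInt()
}
```

Under this rule, one second of 16 kHz audio (16000 samples) gives 99 frames, so a feature matrix with one row per frame would have 99 rows.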

iOS

Integration

In Xcode, go to File > Add Packages...
Paste the URL of this repo into the search box.
Select the package found.
Click the Add Package button.

Example implementation

A sample app is included in this repo to help understand the implementation.

Convert your audio signal into a KotlinIntArray and normalize it.

import KotlinSpeechFeatures

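let signal = [Int](1...1000) // Example signal
let normalized = MathUtils.Companion.init().normalize(sig: toKotlinIntArray(arr: signal))

// Copies a Swift [Int] into the KotlinIntArray type expected by the library.
func toKotlinIntArray(arr: [Int]) -> KotlinIntArray {
    // Size by count (number of elements), not capacity (allocated storage).
    let result = KotlinIntArray(size: Int32(arr.count))
    // enumerated() is safe for an empty array, unlike 0...(arr.count-1).
    for (i, value) in arr.enumerated() {
        result.set(index: Int32(i), value: Int32(value))
    }
    return result
}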

Initialize speech features
```
let speechFeatures = SpeechFeatures()
```

Perform any of the 4 operations:

let result = speechFeatures.mfcc(signal: normalized, sampleRate: 16000, winLen: 0.025, winStep: 0.01, numCep: 13, nFilt: 64, nfft: 512, lowFreq: 0, highFreq: nil, preemph: 0.97, ceplifter: 22, appendEnergy: true, winFunc: nil)
let result = speechFeatures.fbank(signal: normalized, sampleRate: 16000, winLen: 0.025, winStep: 0.01, nFilt: 64, nfft: 512, lowFreq: 0, highFreq: nil, preemph: 0.97, winFunc: nil)
let result = speechFeatures.logfbank(signal: normalized, sampleRate: 16000, winLen: 0.025, winStep: 0.01, nFilt: 64, nfft: 512, lowFreq: 0, highFreq: nil, preemph: 0.97, winFunc: nil)
let result = speechFeatures.ssc(signal: normalized, sampleRate: 16000, winLen: 0.025, winStep: 0.01, nFilt: 64, nfft: 512, lowFreq: 0, highFreq: nil, preemph: 0.97, winFunc: nil)

The result will contain matrices with the expected features. Pass these features on for further processing (e.g. classification, speech recognition).
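
The `preemph: 0.97` argument in the calls above is the pre-emphasis coefficient. In the upstream python_speech_features library, pre-emphasis is a first-order filter applied to the signal before framing: the first sample is kept, and every later sample has `coeff` times the previous sample subtracted from it. The Kotlin sketch below illustrates that filter, assuming the port behaves the same way; `preEmphasis` is an illustrative name, not the library's API.

```kotlin
/**
 * Pre-emphasis filter as defined in upstream python_speech_features:
 * y[0] = x[0], y[n] = x[n] - coeff * x[n - 1] for n >= 1.
 */
fun preEmphasis(signal: FloatArray, coeff: Float = 0.97f): FloatArray =
    FloatArray(signal.size) { n ->
        if (n == 0) signal[0] else signal[n] - coeff * signal[n - 1]
    }
```

A coefficient of 0 leaves the signal unchanged, while values close to 1 attenuate slowly varying (low-frequency) content more strongly.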

JavaScript

Coming soon...

✍️ Contributing

Interested in contributing to the library? Thank you so much for your interest! We are always looking for improvements to the project, and contributions from open-source developers are greatly appreciated.

Clone the repo and create a new branch:

git clone https://github.com/merlynmind/kotlin_speech_features && cd kotlin_speech_features
git checkout -b name_for_new_branch

Make your changes and test them.
Submit a pull request with a comprehensive description of the changes.

🌟 Spread the word!

If you want to say thank you and/or support active development of this library:

Add a GitHub Star to the project!
Tweet about the project! Tag @MerlynMind and/or #heyMerlyn

Thank you so much for your interest in growing the reach of our library!

🧡 Credits

Arjun Sunil - Original author of Kotlin Speech Features
Raquib-Ul Alam - Major refactoring and making the code presentable
Rob Smith - Mentoring and helping us navigate the task

📝 References

Original library - Python Speech Features
Reference Library - C Speech Features
Sample english.wav was obtained as follows:

wget http://voyager.jpl.nasa.gov/spacecraft/audio/english.au
sox english.au -e signed-integer english.wav

Releases

v1.0.0 (Sep 19, 2022)
