Machine Learning

This section covers some algorithms commonly used in machine learning.

Articles

The articles below cover machine learning and related signal processing topics.

A Guide to Principal Component Analysis (PCA)

PCA is a widely used operation for decorrelating data, whitening, and dimensionality reduction. Given its array of uses, a good understanding of exactly how it works is helpful.
A Tutorial on Cepstrum and LPCCs

LPCCs are Linear Prediction Cepstral Coefficients. This tutorial explains what the cepstrum is, how LPCCs are computed, and why they are useful.
A tutorial on Automatic Language Identification - ngram based

This page deals with automatically identifying the language of a piece of text using n-grams. A training corpus is assembled containing examples from each of the languages we wish to identify. The training information is then used to guess the language of a set of test sentences.
A tutorial on Automatic Language Identification - word based

This page deals with automatically identifying the language of a piece of text using words. A training corpus is assembled containing examples from each of the languages we wish to identify. The training information is then used to guess the language of a set of test sentences.
An Intuitive Discrete Fourier Transform Tutorial

The Discrete Fourier Transform is widely used in signal processing. To beginners in the field, the equation can be confusing. This page tries to convey an intuitive understanding of how the DFT does what it does.
Approximating a function with a polynomial

This page describes the approximation of functions with polynomials. It relies on using polynomials to interpolate a function; the trick is to choose the interpolation points so that the error between the function and the interpolating polynomial is minimised.
Documentation for matlab_speech_features

This is the documentation for the matlab_speech_features MATLAB library.
Encoding Variables For Neural Networks

Deep Neural Networks have become more popular over the last few years. This page deals with methods for encoding target variables, e.g. what if you want to predict categories instead of real numbers? What about angles?
Fitting a polynomial to a set of points

This page goes over some of the main ways to find a polynomial that passes through a set of points.
Gaussian Mixture Model Tutorial

This is an intuitive guide to Gaussian Mixture Models, along with a worked example applying them to a speaker identification problem.
Graphically Determining Backpropagation Equations

The hardest part about implementing neural networks is figuring out the backpropagation equations to train the weights. This article goes through a simple graphical method for deriving the equations.
Hidden Markov Model (HMM) Tutorial

Hidden Markov Models are used to model sequences probabilistically. They are related to Markov chains, but are used when the observations don't tell you exactly what state you are in.
Implementing the Dolph-Chebyshev Window

The Dolph-Chebyshev Window is commonly used in signal processing. Its advantages include constant height side lobes and tunable attenuation.
Linear Prediction Tutorial

Linear Prediction is an important tool for time series analysis, and is quite simple to understand too. This article provides an intuitive introduction to the topic.
Mel Frequency Cepstral Coefficient (MFCC) tutorial

Mel Frequency Cepstral Coefficients, or MFCCs, are a type of feature widely used in speech and speaker recognition systems. This page explains how to calculate MFCCs and why they are the way they are, i.e. how the steps involved in calculating MFCCs relate to psychoacoustic properties of the human ear and brain.
Speech Enhancement tutorial: Spectral Subtraction

Spectral Subtraction is one of the simplest speech enhancement algorithms. Its results are not great, but it is a good starting point for getting into more advanced speech enhancement algorithms.
Voice Activity Detection (VAD) Tutorial

The job of Voice Activity Detection (VAD) is to accurately discriminate between speech presence and speech absence. This page presents the basics of VAD.
Yet Another K-Means Tutorial

The purpose of K-means is to identify groups, or clusters, of data points in a multidimensional space. This tutorial covers K-means in a hopefully intuitive manner.
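
To give a flavour of what "identifying clusters" means in practice, here is a minimal sketch of the standard K-means iteration (often called Lloyd's algorithm) in plain Python. It is not taken from the tutorial itself: the function name `kmeans`, its parameters, and the choice of random data points as starting centroids are all assumptions made for illustration.

```python
import random


def kmeans(points, k, iterations=100, seed=0):
    """Illustrative K-means (Lloyd's algorithm) on a list of equal-length tuples.

    Returns (centroids, labels). The initialisation picks k random data
    points, which is one common choice among several.
    """
    rng = random.Random(seed)
    centroids = [list(p) for p in rng.sample(points, k)]
    labels = None
    for _ in range(iterations):
        # Assignment step: each point joins the cluster of its nearest
        # centroid, measured by squared Euclidean distance.
        new_labels = [
            min(
                range(k),
                key=lambda j: sum((a - b) ** 2 for a, b in zip(p, centroids[j])),
            )
            for p in points
        ]
        # Stop once the assignments no longer change.
        if new_labels == labels:
            break
        labels = new_labels
        # Update step: move each centroid to the mean of its members.
        for j in range(k):
            members = [p for p, lab in zip(points, labels) if lab == j]
            if members:  # an empty cluster keeps its previous centroid
                centroids[j] = [sum(c) / len(members) for c in zip(*members)]
    return centroids, labels
```

Calling `kmeans(points, 2)` on a list of 2-D tuples returns the two centroids and a cluster index for each point. The result can depend on the starting centroids, so practical implementations often run several restarts and keep the best one.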
