Working with speech data in order to create synthetic voices, to train speech recognizers or to search for new features of the speech signal requires the management of a huge number of audio and data files as well as labels and features. Most of the time these files, labels and features are highly intertwined and there are dependencies, which need to be considered.