preprocessing_operations module

preprocessing_operations.createVoabularyFeatures(lstRepos)[source]

Creates the vocabulary list from the given list of GithubRepo objects.

Parameters:lstRepos – list of GithubRepo objects
Returns:vocabList – list of the feature names
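
As a rough illustration of what building a vocabulary list can look like, here is a minimal sketch. It is not the module's implementation: the helper name build_vocabulary, the min_count parameter, and the use of plain strings in place of GithubRepo objects are all assumptions made for the example.

```python
import re
from collections import Counter


def build_vocabulary(documents, min_count=1):
    """Hypothetical helper: return a sorted list of tokens that occur
    at least min_count times across all documents."""
    counts = Counter()
    for text in documents:
        # Lowercase and split into alphanumeric tokens.
        counts.update(re.findall(r"[a-z0-9]+", text.lower()))
    return sorted(tok for tok, n in counts.items() if n >= min_count)


# Plain strings stand in for the text of each repository (assumption).
docs = ["A Python web framework", "Web scraping in Python"]
vocab_list = build_vocabulary(docs)
print(vocab_list)
```

Sorting the result keeps the feature order stable between runs, which matters if the list is later used as column names for a feature matrix.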
preprocessing_operations.initInputParameters(strVocabPath, lstGithubRepo)[source]

Initializes the vocabulary set.

Parameters:
  • strVocabPath – path where the vocab list is stored
  • lstGithubRepo – list of GithubRepo objects
Returns:
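
The reference does not say how the initialization works. One plausible reading, given a storage path and a list of repositories, is a load-or-build pattern: reuse a stored vocabulary if the file exists, otherwise build and store one. The sketch below shows that pattern only; the function name init_vocab, the one-term-per-line file format, and the use of plain strings instead of GithubRepo objects are assumptions.

```python
import os
import tempfile


def init_vocab(path, texts):
    """Hypothetical load-or-build helper (not the module's own code)."""
    if os.path.isfile(path):
        # A stored vocabulary exists: read it back, one term per line.
        with open(path, encoding="utf-8") as f:
            return [line.strip() for line in f if line.strip()]
    # No stored vocabulary: build one from the texts and save it.
    vocab = sorted({word for text in texts for word in text.lower().split()})
    with open(path, "w", encoding="utf-8") as f:
        f.write("\n".join(vocab) + "\n")
    return vocab


path = os.path.join(tempfile.mkdtemp(), "vocab.txt")
first = init_vocab(path, ["web python", "python tool"])  # builds and stores
second = init_vocab(path, [])                            # loads from file
```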

preprocessing_operations.readVocabFromFile(strVocabPath)[source]

Reads the stored vocab list from the given file path.

Parameters:strVocabPath – path where the vocab is stored
Returns:
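
The storage format of the vocab file is not documented here. Assuming a plain-text file with one term per line, reading it back could look like the following sketch; read_vocab is a hypothetical name and the temporary file exists only to make the example self-contained.

```python
import os
import tempfile


def read_vocab(path):
    """Hypothetical reader: one vocabulary term per line, blanks skipped."""
    with open(path, encoding="utf-8") as f:
        return [line.strip() for line in f if line.strip()]


# Write a small sample file so the example can run on its own.
vocab_path = os.path.join(tempfile.mkdtemp(), "vocab.txt")
with open(vocab_path, "w", encoding="utf-8") as f:
    f.write("python\nweb\n\nframework\n")

loaded = read_vocab(vocab_path)
print(loaded)
```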