prototype.utility_funcs package¶
Submodules¶
prototype.utility_funcs.io_agent module¶
@file: io_agent.py Created on 11.12.2016 19:59 @project: GitHubRepositoryClassifier
@author: QueensGambit
- The InputOutputAgent loads data (json-Data, README...) from a given repository which
- is defined by strUser and strName. If the needed data has already been requested before, then is loaded from a file. Otherwise a new connection is created. By default the autorization of the connection is done with an API-Token
-
class
prototype.utility_funcs.io_agent.
InputOutputAgent
(strUser, strName)[source]¶ Bases:
object
-
getReadme
(strPathReadme)[source]¶ Gets the content from the Redme as a string. The Readme is either loaded from file or web.
Parameters: strPathReadme – path were the readme is loaded and exported to Returns:
-
loadJSONdata
(strPathJSON)[source]¶ loads the requested json-data either from a file or alternatively from the web files are exported in the ‘./json/’ directory if they were requested
-
prototype.utility_funcs.preprocessing_operations module¶
-
prototype.utility_funcs.preprocessing_operations.
createVoabularyFeatures
(lstRepos)[source]¶ Here the vocabulary-list is created by using the given list of GithubRepo-Objects
Parameters: lstRepos – list of GithubRepo-Objects Returns: vocabList - list of the feature names
prototype.utility_funcs.reliableNormalizer module¶
@file: reliableNormalizer.py Created on 15.01.2017 15:53 @project: GitHubRepositoryClassifier
@author: Lukas
don’t use this
prototype.utility_funcs.string_operation module¶
-
prototype.utility_funcs.string_operation.
prepare_words
(raw_text, bApplyStemmer=True, bCheckStopWords=False)[source]¶ prepares the word for the comparision with the vocab list
Parameters: - raw_text – text with control characters, number,
- bApplyStemmer – true if is stemming shall be applied
- bCheckStopWords – true if stopwords shall be removed
Returns: normed word list