Conversation
|
@shahrukhx01 Thank you for working on it. |
|
I have added the baseline pipeline, following are my comments about what I did and what I skipped:
|
|
I'll continue on adding the rest of the functions, let me know about the design and styling when you get time to see it |
… into add_preprocessor delete .vscode directory
|
I'm more or less done with the first iteration of text cleaner, could you please review the code now. |
|
@shahrukhx01 Your PR is in good shape. Few points -
|
|
… into add_preprocessor rebase with master
|
Could you please have a look now? I was thinking about how the end-user would pick the cleaning functions, I ended up with the current design, using the from obsei.preprocessor.base_text_cleaner import CleaningFunctions as clean_funcs
request = AnalyzerRequest(
processed_text="Peter drinks likely likes to tea at 16:45 #datascience @shahrukh "
)
conf = BaseTextProcessorConfig(
text_cleaning_functions=[
clean_funcs.to_lower_case,
clean_funcs.remove_stop_words,
clean_funcs.stem_text,
]
)
print(TextCleaner().clean_input(config=conf, input_list=[request])) |
|
Hi @shahrukhx01 |
|
@shahrukhx01 Thank you very much. You PR is in very good shape now. There are few comments, which we can discuss on our call. Sometime it is difficult to explain in text :) |
|
@lalitpagaria I have made the changes added the tests. Please review when you get time. |
|
@shahrukhx01 Thanks I will review it today and share feedback. |
|
@shahrukhx01 I added few changes to your PR based on future design consideration. |
Could you review the structure of the code, is it as per your expectation and style guide we have for this repo? I'll then start adding functions to this and keep updating this PR. thanks
issues: #75