Open
41 commits
69068af
add tensorflow_datasets dependency for vocab generation
marianelamin Sep 17, 2020
286fb1c
implement and generate POC for the vocab
marianelamin Sep 17, 2020
cdb767e
set r before regex expression
marianelamin Sep 17, 2020
b00b6c3
refactoring and setting up template for cleaner class
marianelamin Sep 19, 2020
5c87df8
refactor methods and fix small typos
marianelamin Sep 20, 2020
0fce291
fix typo
marianelamin Sep 20, 2020
8df7739
Merge pull request #36 from dieko95/improvement/class/datacleaning
marianelamin Sep 20, 2020
0883eb5
needs unit tests
marianelamin Sep 20, 2020
04c83d6
need complete the tests cases
marianelamin Oct 5, 2020
23740f5
Add functionality to accept input as string or dataframe. Add test c…
marianelamin Oct 10, 2020
ec88a18
Merge pull request #37 from dieko95/improvement/class/datacleaning
marianelamin Oct 10, 2020
17bcfe5
Merge branch 'feature/diego/tweet_loader' into feature/vocab-generator
marianelamin Oct 24, 2020
d3ffa63
attempt to integrate with angostura loader
marianelamin Oct 24, 2020
9037659
add pd.series as a valid input type for the data cleaning class
marianelamin Nov 15, 2020
6db80a0
- clean
marianelamin Nov 15, 2020
51bf82e
add tensorflow datasets library
marianelamin Nov 15, 2020
bad8bce
add tensorflow datasets on toml
marianelamin Nov 15, 2020
82e8b58
Merge branch 'master' into feature/vocab-generator
marianelamin Nov 15, 2020
3e4acae
add optional parameter to some data cleaning methods
marianelamin Nov 16, 2020
76278f4
add method showing before and after cleaning with # @ and https?: mar…
marianelamin Nov 16, 2020
48ff897
implement and generate POC for the vocab
marianelamin Sep 17, 2020
a6d18a5
set r before regex expression
marianelamin Sep 17, 2020
7f2b647
refactoring and setting up template for cleaner class
marianelamin Sep 19, 2020
722fb48
refactor methods and fix small typos
marianelamin Sep 20, 2020
34ead58
fix typo
marianelamin Sep 20, 2020
b9b7ed6
needs unit tests
marianelamin Sep 20, 2020
869500e
need complete the tests cases
marianelamin Oct 5, 2020
ce11f41
Add functionality to accept input as string or dataframe. Add test c…
marianelamin Oct 10, 2020
235d236
attempt to integrate with angostura loader
marianelamin Oct 24, 2020
90908d7
add pd.series as a valid input type for the data cleaning class
marianelamin Nov 15, 2020
754711e
- clean
marianelamin Nov 15, 2020
7c9a731
add tensorflow datasets library
marianelamin Nov 15, 2020
0e6d310
add tensorflow datasets on toml
marianelamin Nov 15, 2020
f9bd681
add optional parameter to some data cleaning methods
marianelamin Nov 16, 2020
cbe5c7d
add method showing before and after cleaning with # @ and https?: mar…
marianelamin Nov 16, 2020
17df53a
fix path to sample tweets
marianelamin Apr 23, 2021
acda796
corresponding to a different pr
marianelamin Apr 23, 2021
17f3b0b
Merge remote-tracking branch 'origin/improvement/data-cleaning-class'…
marianelamin Apr 23, 2021
ee3ac3b
corresponding to a different pr
marianelamin Apr 23, 2021
5d722e3
corresponding to a different pr
marianelamin Apr 23, 2021
f319f7c
black formatter
marianelamin Apr 23, 2021
4 changes: 3 additions & 1 deletion .gitignore
@@ -40,4 +40,6 @@
 /.pytest_cache
 /.coverage*
 
-angostura_connection.json
+
+# key
+angostura_connection.json