by Ednalyn C. De Dios | Dec 5, 2019 | Programming
Below are three pandas functions that I use to load data into a dataframe. I use read_csv() to read a regular .csv file. For Excel files, I use read_excel(). And finally, for those weird tab-delimited files with the .tsv extension, I also use read_csv() but with the...
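As a rough illustration of the loading pattern described in this excerpt, here is a minimal sketch. The excerpt is cut off before it says which argument is used for .tsv files, so passing `sep="\t"` is an assumption (it is one common way to read tab-delimited text with read_csv()). The sample data and the Excel file name are made up for the example.

```python
import io

import pandas as pd

# Hypothetical sample data standing in for real files on disk.
csv_text = "name,score\nAna,90\nBen,85\n"
tsv_text = "name\tscore\nAna\t90\nBen\t85\n"

# A regular comma-separated file: read_csv() with its defaults.
df_csv = pd.read_csv(io.StringIO(csv_text))

# A tab-delimited file: read_csv() again, with the separator set to a tab.
# (Assumption: the truncated excerpt does not show the exact argument.)
df_tsv = pd.read_csv(io.StringIO(tsv_text), sep="\t")

# An Excel workbook would be read with read_excel(); the path below is
# hypothetical, and an Excel engine such as openpyxl must be installed.
# df_xlsx = pd.read_excel("workbook.xlsx")

print(df_csv.head())
print(df_tsv.head())
```

Both calls produce the same two-column dataframe here, which is a quick way to confirm the separator is being picked up correctly.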
by Ednalyn C. De Dios | Nov 4, 2019 | Programming
The function below shows any null, NaN, or empty values in the dataframe. It takes a dataframe as a parameter and returns a dataframe with columns that show the count and percentage of any null, NaN, and empty values. def missing_values_col(df): """...
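The body of missing_values_col() is not visible in this excerpt, so the following is only a sketch of how such a summary could be built, not the author's implementation. The function name `missing_values_summary`, the output column names, and the choice to treat whitespace-only strings as empty are all assumptions.

```python
import numpy as np
import pandas as pd


def missing_values_summary(df: pd.DataFrame) -> pd.DataFrame:
    """Return per-column counts and a percentage of missing or empty values."""
    n_rows = len(df)

    # Count null / NaN values in each column.
    null_count = df.isnull().sum()

    # Count strings that are empty or contain only whitespace (assumption:
    # this is what "empty" means here).
    empty_count = df.apply(
        lambda col: col.map(lambda v: isinstance(v, str) and v.strip() == "").sum()
    )

    summary = pd.DataFrame({"num_missing": null_count, "num_empty": empty_count})
    total = summary["num_missing"] + summary["num_empty"]

    # Avoid dividing by zero when the dataframe has no rows.
    if n_rows:
        summary["pct_missing_or_empty"] = (total / n_rows * 100).round(2)
    else:
        summary["pct_missing_or_empty"] = 0.0
    return summary


# Hypothetical example data.
example = pd.DataFrame({"a": [1, np.nan, 3], "b": ["x", "", None]})
print(missing_values_summary(example))
```

The result has one row per column of the input, so it can be sorted or filtered like any other dataframe.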
by Ednalyn C. De Dios | Oct 3, 2019 | Data Science
The Python function below cleans up textual data. For parameters, it takes a dataframe and a column name. The function encodes and decodes the text. After that, it performs some basic regex parsing. Finally, all the words that are designated as stop words are then...
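The steps named in this excerpt (encode and decode, regex parsing, stop word removal) could look roughly like the sketch below. The original function is not shown, so everything here is an assumption: the name `clean_text_column`, the Unicode normalization step, the regex, and especially the tiny hard-coded stop word set, which stands in for whatever stop word list the post actually uses.

```python
import re
import unicodedata

import pandas as pd

# Illustrative stop words only; a real list would be much longer.
STOP_WORDS = {"a", "an", "and", "the", "is", "of", "to", "in"}


def clean_text_column(df: pd.DataFrame, column: str) -> pd.DataFrame:
    """Return a copy of df with a cleaned-up version of the given text column."""

    def clean(text):
        # Leave missing values untouched.
        if pd.isna(text):
            return text
        # Encode/decode: split accented characters, then drop non-ASCII bytes.
        text = unicodedata.normalize("NFKD", str(text))
        text = text.encode("ascii", "ignore").decode("utf-8", "ignore")
        # Basic regex parsing: lowercase, replace anything but letters,
        # digits, and whitespace with a space.
        text = re.sub(r"[^a-z0-9\s]", " ", text.lower())
        # Drop stop words and collapse repeated whitespace.
        words = [w for w in text.split() if w not in STOP_WORDS]
        return " ".join(words)

    out = df.copy()
    out[column] = out[column].map(clean)
    return out


# Hypothetical example data.
example = pd.DataFrame({"text": ["The Café is OPEN!", "An apple, and a pear."]})
print(clean_text_column(example, "text"))
```

Returning a copy instead of modifying the input in place is a design choice made for this sketch, so the raw text stays available for comparison.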
by Ednalyn C. De Dios | Sep 2, 2019 | Data Science, Programming
Here’s the code I use in my Jupyter notebook to display all the columns, rows, and characters. I usually put it at the beginning of my notebook, after the import statements. # set display options pd.set_option('display.max_columns', None)...
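Only the first option survives in this excerpt, so here is a minimal sketch of what a full set of display settings might look like. The `display.max_columns` line comes from the excerpt; the `display.max_rows` and `display.max_colwidth` lines are assumptions inferred from the mention of rows and characters.

```python
import pandas as pd

# set display options
pd.set_option("display.max_columns", None)   # show every column (from the excerpt)
pd.set_option("display.max_rows", None)      # show every row (assumed)
pd.set_option("display.max_colwidth", None)  # do not truncate cell text (assumed)

# Check the current values.
print(pd.get_option("display.max_columns"))
print(pd.get_option("display.max_rows"))
print(pd.get_option("display.max_colwidth"))
```

With no row limit, printing a very large dataframe can flood the notebook, so pd.reset_option("display.max_rows") is handy for switching back to the default.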
by Ednalyn C. De Dios | Aug 21, 2019 | Data Science Career
Michael Watkins It’s a mistake to believe that you will be successful in your new job by continuing to do what you did in your previous job, only more so. Preparing yourself means letting go of the past and embracing the imperatives of the new situation to give...