- Get rid of the pandas dependency
- Use PyArrow for table registration, data interchange, etc.
- Where we need to do internal calculations on dataframes, we can use DuckDB