ETL Process with PySpark, Spark SQL, SQL tuning, data validation, dimensional and relational data modelingo Building data architectures and data pipelines in support of analyticso Big-data technologies such as Hadoop, SparkML, etc.o Data-related operations (e.g. SQL, UNIX)o Experienced with Agile/SCRUM framework.Streamlining and automating current processes(based on xls or Alteryx) for forecasting, cap planning & analytics consumption with PySparko Convert existing Alteryx workflow into analytics supported formats, define the data and reporting operations processes & performance monitoring metrics. o Implement rapid prototyping & deployment of existing capability to the target state platform