Rodrigo Soares Wurdig
Data Engineer
Data Engineer with 5+ years of experience in medium- and high-criticality operating environments. Skilled in Python, Scala, SQL, MS SQL Server, Databricks, Snowflake, and PostgreSQL, with a focus on designing and implementing data pipelines for real-time analytics, including extensive use of APIs, to support timely decision-making for companies in the financial, telecommunications, industrial, and education sectors.
Certifications
Python
Python
11/26/2019
SQL
SQL
11/26/2019
Data Science - Python
Python, Data Engineering, Data Science
03/26/2021
C1 Advanced
C1 Advanced, B2 Upper Intermediate, B1 Intermediate
10/07/2021
Tech stack
Python (6)
SQL (6)
B2 Upper Intermediate
C1 Advanced
Data Engineering
B1 Intermediate
Data Science
Spark (5)
Azure (5)
HBase (3)
Apache (3)
AWS Cloud Architecture (3)
Data Analytics (3)
PySpark (3)
Google Cloud (2)
Database Development (2)
Machine Learning (2)
PostgreSQL (2)
MongoDB (1)
Big Data (1)
Terraform (1)
Tableau (1)
Apache Kafka
Big Data Architecture
Data Warehousing
Experience
Senior Data Engineer
HCLTech
06/2023 - 01/2024

● Design, build, and maintain ELT data pipelines/DAGs using Airflow, Snowflake, DBT, and specific AWS services like AWS Glue and Amazon S3.
● Collaborate with other members of the data team to ensure that our data is accurate, timely, and available to stakeholders.
● Write and maintain efficient SQL and Jinja code to transform and manipulate data using DBT.
● Develop and maintain data models that support business requirements.
● Implement data quality checks and DBT tests to ensure the accuracy and completeness of our data.
● Work with stakeholders to understand and document their data requirements and provide solutions that meet their needs.
● Troubleshoot and resolve data issues as they arise.

SQL
MongoDB
PostgreSQL
Terraform
Python
Spark
AWS Cloud Architecture
Tableau
Big Data
Data Analytics
Senior Data Engineer
Semantix AI
09/2021 - 05/2023

● Built and deployed an ML project using Python, TensorFlow, Pandas, and SQL, with Azure Machine Learning for model development and deployment, to improve predictive accuracy for the Phoenix team of the Brain project, which conducts score analysis for legal entities of a Brazilian bank.
● Integrated APIs and managed batch processes, working with XML and JSON payloads and SOAP services.
● Developed ETL routines using PySpark, SQL, and Hadoop to streamline data processing and integration for the bank’s data engineering team, resulting in a 25% reduction in data processing time.
● Automated processes using Python and Bash scripting, increasing the operations team's productivity by 30%.
● Structured relational and non-relational databases using PostgreSQL and Apache HBase, developed new features, and maintained an application using Python and Spark, improving application performance for the product development team of a financial institution.

SQL
Spark
PySpark
Azure
AWS Cloud Architecture
Google Cloud
Database Development
Data Analytics
Machine Learning
Python
Data Engineer / Analyst Internship
DevTown
02/2021 - 08/2021

● Cleaned, transformed, and prepared complex data for analysis and created visualizations in Power BI for data exploration and storytelling, improving data comprehension and decision-making for the marketing analytics project of an educational institution.
● Wrote SQL queries against a PostgreSQL database and used Python's psycopg2 library together with PySpark to build data tables, improving data accessibility and analysis for the data science team.
● Migrated on-premises pipelines to Azure, improving scalability and reliability for the data engineering project.

Python
SQL
PySpark
Azure
PostgreSQL
Data Engineer/Analytics Engineer, Randon Companies Industry
Empresas Randon
07/2018 - 01/2021

● Processed, manipulated, and prepared data for analysis and created visualizations in Power BI for data exploration, improving data comprehension and decision-making for the marketing analytics project of an industrial company.
● Structured relational and non-relational databases using Microsoft SQL Server and Apache HBase, developed new features, and maintained an application using Python and Spark, improving application performance for the product development team.
● Supported data preparation in Azure and GCP environments using Databricks, improving data quality and accessibility for the data engineering team.
● Monitored supplier action plans and controlled project execution deadlines, ensuring timely project completion and supplier accountability for the supply chain management team.
● Managed project reports and quality indicators (KPIs), providing critical performance insights for project stakeholders and facilitating data-driven decision-making.

Python
SQL
HBase
Apache
Spark
Azure
Education
Specialization in Big Data Engineering
Faculdade Unyleya
01/2023 - 11/2023
Specialization in Biomedical Engineering
Faculdade Unyleya
01/2023 - 11/2023
AAS, Data Architecture
Faculdade Ampli
01/2023 - Present
BSc, Civil Engineering
Centro Universitario Ritter dos Reis
02/2016 - 12/2022