Fernando Rodrigues Nepomuceno
Data Scientist
More than nineteen years of experience in information technology, working as a technical and support analyst, business analyst, BI/data analyst, data engineer, and data scientist. Experience in financial, utilities, retail, agriculture, and automotive companies.
Certifications
B1 Intermediate
10/07/2021
Tech stack
SQL (6)
Python (5)
Cloud Computing (4)
Microsoft Power BI (3)
ETL Implementation & Design (3)
AWS CLI (3)
APIs (3)
R (3)
Kibana (2)
XGBoost (2)
Redis (2)
Sklearn (2)
Google BigQuery (2)
ETL (2)
Oracle Database (1)
Bash (1)
Linux (1)
Cloudera (1)
Tableau (1)
SAS (1)
Web Scraping (1)
Spark (1)
Jenkins (1)
MySQL (1)
AWS Cloud Architecture
Big Data Architecture
Data Science
PySpark
Data Engineering
Experience
Data Scientist
Radware
08/2022 - Present

- Perform exploratory and statistical analysis on petabytes of data.
- Develop PoCs for machine learning processes.
- Extract ad hoc data reports.
- Serve as escalation point of contact for customers on data analytics and security settings.
- Provide technical support and guidance to customers.
- Manage customer communications for analytics and security settings.
- Collaborate with internal organizations on projects and initiatives.
- Identify and document process improvements.
- Identify expansion opportunities, future use cases, and implementation rollouts with customers.

Google BigQuery
Python
SQL
Cloud Computing
Kibana
Redis
Sklearn
XGBoost
Data Scientist
Syngenta
07/2020 - 07/2022

- Acted as data specialist on a Decision Science project for the R&D area, with the goal of fine-tuning the locations of product tests and market share areas. Responsible for solution design and data modeling using environmental (weather/soil) and market data, which fed a PoC machine learning approach in AWS Cloud. The final product was a Power BI dashboard with clustering recommendations for the best regions.
- Developed management and operational dashboards for several areas within the R&D organization: HR, HSE, Field Operations, and Project Management.
- Built ETL processes for unstructured and structured data coming from APIs and legacy system databases. Ran ad hoc queries using advanced SQL scripts and Python solutions for numerous areas. Worked with statisticians to define best practices for deploying models in production.

AWS CLI
APIs
ETL Implementation & Design
Cloud Computing
Microsoft Power BI
SQL
R
Python
Data Engineer
Semantix
09/2019 - 06/2020

- Developed pipelines to ingest files generated by legacy systems and relational databases (Oracle, SQL Server, and Apache Hive/Impala/BigQuery).
- Ingested data using Apache Sqoop and Apache NiFi, with shell scripts managed and scheduled through the Jenkins automation server.
- Designed and developed automated monitoring of downstream/upstream processes, with outage and anomaly alerts, using Python and Apache Spark.
- Built web scraping processes using Python, Selenium, and Beautiful Soup to extract information related to legal proceedings.

ETL Implementation & Design
Bash
Linux
SQL
Web Scraping
Spark
Python
Jenkins
Oracle Database
Cloudera
Data Analyst
Cielo
08/2018 - 07/2019

- Mapped the processes of the charging area, resulting in automated processes for the department.
- Built ETL processes in SAS Enterprise Guide 7.1 with files from mainframe, TXT, CSV, spreadsheets, and positional files, using PROC SQL and data set manipulation with macros and SAS statements.
- Structured data pipelines to generate metrics and indexes for the operational and management levels.

Tableau
Microsoft Power BI
SQL
SAS
ETL
BI/ETL/Data Integration Analyst
ITWV
02/2018 - 05/2018

- Assigned to an analytics deployment project, responsible for table analysis and for building views for information extraction and data joins in MySQL.
- Built and automated ETL processes with data sources from Hadoop, prediction model files, legacy system files, and MySQL databases, using Oracle Data Integrator as the integration tool.
- Built tables in the Cloudera Hadoop distribution using Hive and Impala. Handled file transfers between Hadoop clusters, directory creation, and bash scripts in an Oracle Linux environment.
- Performed dimensional modeling using a star schema for fact and dimension tables. Built panels and dashboards using the Oracle Analytics Cloud data visualization tool.

Bash
Linux
Cloudera
Oracle Database
MySQL
ETL
Education
Bachelor in Information Systems
Universidade de Mogi das Cruzes
02/2007 - 12/2010