patotricks15

Data Scientist

Data Scientist with over 3 years of experience, and a bachelor degree in Economics Science. My expertise focuses on developing Machine Learning models, Econometrics, Exploratory Data Analysis, and Generative AI. Proficient in Python, R, STATA and SQL, with knowledge in databases like MySQL, PostgreSQL and MongoDB, cloud services as Google Cloud Platform and Amazon Web Services.






Experience: 4 years

Yearly salary: $50,000

Hourly rate: $50

Nationality: 🇧🇷 Brazil

Residency: 🇧🇷 Brazil


Experience

Researcher - Technology
CNPQ
2023 - 2024
"Interactive Geology: Conscious Society" 1) Database Creation for Mineral Information Led the data science team in establishing a comprehensive database of minerals used on the website, facilitating user access to detailed and accurate mineral information, enhancing educational content and user engagement. 2) Generative AI Directed the implementation of Generative AI technologies (LangChain and Google Vertex AI) for the creation of text, images, and chatbot functionalities on the website. This initiative significantly improved user interaction, provided personalized content, and streamlined user inquiries, setting a new standard for digital engagement in the mineral information sector.
Data Scientist
ZapGPT
2023 - 2023
Leadership: Established the data science department within an early-stage startup, fostering collaboration with Business, Product, Marketing, and Software Engineering teams, enhancing communication, ownership, leadership, and collaboration skills. Generative AI: Developed backend systems to create AI Agents from user-submitted documents, refining prompts, and establishing best practices for developing LLM-powered POC/MVP/products using Hugging Face and LangChain. Machine Learning Research: Employed Machine Learning techniques to estimate retention, churn, forecast ROI, and subscription probability using TensorFlow and scikit-learn, delivering information 300% faster with 80% fewer errors than traditional Excel sheets. Applied Technologies: Leveraged LangChain, Hugging Face, Python, Scikit-learn, Numpy, MySQL to develop Generative AI applications and machine learning models, and used Streamlit to build POCs and MVPs.
Researcher - Applied Economics
CNPQ
2021 - 2023
"Assessment of Poverty in the Brazilian Economy Using Microdata from Continuous PNAD and Machine Learning." Main Field: Applied Social Sciences Field: Economics Subfield: Quantitative Methods in Economics Specialty: Mathematical, Econometric, and Statistical Methods and Models
Data Scientist
KeyCash
2021 - 2021
Exploratory Data Analysis: Conducted exploratory data analysis using Python, pandas, numpy, and Jupyter Notebook to uncover insights, addressing critical business questions and supporting decision-making in real estate financing. Artificial Intelligence: Implemented computer vision using Street View and Google Cloud Vision APIs for advanced pattern recognition, reducing house photo analysis time by 92%. Data Engineering: Created Python scripts for automated document collection and spreadsheet manipulation through web scraping and crawlers, significantly improving operational efficiency and data handling accuracy. Applied Technologies: Utilized Python, Google Cloud Platform, Open Vision, MySQL, Docker, Airflow, requests, and BeautifulSoup for computer vision and ETL pipeline projects.
Python teacher
SEG Rural Geophysics Student Chapter
2021 - 2022
Data Scientist
RankMyAPP
2021 - 2023
Product Building: Developed internal and external MVPs, POCs, and tools for geolocated keyword search; created tools for exploratory data analysis focusing on sentiment analysis (NLP) in Play Store comments, generating revenue for the company. Generative AI: Identified bottlenecks in customer service and conceptualized a product using LLM and NLP, reducing costs by about 90%. Exploratory Data Analysis: Produced analyses and reports for the marketing team using statistical analysis, which were published on the company’s website and blogs, increasing visibility. Applied Technologies: Utilized Python, Scikit-learn, Numpy, Matplotlib, NLTK, OpenAI API, GeoPandas, Plotly, Leaflet, MongoDB, Airflow for machine learning models, geospatial analysis, and ETL pipelines in data engineering projects.

Skills

analyst
backend
big-data
data viz
dataops
docker
economy-designer
fintech
nosql
python
quantitative-analyst
sql
data-science
english
portuguese
spanish