👉 CONTACT ME! 👉 Book a Consultation, or use this Form 🚀
I am an innovative and dynamic Data Scientist offering a diverse range of services, including project development, teaching, workshops, technical writing, and career coaching. My skill set includes, but is not limited to:
- ✅ Data Science, Analytics & ML: Python, TensorFlow, Scikit-learn
- ✅ AI: LangChain, LlamaIndex, Hugging Face, Transformers, Vector Databases
- ✅ Key Domains: Regression, Classification, NLP, LLM, RAG, Computer Vision, Neural Networks, Ensemble Methods, Clustering, Dimensionality Reduction
- ✅ Data Engineering: dbt, Terraform, SQL, BigQuery, PySpark, Databricks
- ✅ MLOps: MLflow, Prefect, Comet, Docker, Kubernetes
- ✅ APIs: Flask, FastAPI
- ✅ Apps: Streamlit, Gradio
- ✅ Cloud Platforms: GCP, AWS
- ✅ Version Control: Git
💰 My personal end-to-end projects can be found in these repositories. Feel free to click ⭐ if you like them 😎
Project Name | Main Libraries/Tools | Cloud Service | App | DevOps Best Practices |
---|---|---|---|---|
ML/MLOps | | | | |
MLOps Credit Default | Scikit-learn, LightGBM, MLflow, Databricks | AWS/Databricks | | Experiment Tracking, Model Registry, Model/Data Monitoring, Data Validation, Linting, Formatting, Testing, Error Handling, Pre-Commit, IaC, CI/CD |
Medical Insurance Costs Prediction | Scikit-learn, TensorFlow, SageMaker, Comet ML, Flask | AWS | | Experiment Tracking, Model Registry, Model/Data Monitoring, Linting, Formatting, Testing, Error Handling, Coverage, CI/CD |
Stroke Prediction | Scikit-learn, XGBoost, SageMaker, Comet ML, Flask, Docker | AWS | | Experiment Tracking, Model Registry, Model Monitoring, Containerization, Testing, Error Handling |
Car Price Prediction | Scikit-learn, TensorFlow, MLflow, Prefect, Flask, Docker, Grafana, Terraform | AWS | | Experiment Tracking, Model Registry, Model Monitoring, Orchestration, Containerization, Linting, Formatting, Testing, Error Handling, IaC, CI/CD |
Taxi Rides Prediction | Scikit-learn, TensorFlow, MLflow, Prefect, FastAPI, Docker | GCP | | Experiment Tracking, Model Registry, Model Monitoring, Orchestration, Containerization, Error Handling |
Music Clustering | Scikit-learn, FastAPI, Docker | AWS | Streamlit | Containerization, CI/CD |
Birds Classification | PyTorch | | Gradio | |
Food Prediction | Scikit-learn, TensorFlow, OpenCV, FastAPI, Docker | GCP | Streamlit | Containerization |
LLM, RAG and Fine-tuning | | | | |
RAG Hybrid Search and Semantic Caching | Qdrant, FastEmbed, SPLADE, Hugging Face Transformers | | | Error Handling, Linting, Formatting |
Multimodal Bill Scan System | AWS Bedrock, AWS DynamoDB, AWS SQS/SNS, AWS CDK, Claude 3 Sonnet | AWS | | Error Handling, Linting, Formatting, IaC |
IaC in RAG Applications with Terraform | AWS Bedrock, LangChain, AWS OpenSearch, Terraform, Titan | AWS | | Testing, Error Handling, Linting, Formatting, IaC |
Scalable RAG in AWS with Fargate | OpenAI, LlamaIndex, Qdrant, AWS CDK/Fargate, FastAPI | AWS | | Testing, Error Handling |
RAG Deployment with Azure Functions | OpenAI, LangChain, Qdrant, Azure Functions App | Azure | | Linting, Formatting, Testing, Error Handling |
Scalable RAG with Kubernetes | OpenAI, LlamaIndex, Qdrant, Docker, FastAPI, GKE | GCP | Streamlit | Containerization, Linting, Formatting, Testing, Error Handling, CI/CD |
Research Papers Semantic Search | OpenAI, LangChain, Qdrant, Docker, AWS API Gateway | AWS | Streamlit | Containerization, Linting, Formatting, Testing, Error Handling |
Video Summarization | Hugging Face Transformers, Whisper, LangChain, ChromaDB | | Streamlit | Error Handling |
Multimodal RAG with Video Frames | Gemini, LlamaIndex, Qdrant | | | |
Books Reranking Semantic Search | OpenAI, LlamaIndex, Deep Lake | | | |
RAG Evaluation with Ragas | OpenAI, Hugging Face Transformers, Faiss, LangChain, Ragas | | | |
PII RAG LlamaIndex Milvus | OpenAI, Presidio, LlamaIndex, Milvus | | | |
Multimodal RAG with PyMuPDF | OpenAI, Qdrant, LlamaIndex, PyMuPDF | | | |
Agentic RAG LlamaIndex Milvus | OpenAI, Claude, LlamaIndex, Milvus | | | |
Agentic RAG with LangChain | OpenAI, Groq, LangChain, Pinecone | | | |
Agentic RAG with CrewAI | OpenAI, LangChain, Qdrant, CrewAI Agents | | | |
Fine Tuning Gemma 2B | Hugging Face Transformers, PEFT (LoRA/QLoRA) | Hugging Face | | |
Data Analysis + Modeling | | | | |
News Classification | Scikit-learn (Multinomial Naive Bayes), TensorFlow (CNN, RNN, feedforward) | | Streamlit | |
Breast Cancer Classification | Scikit-learn, Spark | IBM | | |
Bank Churn Classification | Scikit-learn, LightGBM, XGBoost, CatBoost | | | |
Data Engineering | | | | |
Hotel Reviews | Prefect, Spark, SQL, BigQuery, dbt, Terraform, Looker | GCP | | Orchestration, Linting, Formatting, Error Handling, Pre-Commit, IaC, CI/CD |
Air Quality Switzerland | Mage, dbt, SQL, BigQuery, Docker, Terraform, Looker | GCP | | Orchestration, IaC, Containerization, CI/CD |
Miscellaneous | | | | |
Justicio Web Scraping | Beautiful Soup, MySQL | | | Error Handling |
💸 Additionally, you can find my Power BI projects:
- Personal Finance: Analysis and Comparison of Income, Bills, Profits and Available Money
- Product Sales Comparison: Comparison of product sales using DAX functions
Last but not least, I also have a Tableau portfolio that uses groups, sets, blends, joins, table calculations, storylines, parameters, animations, and other advanced functions.