courses I have taken
Udacity
Story Telling (Data Visualization) Books
Data Visualization
- Awesome Panel
- hvplot user guide
- Beautiful Python Visualization Dashboard With Panel/Hvplot: Thu Vu
- Panel
- Data science
ML Interpretability:
Model Versioning
Version control in data science can be tricky, because there are many pieces involved that can be hard to track, such as large amounts of data, model versions, seeds, and hyperparameters. The following resources offer useful methods and tools for managing model versions and large amounts of data
Machine Learning University with Visual explanation
Machine Learning Algorithms
- Linear Regression
Data Versioning tools
ML Debugging
hyperparameters tuning
Model Calibration
Top 6 Errors Novice Machine Learning Engineers Make
Machine Learning Systems Design
Deployment - Prediction Service
Log and wait
Machine learning books
- Machine Learning from the scratch Derivations in concept and Code
- Deep Learning Fundamentals with Sebastian Raschka
Automated Canary Analysis with Kayenta
Privacy
Kubernates
- An introduction to Kubernates by Jeremy Jordan
- Inspector gadget is a tool designed to introspect and debug Kubernetes applications using eBPF
- Learnk8s
Feature Store
- Feature Store: A special kind of database (such as SageMaker Feature Store) that can store machine learning inputs. It is often characterized by support for batch (training) workloads, low-latency (real-time inference) workloads, and search.
- Feature Group: A logical grouping of features that can be discovered together. You might group features by topic or category, such as “weather” or “top secret”.
- Feature Stores: A Hierarchy of Needs
- Developing Scalable Feature Engineering DAGs
- Eugen Yan
- Amazon Sagemaker
Transfer Learning
Gradient Descent optimization optimization algorithms
Fine tuning models
Prebuilt Docker Images
- Prebuilt Docker Images
- Prebuilt Amazon SageMaker Docker Images for Scikit-learn and Spark ML
- Using Docker containers with SageMaker
- Building your own TensorFlow container
- Extending our PyTorch containers
- Example Notebooks: Use Your Own Algorithm or Model
- Advanced Functionality in SageMaker
Useful Pandas
Useful Stack Overflow
Rebound is a command-line tool that instantly fetches Stack Overflow results when an exception is thrown. Just use the rebound command to execute your file.
Useful Sagemaker
- Instance auto shutdown
- Save Costs
- Develop, Train, Optimize and Deploy Scikit-Learn Random Forest
- AWS EVENTS
- certification resources
Useful Bayesian
Distributed Databases
Useful Autogluon
EHR FHIR
Useful Stack Overflow
The Practical Guides for Large Language Models
- [https://github.com/masinde70/LLMsPracticalGuide]