Data Science Resources

courses I have taken

Udacity

Story Telling (Data Visualization) Books

Data Visualization

ML Interpretability:

Model Versioning

Version control in data science can be tricky, because there are many pieces involved that can be hard to track, such as large amounts of data, model versions, seeds, and hyperparameters. The following resources offer useful methods and tools for managing model versions and large amounts of data

Machine Learning University with Visual explanation

Machine Learning University

Machine Learning Algorithms

Linear Regression
- Linear Regression

Data Versioning tools

ML Debugging

hyperparameters tuning

hyperparameters tuning

Model Calibration

Machine learning books

Automated Canary Analysis with Kayenta

Automated Canary Analysis

Privacy

Privacy-preserving AI for Synthetic and Anonymous Health Data

Kubernates

Feature Store

Feature Store: A special kind of database (such as SageMaker Feature Store) that can store machine learning inputs. It is often characterized by support for batch (training) workloads, low-latency (real-time inference) workloads, and search.
Feature Group: A logical grouping of features that can be discovered together. You might group features by topic or category, such as “weather” or “top secret”.

Transfer Learning

Gradient Descent optimization optimization algorithms

An overview of gradient descent optimization algorithms

Fine tuning models

Prebuilt Docker Images

Useful Pandas

Useful Pandas

Useful Stack Overflow

Rebound is a command-line tool that instantly fetches Stack Overflow results when an exception is thrown. Just use the rebound command to execute your file.

Rebound

Useful Sagemaker

Useful Bayesian

Aki Vehtari

Distributed Databases

Useful Autogluon

Autogluon

EHR FHIR

Useful Stack Overflow

Stack Command line

The Practical Guides for Large Language Models

[https://github.com/masinde70/LLMsPracticalGuide]

courses I have taken#

Story Telling (Data Visualization) Books#

Data Visualization#

ML Interpretability:#

Model Versioning#

Machine Learning University with Visual explanation#

Machine Learning Algorithms#

Data Versioning tools#

ML Debugging#

hyperparameters tuning#

Model Calibration#

Top 6 Errors Novice Machine Learning Engineers Make#

Machine Learning Systems Design#

Log and wait#

Machine learning books#

Automated Canary Analysis with Kayenta#

Privacy#

Kubernates#

Feature Store#

Transfer Learning#

Gradient Descent optimization optimization algorithms#

Fine tuning models#

Prebuilt Docker Images#

Useful Pandas#

Useful Stack Overflow#

Useful Sagemaker#

Useful Bayesian#

Distributed Databases#

Useful Autogluon#

EHR FHIR#

Useful Stack Overflow#

The Practical Guides for Large Language Models#

courses I have taken

Story Telling (Data Visualization) Books

Data Visualization

ML Interpretability:

Model Versioning

Machine Learning University with Visual explanation

Machine Learning Algorithms

Data Versioning tools

ML Debugging

hyperparameters tuning

Model Calibration

Top 6 Errors Novice Machine Learning Engineers Make

Machine Learning Systems Design

Log and wait

Machine learning books

Automated Canary Analysis with Kayenta

Privacy

Kubernates

Feature Store

Transfer Learning

Gradient Descent optimization optimization algorithms

Fine tuning models

Prebuilt Docker Images

Useful Pandas

Useful Stack Overflow

Useful Sagemaker

Useful Bayesian

Distributed Databases

Useful Autogluon

EHR FHIR

Useful Stack Overflow

The Practical Guides for Large Language Models