♎
Limited AI
  • Machine Learning
    • Linear Model Cheating Sheet
    • Nonlinear Model Cheating Sheet
    • General Linear Model 1
    • General Linear Model 2
    • General Linear Model 3
    • Tree Based Methods
    • Tree Based Methods Supplement
    • XG,Cat,Light__Boosting
    • KNN&PCA
    • Model Performance
    • Model Evaluation
    • Code Practice
      • KNN
      • Decision Tree Python Code
    • Data and Feature Engineering
      • Handle Bias Data
      • Cold Start Problem
  • Deep Learning
    • Summary v2
    • Basic Neural Network
      • From Linear to Deep
      • Perceptron and Activation Function
      • NN network Details
      • Backpropagation Details
      • Gradient Vanishing vs Gradient Exploding
    • Basic CNN
      • Why CNN
      • Filter/ Convolution Kernel and Its Operation
      • Padding& Stride
      • Layers
      • Extra:From Fully Connected Layers to Convolutions
      • Extra: Multiple Input and Multiple Output Channels
    • Advance CNN
      • Convolutional Neural Networks(LeNet)
      • Deep Convolution Neural Networks(AlexNet)
      • Networks Using Blocks (VGG)
      • Network in Network(NiN)
      • Multi-Branch Networks(GoogLeNet&I mageNet)
      • Residual Networks(ResNet) and ResNeXt
      • Densely Connected Networks(DenseNet)
      • Batch Normalization
    • Basic RNN
      • Seq Model
      • Raw Text to Seq
      • Language Models
      • Recurrent Neural Networks(RNN)
      • Backpropagation Through Time
    • Advance RNN
      • Gated Recurrent Units(GRU)
      • Long Short-Term Memory(LSTM)
      • Bidirectional Recurrent Neural Networks(BRNN)
      • Encoder-Decoder Architecture
      • Seuqence to Sequence Learning(Seq2Seq)
    • Attention Mechanisms and Transformers
      • Queries, Keys, and Values
      • Attention is all you need
        • Attention and Kernel
        • Attention Scoring Functions
        • The Bahdanau Attention Mechanism
        • Multi-Head Attention
        • Self-Attention
        • Attention的实现
      • The Transformer Architecture
        • Extra Reading
        • 最短的最大路径长度
      • Large-Scaling Pretraning with Transformers
        • BERT vs OpenAI GPT vs ELMo
        • Decoder Model框架
        • Bert vs XLNet
        • T5& GPT& Bert比较
        • 编码器-解码器架构 vs GPT 模型
        • Encoder vs Decoder Reference
      • Transformers for Vision
      • Transformer for Multiomodal
    • NLP Pretraining
      • Word Embedding(word2vec)
        • Extra Reading
      • Approximate Training
      • Word Embedding with Global Vectors(GloVe)
        • Extra Reading
        • Supplement
      • Encoder(BERT)
        • BERT
        • Extra Reading
      • Decoder(GPT&XLNet&Lamma)
        • GPT
        • XLNet
          • XLNet架构
          • XLNet特点与其他比较
      • Encoder-Decoder(BART& T5)
        • BART
        • T5
  • GenAI
    • Introduction
      • GenAI Paper Must Read
      • GenAI六个阶段
    • Language Models Pre-training
      • Encoder-Decoder Architecture
      • Encoder Deep Dive
      • Decoder Deep Dive
      • Encoder VS Decoder
      • Attention Mechanism
      • Transformers
    • Example: Llama 3 8B架构
    • Fine-Tuning Generation Models
    • RAG and Adavance RAG
    • AI Agent
  • Statistics and Optimization
    • A/B testing
    • Sampling/ABtesting/GradientMethod
    • Gradient Decent Deep Dive
  • Machine Learning System Design
    • Extra Reading
    • Introduction
  • Responsible AI
    • AI Risk and Uncertainty
      • What is AI risk
      • General Intro for Uncertainty Quantification
      • Calibration
      • Conformal Prediction
        • Review the linear regression
        • Exchangeability
        • Split Conformal Prediction
        • Conformalized Quantile Regression
        • Beyond marginal coverage
        • Split Conformal Classification
        • Full Conformal Coverage
        • Cross-Validation +
        • Conformal Histgram Regression
    • xAI
      • SHAP value
  • Extra Research
    • Paper Reading
    • Reference
Powered by GitBook
On this page
  1. Machine Learning System Design

Extra Reading

Functional Requirements

Functional requirements describe what the system should do. They define the specific behaviors, functions, or tasks the system needs to accomplish. For a Machine Learning (ML) system, functional requirements might include:

  1. Input Data Requirements:

    • What data sources will be used?

    • What format will the data be in?

    • How often will the data be updated?

  2. Output Specifications:

    • What form should the output take?

    • What actions or decisions should the system support?

  3. Model Requirements:

    • What type of model is needed (e.g., classification, regression)?

    • What performance metrics are important (e.g., accuracy, precision, recall)?

  4. Integration Points:

    • How will the ML system integrate with other systems or components?

    • What APIs or interfaces are required?

Non-Functional Requirements

Non-functional requirements describe how the system performs a task rather than the specific behaviors or functions. They include quality attributes, constraints, and performance measures. For an ML system, non-functional requirements might include:

  1. Performance:

    • How quickly should the system produce results?

    • What are the latency requirements?

  2. Scalability:

    • How should the system handle increasing amounts of data?

    • Can the system scale horizontally or vertically?

  3. Reliability and Availability:

    • What is the expected uptime?

    • How will the system handle failures?

  4. Security:

    • How will data be secured in transit and at rest?

    • What authentication and authorization mechanisms are needed?

  5. Maintainability:

    • How easy is it to update the model?

    • What processes are in place for monitoring and improving model performance?

  6. Compliance:

    • Are there any regulatory requirements that need to be met (e.g., GDPR)?

Driving the Conversation

Once you have clarified these requirements, you can structure the conversation around key aspects of ML system design:

  1. Data Collection and Preparation:

    • Discuss the sources, volume, and quality of the data.

    • Outline the steps for data cleaning, transformation, and augmentation.

  2. Feature Engineering:

    • Identify the key features that will drive model performance.

    • Discuss methods for feature extraction and selection.

  3. Model Selection and Training:

    • Compare different algorithms and justify your choice.

    • Explain the training process, including any hyperparameter tuning.

  4. Evaluation and Validation:

    • Define how the model's performance will be measured.

    • Discuss validation techniques such as cross-validation and A/B testing.

  5. Deployment:

    • Describe the steps for deploying the model into a production environment.

    • Consider how the model will be monitored and maintained post-deployment.

By starting with a clear understanding of both functional and non-functional requirements, you can guide the discussion through the critical stages of ML system design. This approach ensures that you address all necessary aspects and demonstrate a thorough understanding of the entire ML lifecycle.

PreviousMachine Learning System DesignNextIntroduction

Last updated 8 months ago