AI Learning Material Creator
An AI Learning Material Creator designs, produces, and iterates on educational content that teaches individuals and organizations …
Skill Guide
The engineering discipline of writing self-contained, reproducible, and pedagogically effective Python code snippets within interactive notebook environments (like Jupyter) to demonstrate algorithms, analyze data, or teach concepts.
Scenario
Given a public dataset (e.g., Titanic survival data), create a single Jupyter Notebook that loads, cleans, summarizes, and visualizes the data to answer a basic question.
Scenario
Compare the performance of two classification models (e.g., Logistic Regression vs. Random Forest) on a standardized task, with the notebook structured to allow easy swapping of parameters or datasets.
Scenario
Create a system where a parameterized Jupyter Notebook is executed nightly by a scheduler (e.g., Airflow) to generate a performance report, with outputs (HTML, PDF) automatically distributed.
The primary interactive development environments. JupyterLab is the standard for local development; VS Code offers superior debugging and code navigation; Colab provides free GPU/TPU access and easy sharing.
`nbqa` enforces style on notebooks. `Papermill` parameterizes and executes notebooks as jobs. `Jupytext` synchronizes notebooks with plain `.py` or `.md` files for clean version control.
The foundational stack for data manipulation (`pandas`/`numpy`), visualization (`seaborn`/`plotly` for interactivity), and ML demos (`scikit-learn`). Proficiency in these is non-negotiable for effective technical examples.
Thinking of notebooks as narratives (Literate Programming), where execution flow is a graph (DAG) to manage, and using platforms like Binder for creating zero-install, executable examples are key architectural patterns.
Answer Strategy
Demonstrate a structured debugging methodology. Start with the kernel and environment: 1) Check for missing package versions (`pip freeze`) and recommend a `requirements.txt` or `environment.yml`. 2) Inspect cell execution order and hidden global state by using 'Restart & Run All'. 3) Look for non-deterministic operations (e.g., unseeded random number generators, data fetches from live APIs without caching). The fix involves pinning dependencies, reordering cells for clarity, and seeding all random processes.
Answer Strategy
Testing communication and architectural skills. The answer should focus on separation of concerns and narrative flow. 'I structure it as a sequence of clearly demarcated sections using Markdown headers: Executive Summary (key plots and conclusions upfront), Data Acquisition & Validation, Methodology (with a high-level flowchart), Detailed Analysis, and Appendix (for heavy code/utilities). I use a consistent visual style for plots and hide implementation details in helper functions. The goal is for the notebook to be a standalone, linear story that can be consumed at different depths of technical detail.'
1 career found
Try a different search term.