Commit Graph

15 Commits

Author SHA1 Message Date
Littlefisher
75ec466daa feat: Add book recommendation for eBPF observability and introduce configuration for GPU flamegraph 2025-11-03 22:13:59 -08:00
Littlefisher
cf6abdeaf2 Implement feature X to enhance user experience and optimize performance 2025-10-28 20:35:37 -07:00
Littlefisher
5afd7fd348 Enhance Flamegraph Documentation and GPU Profiling Scripts
- Added an example flamegraph for Qwen3 LLM inference, highlighting key insights and performance bottlenecks.
- Updated README.md to include detailed explanations of CPU and GPU profiling results, emphasizing the correlation between CPU stacks and GPU kernels.
- Modified gpuperf.py to ensure absolute paths are used for output files, improving reliability across different working directories.
- Enhanced merge_gpu_cpu_trace.py to strip ANSI escape sequences from CPU stack traces, ensuring cleaner output for analysis.
- Introduced a new SVG file for the Qwen3 flamegraph, providing a visual representation of profiling data with interactive features.
2025-10-28 13:23:16 -07:00
Littlefisher
ad583766a8 Implement structural updates and optimizations across multiple modules 2025-10-28 12:40:06 -07:00
Littlefisher
a9af348a46 feat: Enhance CUPTI activity selection and merge logic for improved profiling accuracy 2025-10-27 22:58:57 -07:00
Littlefisher
f5ee20e333 fix: Ensure CUPTI flushes events before stopping CPU profiler; update timestamp handling to use microseconds 2025-10-27 20:05:45 -07:00
Littlefisher
05ca05aa7c Refactor code structure for improved readability and maintainability 2025-10-27 19:41:40 -07:00
yunwei37
b8cc834d7f Enhance GPU and NPU kernel driver documentation and monitoring scripts
- Updated README.zh.md for GPU kernel driver to improve clarity and formatting.
- Added nvidia_driver.bt script for monitoring NVIDIA proprietary GPU driver activity using kernel probes.
- Revised README.md for NPU kernel driver to enhance explanations and correct minor grammatical issues.
2025-10-13 07:18:50 -07:00
yunwei37
277ecbaf9d Add HID-BPF tutorial and implementation for virtual mouse input modification
- Introduced a comprehensive tutorial in README.md explaining how to fix broken HID devices using eBPF without kernel patches.
- Implemented a userspace program (hid-input-modifier.c) that creates a virtual HID mouse using the uhid interface and sends synthetic mouse events.
- Developed a BPF program (hid-input-modifier.bpf.c) that intercepts HID events and modifies mouse movement data, effectively doubling the X and Y movement.
- Created necessary header files (hid_bpf.h, hid_bpf_defs.h, hid_bpf_helpers.h) to define structures and helper functions for the BPF program.
- Added functionality to find and manage the virtual HID device, ensuring seamless integration with the BPF program.
2025-10-05 22:40:58 -07:00
yunwei37
a2e7b20d63 Add eBPF tutorial templates and update SUMMARY files
- Created SUMMARY.md.template and SUMMARY.zh.md.template for eBPF tutorial.
- Updated generate_toc.py to generate table of contents for English and Chinese versions.
- Added configuration files for new eBPF examples, categorizing them by level and type.
- Updated SUMMARY.md and SUMMARY.zh.md to reflect new lessons and reorganized sections.
- Introduced new features related to GPU and tracing in the tutorial.
2025-10-05 20:56:50 -07:00
yunwei37
29e8113081 docs: add tutorial for tracking Intel NPU kernel driver operations using eBPF 2025-10-05 19:40:13 -07:00
yunwei37
8c38c01cae Refactor GPU kernel driver documentation: Update eBPF tutorial for monitoring GPU activities, streamline content, and enhance clarity on kernel tracepoints and their applications across Intel, AMD, and Nouveau drivers. 2025-10-05 19:23:31 -07:00
yunwei37
131b79e13f docs: update eBPF tutorial to enhance GPU monitoring insights and script usage 2025-10-05 18:59:10 -07:00
yunwei37
f3c4a3ee64 Refactor test output in arena_list.c to remove redundant print statements
- Removed unnecessary print statements for arena sum and number of elements in the test_arena_list_add_del function.
- Simplified output to focus on essential test results, improving clarity and conciseness of the test logs.
2025-10-06 06:24:08 +08:00
yunwei37
6042594b8c Add eBPF tutorial and monitoring scripts for GPU activity
- Introduced a comprehensive README.md detailing the use of eBPF for monitoring GPU activity through kernel tracepoints.
- Added bpftrace scripts for monitoring AMD GPU operations, including buffer object creation, command submission, and interrupts.
- Created a bpftrace script for tracking DRM GPU scheduler activity across all modern GPU drivers.
- Developed a bpftrace script to monitor display vertical blanking events for frame timing analysis.
- Implemented a bpftrace script for Intel i915 GPU activity, focusing on GEM object management, memory operations, and page faults.
2025-10-05 00:26:31 -07:00