bpf-developer-tutorial

kernel/bpf-developer-tutorial

mirror of https://github.com/eunomia-bpf/bpf-developer-tutorial.git synced 2026-02-03 18:24:27 +08:00

Commit Graph

Author	SHA1	Message	Date
Littlefisher	5afd7fd348	Enhance Flamegraph Documentation and GPU Profiling Scripts - Added an example flamegraph for Qwen3 LLM inference, highlighting key insights and performance bottlenecks. - Updated README.md to include detailed explanations of CPU and GPU profiling results, emphasizing the correlation between CPU stacks and GPU kernels. - Modified gpuperf.py to ensure absolute paths are used for output files, improving reliability across different working directories. - Enhanced merge_gpu_cpu_trace.py to strip ANSI escape sequences from CPU stack traces, ensuring cleaner output for analysis. - Introduced a new SVG file for the Qwen3 flamegraph, providing a visual representation of profiling data with interactive features.	2025-10-28 13:23:16 -07:00
Littlefisher	f5ee20e333	fix: Ensure CUPTI flushes events before stopping CPU profiler; update timestamp handling to use microseconds	2025-10-27 20:05:45 -07:00
Littlefisher	05ca05aa7c	Refactor code structure for improved readability and maintainability	2025-10-27 19:41:40 -07:00

3 Commits