bpf-developer-tutorial/src/xpu/flamegraph/qwen3_flamegraph.svg at 6d3ba3ea341ce9a2193ea3322d58ab9481c2e2a8

mirror of https://github.com/eunomia-bpf/bpf-developer-tutorial.git synced 2026-02-11 06:05:19 +08:00

Littlefisher 5afd7fd348 Enhance Flamegraph Documentation and GPU Profiling Scripts

- Added an example flamegraph for Qwen3 LLM inference, highlighting key insights and performance bottlenecks.
- Updated README.md to include detailed explanations of CPU and GPU profiling results, emphasizing the correlation between CPU stacks and GPU kernels.
- Modified gpuperf.py to ensure absolute paths are used for output files, improving reliability across different working directories.
- Enhanced merge_gpu_cpu_trace.py to strip ANSI escape sequences from CPU stack traces, ensuring cleaner output for analysis.
- Introduced a new SVG file for the Qwen3 flamegraph, providing a visual representation of profiling data with interactive features.
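
The two script changes above (absolute output paths in gpuperf.py, ANSI stripping in merge_gpu_cpu_trace.py) can be illustrated with a minimal sketch. This is not the code from the commit: the helper names `strip_ansi` and `resolve_output_path` and the regular expression are assumptions chosen for illustration, using only the Python standard library.

```python
import os
import re

# Assumption: the escape sequences in question are CSI sequences such as
# "\x1b[31m" (set color) or "\x1b[0m" (reset). Layout of a CSI sequence:
# ESC '[' + parameter bytes + intermediate bytes + one final byte.
ANSI_CSI_RE = re.compile(r"\x1b\[[0-9;?]*[ -/]*[@-~]")


def strip_ansi(text: str) -> str:
    """Return text with ANSI CSI escape sequences removed (hypothetical helper)."""
    return ANSI_CSI_RE.sub("", text)


def resolve_output_path(path: str) -> str:
    """Expand '~' and make the path absolute (hypothetical helper).

    Resolving once, up front, keeps the output location stable even if the
    process or a child process later changes its working directory.
    """
    return os.path.abspath(os.path.expanduser(path))


if __name__ == "__main__":
    colored_frame = "\x1b[31mmain\x1b[0m;forward"
    print(strip_ansi(colored_frame))
    print(resolve_output_path("merged_trace.folded"))
```

Stripping the sequences before merging matters because flamegraph tools treat each distinct frame string as a separate frame, so a colored and an uncolored copy of the same function name would otherwise be counted apart.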

2025-10-28 13:23:16 -07:00

18 KiB
