You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardexpand all lines: .github/ISSUE_TEMPLATE/400-bug report.yml
+9
Original file line number
Diff line number
Diff line change
@@ -30,6 +30,15 @@ body:
30
30
</details>
31
31
validations:
32
32
required: true
33
+
- type: textarea
34
+
attributes:
35
+
label: Model Input Dumps
36
+
description: |
37
+
If you are facing crashing due to illegal memory access or other issues with model execution, vLLM may dump the problematic input of the model. In this case, you will see the message `Error in model execution (input dumped to /tmp/err_xxx.pkl)`. If you see this message, please zip the file (because GitHub doesn't support .pkl file format) and upload it here. This will help us to reproduce the issue and facilitate the debugging process.
Copy file name to clipboardexpand all lines: .github/PULL_REQUEST_TEMPLATE.md
+10
Original file line number
Diff line number
Diff line change
@@ -39,6 +39,16 @@ FIX #xxxx (*link existing issues this PR will resolve*)
39
39
<li>Please add documentation to <code>docs/source/</code> if the PR modifies the user-facing behaviors of vLLM. It helps vLLM user understand and utilize the new features or changes.</li>
40
40
</ul>
41
41
42
+
<h3>Adding or changing kernels</h3>
43
+
<p>Each custom kernel needs a schema and one or more implementations to be registered with PyTorch.</p>
44
+
<ul>
45
+
<li>Make sure custom ops are registered following PyTorch guidelines: <a href="https://pytorch.org/tutorials/advanced/cpp_custom_ops.html#cpp-custom-ops-tutorial">Custom C++ and CUDA Operators</a> and <a href="https://docs.google.com/document/d/1_W62p8WJOQQUzPsJYa7s701JXt0qf2OfLub2sbkHOaU">The Custom Operators Manual</a></li>
46
+
<li>Custom operations that return <code>Tensors</code> require meta-functions. Meta-functions should be implemented and registered in python so that dynamic dims can be handled automatically. See above documents for a description of meta-functions.</li>
47
+
<li>Use <a href="https://pytorch.org/docs/stable/library.html#torch.library.opcheck"><code>torch.libary.opcheck()</code></a> to test the function registration and meta-function for any registered ops. See <code>tests/kernels</code> for examples.</li>
48
+
<li>When changing the C++ signature of an existing op, the schema must be updated to reflect the changes.</li>
49
+
<li>If a new custom type is needed, see the following document: <a href="https://docs.google.com/document/d/18fBMPuOJ0fY5ZQ6YyrHUppw9FA332CpNtgB6SOIgyuA">Custom Class Support in PT2</a>.
50
+
</ul>
51
+
42
52
<h3>Notes for Large Changes</h3>
43
53
<p>Please keep the changes as concise as possible. For major architectural changes (>500 LOC excluding kernel/data/config/test), we would expect a GitHub issue (RFC) discussing the technical design and justification. Otherwise, we will tag it with <code>rfc-required</code> and might not go through the PR.</p>
0 commit comments