Draft for vector calling convention. #171

Hsiangkai · 2021-01-21T13:48:31Z

Describe the purpose of vector registers and the calling convention for vector arguments.

kito-cheng · 2021-01-21T15:34:55Z

Could you add a description of vector tuple types (NF > 1)?

ebahapo · 2021-01-21T23:49:02Z

Also, please explain briefly that LMUL < 1 is the same as LMUL = 1.

The maximum LMUL is 8. We need 16 vector registers for two LMUL-8 arguments. The modification follows the proposal of psABI in riscv-non-isa/riscv-elf-psabi-doc#171 Differential Revision: https://reviews.llvm.org/D95134

Hsiangkai · 2021-01-25T03:20:46Z

> Also, please explain briefly that LMUL < 1 is the same as LMUL = 1.

I have described it as "Vectors that are LMUL = 1 or fractional LMUL are passed in a single vector argument register."

rofirrim · 2021-01-29T09:37:04Z

riscv-elf.md

+size of the elements in the vector. The addresses of the vector or mask values on
+the stack are passed according to the integer calling convention.
+
+Vector tuple types for Zvlsseg will be decoupled as `NF` vector values when passing


I'm not a native speaker, but I wonder if "flattened" might be a better term than "decoupled" (the concept is already used in the Hard FP ABI above).

rofirrim · 2021-01-29T09:37:35Z

Except for a minor comment, overall looks good to me.

jrtc27 · 2021-03-14T17:07:50Z

riscv-elf.md

@@ -91,6 +93,17 @@ The Floating-Point Control and Status Register (fcsr) must have thread storage
 duration in accordance with C11 section 7.6 "Floating-point environment
 <fenv.h>".

+Vector Register Convention <a name=vector-register-convention>


Please mark all these as "(Draft)" for now until the extension is ratified.

jrtc27 · 2021-03-14T17:08:55Z

riscv-elf.md

+v1-v7   |              | Temporary registers          | No
+v8-v23  |              | Argument registers           | No
+v24-v31 |              | Temporary registers          | No


It'd be nice if "va0" were v10 to match x and f, but given this nicely splits things up into quarters and we have far more vector argument registers than integer and float ones anyway I think this is probably the right choice, and at the end of the day it really doesn't matter.
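
As a rough illustration of the register split discussed above, the following Python sketch classifies a vector register number by its role in the draft table. The function name and the role strings are hypothetical, chosen only for this example; the ranges come from the table rows quoted in this thread, and v0 is treated as described in the table footnote.

```python
# Sketch of the draft vector register convention quoted in this thread.
# The function name and role strings are hypothetical, not psABI terms.

def vector_register_role(n: int) -> str:
    """Return the draft role of vector register v<n>."""
    if not 0 <= n <= 31:
        raise ValueError(f"v{n} is not a vector register")
    if n == 0:
        # v0 is the mask register and the first mask argument;
        # when not needed as a mask it is treated as a temporary.
        return "mask/temporary"
    if 8 <= n <= 23:
        return "argument"
    # v1-v7 and v24-v31 are temporaries in the draft table.
    return "temporary"


roles = [vector_register_role(n) for n in range(32)]
argument_count = roles.count("argument")
```

With these ranges the argument registers are exactly the middle two quarters of the register file, which is the "splits things up into quarters" property mentioned in the comment.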

jrtc27 · 2021-03-14T17:14:12Z

riscv-elf.md

+v8-v23  |              | Argument registers           | No
+v24-v31 |              | Temporary registers          | No
+
+*: v0 is used as the mask register for masked vector instructions. It is also used as the first mask argument in the procedure calling convention. If there is no need to use it as the mask, it can be considered a temporary register.


How are mask arguments distinguished from normal vector arguments as far as the compiler is concerned looking at a function call? Is it just anything that's a vector of bools? This needs to be specified, likely in the body of text below.

Yes, it is distinguished by the argument type.

aswaterman · 2021-03-14T23:28:07Z

See also https://github.com/riscv/riscv-v-spec/blob/c05d4a7e10b1ae474f6f4bf102aeaf04d6930bfc/calling-convention.adoc -- this describes the discipline for vl, vtype, and vcsr, as well as the fact that vector state is not preserved across system calls. (Not sure if the latter belongs in the Linux-specific section or what.)

jrtc27 · 2021-03-14T23:57:37Z

> See also https://github.com/riscv/riscv-v-spec/blob/c05d4a7e10b1ae474f6f4bf102aeaf04d6930bfc/calling-convention.adoc -- this describes the discipline for vl, vtype, and vcsr, as well as the fact that vector state is not preserved across system calls. (Not sure if the latter belongs in the Linux-specific section or what.)

Yes, system calls are out of scope for a psABI, though not preserving the state is surprising to be honest; interrupts and system calls take the same path into the kernel, so you'd have to go out of your way to make them behave differently. I'd expect the kernel to either not touch vector registers outside of context switches or, if it wants to use them, lazily save/restore them like it does for floating-point registers.

kito-cheng · 2021-03-18T02:06:32Z

I just realized this calling convention needs to consider the lazy binding issue [1] too. My first impression was that we don't have callee-saved registers, so we might not have such an issue. But this proposal passes arguments in vector registers, which could be clobbered by the lazy binding/ifunc resolver, so I guess the only possible choice for a baseline vector calling convention is to pass all arguments and return values in memory.

@jrtc27 @jim-wilson what do you think?

[1] #66

rjiejie · 2021-06-10T06:09:30Z

riscv-elf.md

+
+There is no scalar values passed through vector registers. There is no vector
+values passed through scalar registers. There is no need to define a new ABI
+for vector. Vector calling convention is appliable for existing ABIs.


How does this convention stay compatible with the original vector calling convention
if assembly code is already deployed in some libraries? Why is it not necessary to define a new ABI?

What is the original vector calling convention?

Passing arguments through memory is what I mean by "original vector calling convention".

> Passing arguments through memory is what I mean by "original vector calling convention".

It's not part of the standard, so I think there is no compatibility issue from the standard ABI's point of view.

frasercrmck · 2021-06-21T09:40:51Z

riscv-elf.md

+Vectors that are LMUL = 1 or fractional LMUL are passed in a single vector
+argument register. Vectors that are LMUL = 2 are passed in 2-aligned vector
+argument registers. Vectors that are LMUL = 4 are passed in 4-aligned vector
+argument registers. Vectors that are LMUL = 8 are passed in 8-aligned vector
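
The alignment rule in the hunk above can be sketched in Python as a small register allocator. This is a minimal sketch, not the psABI algorithm: the hunk only states the alignment requirement, so the first-fit policy, the in-order processing, and the `None` result for arguments that do not fit are all assumptions made for illustration. `assign_vector_args` is a hypothetical name; the v8-v23 range is taken from the draft table quoted elsewhere in this thread.

```python
from fractions import Fraction

FIRST_ARG_REG = 8    # v8, first argument register in the draft table
LAST_ARG_REG = 23    # v23, last argument register in the draft table


def assign_vector_args(lmuls):
    """Assign each vector argument a starting register number, or None.

    Assumptions (not stated in the thread): arguments are processed in
    order, each takes the lowest-numbered free aligned register group,
    and an argument that does not fit is reported as None.
    """
    free = [True] * 32
    assignment = []
    for lmul in lmuls:
        # LMUL = 1 and fractional LMUL both occupy a single register.
        group = max(1, int(Fraction(lmul)))
        start = None
        for reg in range(FIRST_ARG_REG, LAST_ARG_REG + 1):
            # A group of size N must start at a multiple of N and
            # must end within the argument register range.
            if reg % group != 0 or reg + group - 1 > LAST_ARG_REG:
                continue
            if all(free[reg:reg + group]):
                start = reg
                break
        if start is not None:
            for r in range(start, start + group):
                free[r] = False
        assignment.append(start)
    return assignment
```

For example, `assign_vector_args([8, 8])` places the two LMUL = 8 arguments at v8 and v16, which uses all 16 argument registers; a third LMUL = 8 argument would get `None` under this sketch.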


Apologies if I've missed what this document is specifying (and for whom) but should it also specify what happens for vector types which are conceptually (to the programmer and/or compiler) LMUL = 16 or above? The LLVM compiler as we've implemented it, for instance, would split these vectors in half (and again until a legal type is found) and the ABI is applied to each split part independently.
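
The halving behaviour described in this comment can be sketched as follows. This is an illustrative Python model only, not LLVM code: `split_to_legal` is a hypothetical name, and the sketch assumes the conceptual LMUL is a power of two and that LMUL = 8 is the largest legal value.

```python
def split_to_legal(lmul: int, max_lmul: int = 8):
    """Split a conceptual LMUL into legal parts by repeated halving.

    Assumes lmul is a positive power of two. Each returned part would
    then be passed independently under the calling convention.
    """
    if lmul < 1 or lmul & (lmul - 1):
        raise ValueError("lmul must be a positive power of two")
    if lmul <= max_lmul:
        return [lmul]
    half = lmul // 2
    # Split in half, then keep splitting each half until it is legal.
    return split_to_legal(half, max_lmul) + split_to_legal(half, max_lmul)
```

Under these assumptions, a conceptual LMUL = 16 value becomes two LMUL = 8 parts, and a value that is already legal is returned unchanged.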

The current spec only lists LMUL values of 1/8, 1/4, 1/2, 1, 2, 4, and 8, so I think this should be enough?
https://github.com/riscv/riscv-v-spec/blob/master/v-spec.adoc#332-vector-register-grouping-vlmul20

Yes that's true, which is why I wasn't sure if my comment applied. I was viewing this doc from a user/compiler perspective, where it's possible (at least in LLVM) to define something that ends up as `void @foo(<vscale x 128 x i8> %v)` in the IR, which is LMUL > 8. It would be legal for a compiler to pre-define a type like `__rvv_int8m16_t`. Shouldn't that case be covered by this CC specification?

I only ask because the rest of this psABI document defines the lowering of high-level concepts like "aggregates" and "C structs", and how they must be broken down into registers and/or stack.

This vector calling convention document, on the other hand, only defines things in terms of LMUL so is more hardware-focused and feels slightly out of place with the other CC definitions. That's why it's not clear to me whether it's intended to spec out software concepts like which high-level "types" it applies to, what happens to "wide" LMUL>8 vectors, or even what is supposed to happen if the V extension isn't enabled.

The restriction LMUL <= 8 is pretty firmly baked into the RISC-V V-extension at this point, so I think it's reasonable to have the same restriction in the associated C language dialect ("RVV intrinsics").

This being said, I don't see a problem with LLVM implementing its own extension of the RVV intrinsics dialect, with new "m16" types, as long as it's compatible with the existing dialect. I do think some discussion needs to happen over at https://github.com/riscv/rvv-intrinsic-doc before attempting to codify the calling convention here.

Yeah I agree that LMUL <= 8 seems firm enough ground to stand on for the purposes of this document. The issue I have is mostly stemming from the unclear lowering from software to LMUL which leaves me with questions about whom this calling convention specification is aimed at.

I may be thinking of things which may be out of scope -- and apologies if I am -- but I've not just been thinking of the RVV intrinsic types. If this document is purely aimed at supporting the pre-defined set of RVV intrinsic types in https://github.com/riscv/rvv-intrinsic-doc then I think that could be clarified here.

However, as it stands, this comes across like as generic a calling convention as the integer/floating-point ones. I'll go through a list of the use cases which I believe are currently left ambiguous.

- At the language level: I concede that m16 doesn't make much sense when used in conjunction with the intrinsics, though I can foresee compilers exposing "scalable" vector types (think `ext_vector_type`) and having support for operators on them, much like they do for the fixed vector types: something like `__rvv_int8m16_t v = x + y;`
- Auto-vectorization
- LLVM's support for lowering fixed-length vectors to RVV instructions (e.g. by passing the known vector length to `vsetivli`)

All of the above may introduce vectors into the program which may not necessarily fall into the hardware-supported LMUL categories, and I think it's natural to wonder if and how they should be handled by this vector calling convention. It's analogous to how we have to specify the ABI for integer scalars > XLEN, isn't it?

Given that the hardware only supports LMUL up to 8, software eventually needs to legalize those types into hardware-legal types to make them code-gen-able, so from this point of view I don't think there is much benefit in generating such a software m16 type?

In theory vscale could be an arbitrary positive integer in the LLVM type system, but that doesn't mean all of those types are legal for code generation. They need to be handled in the LLVM back-end: reject the type, legalize it to the nearest legal type, or break it down into several values. Any of these is fine since it's a kind of LLVM extension, so I would prefer to include such types in the psABI only once they become part of a RISC-V standard. Of course, the RVV intrinsic spec is one of the RISC-V standards.

Fixed-length vectors over RVV are another story. We don't have that well defined yet, the way SVE has VLS. I am happy to see this happen, but it should be a separate issue.

Thanks for your input. I think this all makes sense. I think that this draft is solid as far as hardware-level "legal types" are concerned. It's just the missing link between software and that hardware which I think is currently left open to interpretation and may cause confusion to other readers. Especially considering both the Integer and Floating-Point Calling Convention specifications are careful to explain how all of these language constructs like structs and unions and large scalars and complex numbers are handled. This vector one goes straight to LMUL without explanation.

It's sounding to me like, at the language level, this calling convention is only trying to cover the vector types defined by the RVV intrinsic specification and nothing else. And that's perfectly valid, but I do think it could be clarified in this document to correctly narrow down the scope of this calling convention.

Thanks for your feedback. From the LLVM IR perspective, there is no limit on the LMUL values. Actually, LLVM IR has no concept of LMUL at all; any `<vscale x n x ty>` is possible. However, from the C/C++ language perspective, there are no LMUL = 16 types. In addition, there are no structs or arrays of scalable vector types. I have no idea how to create LMUL = 16 from the high-level language. I think the description could be clearer. I will think about how to clarify it. Thanks.

@frasercrmck, I have added some description about the types we focus on in the vector calling convention. Please help me to review if it is appropriate or not, thanks.

rjiejie · 2021-09-17T01:45:41Z

riscv-cc.adoc

+|v24-v31  |              | Temporary registers          | No
+|vl       |              | Vector length                | No
+|vtype    |              | Vector data type register    | No
+|===


We should consider function calls, as @kito-cheng mentioned before,
so maybe we need to make some registers preserved (callee-saved).

I posted a vector calling convention a long time ago [0], so I suggest making
v24-v31 preserved (callee-saved) under some conditions.

[0] : https://lists.riscv.org/g/tech-vector-ext/message/7?p=%2C%2C%2C20%2C0%2C0%2C0%3A%3Arecentpostdate%2Fsticky%2C%2Ccall+convention%2C20%2C2%2C0%2C69238312

We are not going to add any callee-saved registers, so that we can add the V extension without an ABI break. This is a first-order goal for a number of reasons, including vectorizing library routines dynamically linked to applications that are compiled without the V extension. See related discussion about ABI breakages with callee-saved state here. https://sourceware.org/pipermail/libc-alpha/2021-September/130897.html

kito-cheng · 2023-07-06T15:41:51Z

This proposal is deprecated and moved to #389

Hsiangkai force-pushed the vector-calling-convention branch 2 times, most recently from 13d6969 to 31fcb90 Compare January 25, 2021 03:11

zakk0610 mentioned this pull request Jan 28, 2021

Calling convention for vector arguments riscv-non-isa/rvv-intrinsic-doc#38

Closed

rofirrim reviewed Jan 29, 2021

View reviewed changes

jrtc27 reviewed Mar 14, 2021

View reviewed changes

kito-cheng mentioned this pull request Mar 16, 2021

On RISC-V, lazily bound functions must follow the ABI #66

Closed

Hsiangkai mentioned this pull request May 12, 2021

Calling conventions for the lazily bound functions. #190

Merged

rjiejie reviewed Jun 10, 2021

View reviewed changes

frasercrmck reviewed Jun 21, 2021

View reviewed changes

jrtc27 added this to the First Release DoD milestone Jul 9, 2021

Draft vector calling convention.

488f96b

Hsiangkai force-pushed the vector-calling-convention branch from 94ac182 to 488f96b Compare September 8, 2021 08:40

rjiejie approved these changes Sep 17, 2021

View reviewed changes

jrtc27 mentioned this pull request Sep 27, 2021

Add minimal description for vector register convention #218

Merged

kito-cheng removed this from the Freeze 2021 milestone Sep 28, 2021

nick-knight mentioned this pull request Jun 17, 2023

Proposal for Vector Calling Convention #389

Merged

kito-cheng closed this Jul 6, 2023

Hsiangkai commented Jan 21, 2021

kito-cheng commented Jan 21, 2021

ebahapo commented Jan 21, 2021

Hsiangkai commented Jan 25, 2021

rofirrim Jan 29, 2021

rofirrim commented Jan 29, 2021

jrtc27 Mar 14, 2021

jrtc27 Mar 14, 2021

jrtc27 Mar 14, 2021

Hsiangkai Mar 17, 2021

aswaterman commented Mar 14, 2021

jrtc27 commented Mar 14, 2021

kito-cheng commented Mar 18, 2021

rjiejie Jun 10, 2021

Hsiangkai Jun 10, 2021

rjiejie Jun 10, 2021

kito-cheng Jun 14, 2021

frasercrmck Jun 21, 2021

kito-cheng Jun 21, 2021

frasercrmck Jun 21, 2021

nick-knight Jun 21, 2021 •

edited

frasercrmck Jun 22, 2021

kito-cheng Jun 23, 2021

frasercrmck Jun 23, 2021

Hsiangkai Jun 24, 2021 •

edited

Hsiangkai Jun 29, 2021

rjiejie Sep 17, 2021

aswaterman Sep 27, 2021 •

edited

kito-cheng commented Jul 6, 2023 •

edited
