ZIP (four registers)

Interleave elements from four vectors

This instruction places the four-way interleaved elements from the four source vectors in the corresponding elements of the four destination vectors.

This instruction is unpredicated.

It has encodings from 2 classes: 8-bit to 64-bit elements and 128-bit element

8-bit to 64-bit elements
(FEAT_SME2)

313029282726252423222120191817161514131211109876543210
11000001size110110111000Zn00Zd00
op

Encoding

ZIP { <Zd1>.<T>-<Zd4>.<T> }, { <Zn1>.<T>-<Zn4>.<T> }

Decode for this encoding

if !IsFeatureImplemented(FEAT_SME2) then EndOfDecode(Decode_UNDEF); end; if size == '11' && MaxImplementedSVL() < 256 then EndOfDecode(Decode_UNDEF); end; let esize : integer{} = 8 << UInt(size); let n : integer = UInt(Zn::'00'); let d : integer = UInt(Zd::'00');

128-bit element
(FEAT_SME2)

313029282726252423222120191817161514131211109876543210
1100000100110111111000Zn00Zd00
op

Encoding

ZIP { <Zd1>.Q-<Zd4>.Q }, { <Zn1>.Q-<Zn4>.Q }

Decode for this encoding

if !IsFeatureImplemented(FEAT_SME2) then EndOfDecode(Decode_UNDEF); end; if MaxImplementedSVL() < 512 then EndOfDecode(Decode_UNDEF); end; let esize : integer{} = 128; let n : integer = UInt(Zn::'00'); let d : integer = UInt(Zd::'00');

Assembler Symbols

<Zd1>

Is the name of the first scalable vector register of the destination multi-vector group, encoded as "Zd" times 4.

<T>

Is the size specifier, encoded in size:

size <T>
00 B
01 H
10 S
11 D
<Zd4>

Is the name of the fourth scalable vector register of the destination multi-vector group, encoded as "Zd" times 4 plus 3.

<Zn1>

Is the name of the first scalable vector register of the source multi-vector group, encoded as "Zn" times 4.

<Zn4>

Is the name of the fourth scalable vector register of the source multi-vector group, encoded as "Zn" times 4 plus 3.

Operation

CheckStreamingSVEEnabled(); let VL : integer{} = CurrentVL(); if VL < esize * 4 then EndOfDecode(Decode_UNDEF); end; let quads : integer = VL DIV (esize * 4); let operand0 : bits(VL) = Z{}(n); let operand1 : bits(VL) = Z{}(n+1); let operand2 : bits(VL) = Z{}(n+2); let operand3 : bits(VL) = Z{}(n+3); var result : bits(VL); for r = 0 to 3 do let base : integer = r * quads; for q = 0 to quads-1 do result[(4*q+0)*:esize] = operand0[(base+q)*:esize]; result[(4*q+1)*:esize] = operand1[(base+q)*:esize]; result[(4*q+2)*:esize] = operand2[(base+q)*:esize]; result[(4*q+3)*:esize] = operand3[(base+q)*:esize]; end; Z{VL}(d+r) = result; end;

Operational information

This instruction is a data-independent-time instruction as described in About PSTATE.DIT.


2026-03_rel 2026-03-26 20:48:11

Copyright © 2010-2026 Arm Limited or its affiliates. All rights reserved. This document is Non-Confidential.