Skip to content

Understanding Sierra: From High-Level Cairo to Safe CASM

Sierra (Safe Intermediate REpresentAtion) is a linear intermediate representation of Cairo instructions, designed to bridge the gap between high-level Cairo 1 intermdiate and low-level Cairo Assembly (CASM). Sierra can be compiled to a subset of CASM, known as Safe CASM.

Sierra ensures that programs are always provable by preventing the compilation of programs with infinite loops or invalid constraints.

From Cairo 1 to Sierra

Before Starknet Alpha v0.11.0, developers wrote contracts in Cairo 0, which compiled directly to Cairo Assembly (CASM), and contract classes were submitted to the sequencer via DECLARE transactions. This approach had risks, as the sequencer could not determine whether a given transaction would fail without executing it, and therefore could not charge fees for failed transactions.

Cairo 1 introduced contract class compilation to this new Sierra intermediate representation instead of directly compiling to CASM. The Sierra code is then submitted to the sequencer, compiled down to CASM, and finally executed by the Starknet OS. Using Sierra ensures that all transactions (including failed ones) are provable, and allows sequencers to charge fees for all submitted transactions, making DoS attacks significantly more expensive.

Compilation Pipeline

    Cairo 1.0 Source (High-level)
          |
          |	with Cairo-to-Sierra Compiler
          V
    Sierra IR (Safe, Provable Code)
          |
          |	with Sierra-to-CASM Compiler (Run by Sequencer)
          V
    CASM (Cairo Assembly)
          |
          | with CairoVM Execution
          V
    STARK Proofs (Proof Generation)

At its core, Sierra's compilation process is focused on safety and efficiency. Cairo 1 uses a linear type system and a non-deterministic, immutable, contiguous memory model, which guarantees that dereferencing never fails.

Sierra transforms loops into recursion and keeps track of gas usage, which prevents infinite loops.

Anatomy of a Sierra Program

Type Declarations

Sierra, as a Cairo representation, also uses a linear type system, where each value must be used exactly once. During the compilation, a unique identifier is assigned to each type.

When types can safely be used multiple times, they need to be duplicated using the dup instruction, which will assign two new identifiers to preserve linearity.

Type declaration is done with the following syntax:

type type_id = concrete_type;

Library Function Declarations

Sierra comes with a set of built-in functions (libfuncs) that represent the call to low-level units of code known to be safe. After type declarations, a Sierra program must define all the libfuncs used in the program along with their expected input types.

Libfunc declaration is done with the following syntax:

libfunc libfunc_id = libfunc_name<input_types>;

Statements

This section shows the sequence of operations that occur during execution, describing the actual logic of the program. A statement either invokes a libfunc or returns a value.

Statements are declared with the following syntax:

libfunc_id<input_types>(input_variables) -> (output_variables);

To return a value, we use the return(variable_id) statement.

User Defined Functions Declarations

At the end of a Sierra program, each user-defined function is declared with a unique identifier and the statement index where the function starts. This provides information about the function, such as its signature, while the implementation is defined in the statements section.

An user defined function is declared with the following syntax:

function_id@statement_index(parameters: types) -> (return_types);

Simple Sierra Program Breakdown

Let's go through the following Cairo program:

fn add_numbers(a: felt252, b: felt252) -> felt252 {
    a + b
}

It compiles to the following Sierra code:

type felt252 = felt252 [storable: true, drop: true, dup: true, zero_sized: false];
 
libfunc felt252_add = felt252_add;
libfunc store_temp<felt252> = store_temp<felt252>;
 
felt252_add([0], [1]) -> ([2]); // 0
store_temp<felt252>([2]) -> ([2]); // 1
return([2]); // 2
 
sierra_ir::add_numbers@0([0]: felt252, [1]: felt252) -> (felt252);

Type Declarations:

  • felt252: Represents the field element type

Libfunc Declarations:

  • felt252_add: Performs addition on field elements
  • store_temp<felt252>: Temporarily stores the result

Statements Section:

  • Statement 0: calls the felt252_add libfunc to add the values from memory cells 0 and 1, storing the result in memory cell 2
  • Statement 1: calls the store_temp<felt252> libfunc to prepare the result for the return statement
  • Statement 2: returns the value from memory cell 2

User Defined Functions:

  • add_numbers: Takes two felt252 types in memory cells 0 and 1 and returns a felt252 value by starting at statement 0

Storage Variables Smart Contract Sierra Code

You can find a more complex example of the compiled Sierra code of the Storage Variables Example.

Further Reading

Powered By Nethermind