Greaterthan official
auto_circuit.metrics.official_circuits.circuits.greaterthan_official
Classes
Functions
greaterthan_true_edges
greaterthan_true_edges(model: PatchableModel, token_positions: bool = False, word_idxs: Dict[str, int] = {}, seq_start_idx: int = 0) -> Set[Edge]
The Greaterthan circuit, discovered by Hanna et al. 2023.
Based on the ACDC implementation.
Parameters:
Name | Type | Description | Default |
---|---|---|---|
model |
PatchableModel
|
A patchable TransformerLens GPT-2 |
required |
token_positions |
bool
|
Whether to distinguish between token positions when returning
the set of circuit edges. If |
False
|
word_idxs |
Dict[str, int]
|
A dictionary defining the index of specific named tokens in the circuit definition. For this circuit, token positions are not specified, so this parameter is not used. |
{}
|
seq_start_idx |
int
|
Offset to add to all of the token positions in |
0
|
Returns:
Type | Description |
---|---|
Set[Edge]
|
The set of edges in the circuit. |
Note
The Greaterthan circuit does not specify token positions, so if
token_positions
is True
, all token positions are included for the edges in
the circuit.
Source code in auto_circuit/metrics/official_circuits/circuits/greaterthan_official.py
38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60 61 62 63 64 65 66 67 68 69 70 71 72 73 74 75 76 77 78 79 80 81 82 83 84 85 86 87 88 89 90 91 92 93 94 95 96 97 98 99 100 101 102 103 104 105 106 107 108 109 110 111 112 113 114 115 116 117 118 119 120 121 122 123 124 125 126 127 128 129 130 131 132 133 |
|