Rebase DBraun/sequence-layers mlx.001 onto main; bump to 0.3.dev1 by JulianSlzr · Pull Request #15 · google/sequence-layers

JulianSlzr · 2026-06-02T02:58:31Z

Rebases @DBraun's MLX fork onto sequence-layers 0.02 as an initial next version.

This is a temporary measure for downstream SL MLX users until the refactor (#14) lands.

commit f2f2198
Author: David Braun <2096055+DBraun@users.noreply.github.com>
Date: Thu Feb 26 09:06:30 2026 -0500

    remove depthformer and moshi_demo.py

commit 74669fc
Author: Kehang Han <kehanghan@gmail.com>
Date: Wed Feb 25 15:15:26 2026 -0800

    This commit finally makes MLX generate good music!

commit 9e4a452
Author: Kehang Han <kehanghan@gmail.com>
Date: Wed Feb 25 12:47:09 2026 -0800

    Changes to make temporal output match between jax and mlx

commit 8d267f2
Author: David Braun <2096055+DBraun@users.noreply.github.com>
Date: Wed Feb 25 12:45:53 2026 -0500

    Update weight_converter.py

commit 68add71
Author: David Braun <2096055+DBraun@users.noreply.github.com>
Date: Wed Feb 25 12:41:52 2026 -0500

    MLX attention: add per_dim_scale

commit b4ae905
Author: David Braun <2096055+DBraun@users.noreply.github.com>
Date: Wed Feb 25 11:31:02 2026 -0500

    Update normalization.py

commit a3e84cf
Author: Kehang Han <kehanghan@gmail.com>
Date: Tue Feb 24 16:04:27 2026 -0800

    To make encoded approx bit level compatible

commit c6596c7
Author: Kehang Han <kehanghan@gmail.com>
Date: Tue Feb 24 14:35:16 2026 -0800

    Updates to MLX port of SL to support magenta-rt

commit 08077bb
Author: David Braun <2096055+DBraun@users.noreply.github.com>
Date: Tue Feb 24 08:54:43 2026 -0500

    Create moshi_demo.py

use mx.fast.scaled_dot_product_attention

use mx.fast.rope

These tests exercise the DotProductSelfAttention KV cache ring buffer with time > max_past_horizon, causing buffer wrap-around. They expose a write-before-read bug: the current code overwrites the oldest key in the attention window before queries can attend to it, even with block_size=1.

Failures:
- test_use_kv_cache_ringbuffer: 33.3% element mismatch at time=6
- LocalDotProductSelfAttention.test_layer: 19.4% element mismatch at time=10

The following commit fixes these by implementing write-after-read.
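The ordering problem described above can be illustrated with a toy ring buffer. This is only a sketch under stated assumptions, not the code in this PR: it assumes a buffer with exactly `max_past_horizon` slots, one step per call (block_size=1), and uses integer time indices in place of real key tensors. The function names are hypothetical.

```python
def expected_window(t, max_past_horizon):
    """Keys a query at step t should see: the past horizon plus itself."""
    return list(range(max(0, t - max_past_horizon), t + 1))


def visible_keys_write_before_read(num_steps, max_past_horizon):
    """Buggy order: store the new key first, then read the buffer."""
    # Assumption: the cache holds exactly max_past_horizon slots.
    buffer = [None] * max_past_horizon
    seen = []
    for t in range(num_steps):
        # This write evicts key t - max_past_horizon, which is still
        # inside the attention window of the query at step t.
        buffer[t % max_past_horizon] = t
        seen.append(sorted(k for k in buffer if k is not None))
    return seen


def visible_keys_write_after_read(num_steps, max_past_horizon):
    """Fixed order: attend over cached keys plus the new key, then store."""
    buffer = [None] * max_past_horizon
    seen = []
    for t in range(num_steps):
        past = [k for k in buffer if k is not None]
        seen.append(sorted(past + [t]))   # read first
        buffer[t % max_past_horizon] = t  # write afterwards
    return seen


if __name__ == "__main__":
    horizon = 2
    for t, (bad, good) in enumerate(
        zip(
            visible_keys_write_before_read(6, horizon),
            visible_keys_write_after_read(6, horizon),
        )
    ):
        print(t, "expected", expected_window(t, horizon), "buggy", bad, "fixed", good)
```

In this toy model the two orderings agree until the buffer wraps; from step `max_past_horizon` onward the write-before-read variant is missing the oldest key in the window, while the write-after-read variant matches the expected window at every step.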

Includes:
- Expanded sweep for KV cache ringbuffer test with max_future_horizon=0.
- Restored informative comments about ring buffer and mask logic.

TAG=agy CONV=21ada17b-3411-4090-8450-e69d8ebfeae6

DBraun and others added 17 commits June 2, 2026 02:30

add MLX layers

8aa8034

Update attention.py

8cbc1b6

Create depthformer.py

496f4f4

more efficient attention

6e9db41

Update attention.py

0666d8c

use mx.fast.scaled_dot_product_attention

Update position.py

b246e66

use mx.fast.rope

add to_quantized

9cc56e3

optimizations

14bd756

attention optimizations

c33c0b3

Update convolution.py

0671c97

Updates pyproject w/ recurrentgemma fix

c595bba

fix(mlx): Implement write-after-read for attention ring buffer

2a1092a

Includes:
- Expanded sweep for KV cache ringbuffer test with max_future_horizon=0.
- Restored informative comments about ring buffer and mask logic.

Adds Parallel.Config on mlx side

3b135db

Fixes cache poisoning for dtype in simple.py

4d27a1e

chore(release): bump version to 0.3.dev1

7b0816c

TAG=agy CONV=21ada17b-3411-4090-8450-e69d8ebfeae6

JulianSlzr requested a review from rryan June 2, 2026 02:58

JulianSlzr closed this Jun 4, 2026

JulianSlzr wants to merge 17 commits into google:main from JulianSlzr:mlx.001


4 participants
