sno2:perf/parser-string - Branch - astral-sh/ruff

Blog Docs

perf(parser): use faster string parser methods(#8227)

Merged

Merging

sno2:perf/parser-string

into

main

+23%

IMPROVEMENTS: 5

REGRESSIONS: 0

UNTOUCHED: 20

NEW: 0

DROPPED: 0

IGNORED: 0

Benchmarks

formatter[large/dataset.py]

crates/ruff_benchmark/benches/formatter.rs::formatter::benchmark_formatter::formatter[large/dataset.py]

52.1 ms

52.2 ms

formatter[numpy/ctypeslib.py]

crates/ruff_benchmark/benches/formatter.rs::formatter::benchmark_formatter::formatter[numpy/ctypeslib.py]

10 ms

formatter[numpy/globals.py]

crates/ruff_benchmark/benches/formatter.rs::formatter::benchmark_formatter::formatter[numpy/globals.py]

1.1 ms

formatter[pydantic/types.py]

crates/ruff_benchmark/benches/formatter.rs::formatter::benchmark_formatter::formatter[pydantic/types.py]

+1%

19.2 ms

18.9 ms

formatter[unicode/pypinyin.py]

crates/ruff_benchmark/benches/formatter.rs::formatter::benchmark_formatter::formatter[unicode/pypinyin.py]

3.4 ms

lexer[large/dataset.py]

crates/ruff_benchmark/benches/lexer.rs::lexer::benchmark_lexer::lexer[large/dataset.py]

8.5 ms

8.4 ms

lexer[numpy/ctypeslib.py]

crates/ruff_benchmark/benches/lexer.rs::lexer::benchmark_lexer::lexer[numpy/ctypeslib.py]

+1%

1.8 ms

lexer[numpy/globals.py]

crates/ruff_benchmark/benches/lexer.rs::lexer::benchmark_lexer::lexer[numpy/globals.py]

+2%

224.7 µs

221 µs

lexer[pydantic/types.py]

crates/ruff_benchmark/benches/lexer.rs::lexer::benchmark_lexer::lexer[pydantic/types.py]

+2%

3.8 ms

3.7 ms

lexer[unicode/pypinyin.py]

crates/ruff_benchmark/benches/lexer.rs::lexer::benchmark_lexer::lexer[unicode/pypinyin.py]

+1%

559.1 µs

551.3 µs

linter/all-rules[large/dataset.py]

crates/ruff_benchmark/benches/linter.rs::all_rules::benchmark_all_rules::linter/all-rules[large/dataset.py]

-1%

165.4 ms

167.3 ms

linter/all-rules[numpy/ctypeslib.py]

crates/ruff_benchmark/benches/linter.rs::all_rules::benchmark_all_rules::linter/all-rules[numpy/ctypeslib.py]

+3%

35 ms

34 ms

linter/all-rules[numpy/globals.py]

crates/ruff_benchmark/benches/linter.rs::all_rules::benchmark_all_rules::linter/all-rules[numpy/globals.py]

+6%

4.1 ms

3.9 ms

linter/all-rules[pydantic/types.py]

crates/ruff_benchmark/benches/linter.rs::all_rules::benchmark_all_rules::linter/all-rules[pydantic/types.py]

+1%

72.6 ms

72.1 ms

linter/all-rules[unicode/pypinyin.py]

crates/ruff_benchmark/benches/linter.rs::all_rules::benchmark_all_rules::linter/all-rules[unicode/pypinyin.py]

+1%

16.2 ms

16 ms

linter/default-rules[large/dataset.py]

crates/ruff_benchmark/benches/linter.rs::default_rules::benchmark_default_rules::linter/default-rules[large/dataset.py]

+1%

88.4 ms

87.7 ms

linter/default-rules[numpy/ctypeslib.py]

crates/ruff_benchmark/benches/linter.rs::default_rules::benchmark_default_rules::linter/default-rules[numpy/ctypeslib.py]

+2%

17 ms

16.7 ms

linter/default-rules[numpy/globals.py]

crates/ruff_benchmark/benches/linter.rs::default_rules::benchmark_default_rules::linter/default-rules[numpy/globals.py]

+15%

2 ms

1.7 ms

linter/default-rules[pydantic/types.py]

crates/ruff_benchmark/benches/linter.rs::default_rules::benchmark_default_rules::linter/default-rules[pydantic/types.py]

+4%

36.3 ms

35 ms

linter/default-rules[unicode/pypinyin.py]

crates/ruff_benchmark/benches/linter.rs::default_rules::benchmark_default_rules::linter/default-rules[unicode/pypinyin.py]

+3%

6 ms

5.8 ms

parser[large/dataset.py]

crates/ruff_benchmark/benches/parser.rs::parser::benchmark_parser::parser[large/dataset.py]

65.5 ms

65.1 ms

parser[numpy/ctypeslib.py]

crates/ruff_benchmark/benches/parser.rs::parser::benchmark_parser::parser[numpy/ctypeslib.py]

+7%

12 ms

11.2 ms

parser[numpy/globals.py]

crates/ruff_benchmark/benches/parser.rs::parser::benchmark_parser::parser[numpy/globals.py]

+23%

1.3 ms

1.1 ms

parser[pydantic/types.py]

crates/ruff_benchmark/benches/parser.rs::parser::benchmark_parser::parser[pydantic/types.py]

+1%

25.2 ms

24.9 ms

parser[unicode/pypinyin.py]

crates/ruff_benchmark/benches/parser.rs::parser::benchmark_parser::parser[unicode/pypinyin.py]

+5%

4.1 ms

3.9 ms

Commits

Click on a commit to change the comparison range

base

main

9792b15

+23%

perf(parser): use faster string parsing This makes use of memchr for parsing strings. It sadly does introduce one use of `unsafe` to create a string that is valid to pass into `u32::from_str_radix` because I was unable to find another method that does not require far more code than required with `unsafe`.

893cf42

7 months ago by sno2

fix escape followed by unicode character

28eba86

7 months ago by sno2

fix panic when unicode name starts with... multi-byte UTF-8 characters

05acb28

7 months ago by sno2

use find() instead of memchr

c577879

7 months ago by sno2

Update crates/ruff_python_parser/src/string.rs Co-authored-by: Dhruv Manilawala <dhruvmanila@gmail.com>

28b823f

7 months ago by sno2

ResourcesHome Pricing Docs Blog GitHub

Getting StartedSample repository Explore repositories Support

AboutTwitter Discord Contact Us Terms of Service Privacy Policy