GPT 5.4

OpenAI · mini-SWE-agent · Rank #8

0%
Resolved help_outline Percentage of instances fully solved as measured by hidden behavioral tests.
0.0%
Almost Resolved help_outline Instances where the agent's solution passes ≥ 95% of all hidden behavioral tests.
$65
Total Cost help_outline Total API cost in USD across all task instances.
3,237
Total Calls help_outline Total number of LLM calls across all task instances.
200 instances
# Repository Lang Score Cost Calls
1 abishekvashok/cmatrix Terminal based "The Matrix" like implementation c 91.7% $0.37 8
2 jqlang/jq Command-line JSON processor c 89.9% $0.15 140
3 wfxr/csview ๐Ÿ“  Pretty and fast csv viewer for cli with cjk/emoji support. rs 88.7% $0.16 8
4 sstadick/hck A sharp cut(1) clone. rs 86.8% $0.25 10
5 tarka/xcp An extended `cp` rs 84.5% $0.28 10
6 chmln/sd Intuitive find & replace CLI (sed alternative) rs 81.6% $0.22 8
7 raviqqe/muffet Fast website link checker in Go go 81.2% $0.30 13
8 psampaz/go-mod-outdated Find outdated dependencies of your Go projects. go-mod-outdated provides a table view of the go list -u -m -json all command which lists all dependencies of a Go project and their available minor and patch updates. It also provides a way to filter indirect dependencies and dependencies without updates. go 80.4% $0.15 10
9 madler/pigz A parallel implementation of gzip for modern multi-processor, multi-core machines. c 78.9% $0.23 10
10 xorg62/tty-clock Clock using lib ncurses c 76.9% $0.22 8
11 jarun/nnn nยณ The unorthodox terminal file manager c 76.7% $0.17 8
12 oppiliappan/eva a calculator REPL, similar to bc(1) rs 75.8% $0.33 13
13 codesnap-rs/codesnap ๐Ÿฆ€๏ธ๐Ÿ“ธ Pure Rust tool to generate beautiful code snapshots, provide CLI and Library rs 75.7% $0.79 11
14 OSGeo/PROJ PROJ - Cartographic Projections and Coordinate Transformations Library cpp 73.8% $0.47 13
15 incu6us/goimports-reviser Right imports sorting & code formatting tool (goimports alternative) go 73.3% $0.60 18
16 junegunn/fzf :cherry_blossom: A command-line fuzzy finder go 71.9% $0.34 12
17 Esubaalew/run Universal multi-language runner and smart REPL written in Rust. rs 69.8% $0.21 11
18 ismaelgv/rnr A command-line tool to batch rename files and directories rs 67.9% $0.37 14
19 JohannesKaufmann/html-to-markdown โš™๏ธ Convert HTML to Markdown. Even works with entire websites and can be extended through rules. go 67.2% $0.23 7
20 sitkevij/hex ๐Ÿ”ฎ Futuristic take on hexdump, made in Rust. rs 67.2% $0.19 9
21 pemistahl/grex A command-line tool and Rust library with Python bindings for generating regular expressions from user-provided test cases rs 66.5% $0.32 7
22 peco/peco Simplistic interactive filtering tool go 66.5% $0.29 11
23 BurntSushi/xsv A fast CSV command line toolkit written in Rust. rs 65.6% $0.32 8
24 chmln/handlr A better xdg-utils rs 65.4% $0.25 10
25 clog-tool/clog-cli Generate beautiful changelogs from your Git commit history rs 65.0% $0.32 12
26 mookid/diffr Yet another diff highlighting tool rs 64.5% $0.19 8
27 dalance/amber A code search / replace tool rs 63.4% $4.75 1,000
28 anordal/shellharden The corrective bash syntax highlighter rs 63.3% $0.16 9
29 KSXGitHub/parallel-disk-usage Highly parallelized, blazing fast directory tree analyzer rs 63.3% $0.60 14
30 unhappychoice/gittype A CLI code-typing game that turns your source code into typing challenges rs 63.1% $0.25 10
31 shashwatah/jot โšกRapid note management for the terminal. rs 63.0% $0.52 14
32 gabotechs/dep-tree Tool for helping developers keep their code bases clean and decoupled. It allows visualising a code base complexity using a 3d force-directed graph of files and the dependencies between them. go 62.5% $0.30 8
33 sheepla/pingu ๐Ÿงping command but with pingu go 62.1% $0.22 10
34 yassinebridi/serpl A simple terminal UI for search and replace, ala VS Code. rs 61.0% $0.36 13
35 mgdm/htmlq Like jq, but for HTML. rs 60.8% $0.30 14
36 bensadeh/tailspin ๐ŸŒ€ A log file highlighter rs 59.7% $0.26 8
37 eradman/entr Run arbitrary commands when files change c 59.2% $0.24 13
38 astaxie/bat Go implement CLI, cURL-like tool for humans go 58.9% $0.17 9
39 nikolassv/bartib A simple timetracker for the command line. It saves a log of all tracked activities as a plaintext file and allows you to create flexible reports. rs 58.6% $0.42 12
40 svenstaro/miniserve ๐ŸŒŸ For when you really just want to serve some files over HTTP right now! rs 58.6% $0.23 8
41 mibk/dupl a tool for code clone detection go 58.4% $0.45 13
42 AmmarAbouZor/tui-journal Your journal app if you live in a terminal rs 58.4% $0.29 10
43 mkj/dropbear Dropbear SSH c 58.1% $0.15 10
44 alexpovel/srgn A grep-like tool which understands source code syntax and allows for manipulation in addition to search rs 58.1% $0.41 12
45 tomnomnom/gron Make JSON greppable! go 58.0% $0.36 13
46 sibprogrammer/xq Command-line XML and HTML beautifier and content extractor go 57.8% $0.66 11
47 pier-cli/pier A CLI to organize and run short Unix shell scripts rs 57.5% $0.52 13
48 blacknon/hwatch A modern alternative to the watch command, records the differences in execution results and can check this differences at after. rs 56.6% $0.27 9
49 segmentio/chamber CLI for managing secrets go 56.2% $0.24 9
50 git-bahn/git-graph Command line tool to show clear git graphs arranged for your branching model rs 56.2% $0.20 10
51 kyoh86/richgo Enrich `go test` outputs with text decorations. go 54.4% $0.26 10
52 WGUNDERWOOD/tex-fmt An extremely fast LaTeX formatter written in Rust rs 54.3% $0.50 13
53 sharkdp/hexyl A command-line hex viewer rs 54.1% $0.32 12
54 riquito/tuc When cut doesn't cut it rs 53.8% $0.30 10
55 rs/curlie The power of curl, the ease of use of httpie. go 53.5% $0.27 10
56 ajeetdsouza/zoxide A smarter cd command. Supports all major shells. rs 53.5% $0.34 9
57 hatoo/oha Ohayou(ใŠใฏใ‚ˆใ†), HTTP load generator, inspired by rakyll/hey with tui animation. rs 52.8% $0.28 8
58 o2sh/onefetch Command-line Git information tool rs 52.2% $0.22 8
59 bootandy/dust A more intuitive version of du in rust rs 52.2% $0.46 11
60 Miserlou/Loop UNIX's missing `loop` command rs 52.0% $0.24 12
61 ggreer/the_silver_searcher A code-searching tool similar to ack, but faster. c 51.7% $0.20 6
62 Isona/dirble Fast directory scanning and scraping tool rs 51.7% $0.39 11
63 yaa110/nomino Batch rename utility for developers rs 51.4% $0.30 15
64 sharkdp/pastel A command-line tool to generate, analyze, convert and manipulate colors rs 50.6% $0.22 7
65 sclevine/yj CLI - Convert between YAML, TOML, JSON, and HCL. Preserves map order. go 50.2% $0.32 13
66 nuta/nsh A command-line shell like fish, but POSIX compatible. rs 50.2% $0.19 8
67 orf/gping Ping, but with a graph rs 49.6% $0.18 8
68 rust-ethereum/ethabi Encode and decode smart contract invocations rs 49.3% $0.32 12
69 antonmedv/fx Terminal JSON viewer & processor go 49.3% $0.20 10
70 multiprocessio/dsq Commandline tool for running SQL queries against JSON, CSV, Excel, Parquet, and more. go 49.3% $0.54 24
71 noborus/trdsql CLI tool that can execute SQL queries on CSV, LTSV, JSON, YAML and TBLN. Can output to various formats. go 49.1% $0.38 14
72 naggie/dstask Git powered terminal-based todo/note manager -- markdown note page per task. Single binary! go 48.8% $0.49 12
73 sharkdp/hyperfine A command-line benchmarking tool rs 48.8% $0.25 9
74 altdesktop/i3-style ๐ŸŽจ Make your i3 config a little more stylish. rs 48.8% $0.31 8
75 kisielk/errcheck errcheck checks that you checked errors. go 48.7% $0.34 14
76 kaushiksrini/parqeye Peek inside Parquet files right from your terminal rs 48.4% $0.21 14
77 quinn-rs/quinn Async-friendly QUIC implementation in Rust rs 46.9% $0.31 10
78 lz4/lz4 Extremely Fast Compression algorithm c 46.6% $0.24 10
79 sirwart/ripsecrets A command-line tool to prevent committing secret keys into your source code rs 46.3% $0.24 12
80 foriequal0/git-trim Automatically trims your branches whose tracking remote refs are merged or stray rs 45.4% $0.28 15
81 cheat/cheat cheat allows you to create and view interactive cheatsheets on the command-line. It was designed to help remind *nix system administrators of options for commands that they use frequently, but not frequently enough to remember. go 45.1% $0.40 10
82 cweill/gotests Automatically generate Go test boilerplate from your source code. go 45.1% $0.59 16
83 astro/deadnix Scan Nix files for dead code rs 44.5% $0.40 9
84 jrnxf/thokr โœจ sleek typing tui with visualized results and historical logging rs 43.4% $0.09 6
85 rs/jplot iTerm2 expvar/JSON monitoring tool go 43.2% $0.12 11
86 Canop/rhit A nginx log explorer rs 43.2% $0.23 7
87 cmatsuoka/figlet Claudio's FIGlet tree c 42.8% $0.52 16
88 axodotdev/oranda ๐ŸŽ generate beautiful landing pages for your developer tools rs 42.5% $0.22 10
89 cordx56/rustowl Visualize Ownership and Lifetimes in Rust rs 42.4% $0.25 10
90 ekzhang/bore ๐Ÿ•ณ bore is a simple CLI tool for making tunnels to localhost rs 42.1% $0.56 14
91 sayanarijit/xplr A hackable, minimal, fast TUI file explorer rs 41.1% $0.26 11
92 rust-lang/mdBook Create book from markdown files. Like Gitbook but implemented in Rust rs 40.8% $0.26 9
93 zevv/duc Dude, where are my bytes: Duc, a library and suite of tools for inspecting disk usage c 40.7% $0.30 7
94 robertdavidgraham/masscan TCP port scanner, spews SYN packets asynchronously, scanning entire Internet in under 5 minutes. c 40.7% $0.24 9
95 Byron/dua-cli View disk space usage and delete unwanted data, fast. rs 40.5% $0.17 7
96 lh3/seqtk Toolkit for processing sequences in FASTA/Q formats c 40.3% $0.26 12
97 Drew-Alleman/DataSurgeon Quickly Extracts IP's, Email Addresses, Hashes, Files, Credit Cards, Social Security Numbers and a lot More From Text rs 39.8% $0.20 9
98 sharkdp/fd A simple, fast and user-friendly alternative to 'find' rs 39.8% $0.28 12
99 ArthurSonzogni/json-tui A JSON terminal UI made in C++ cpp 39.6% $0.10 7
100 mfridman/tparse CLI tool for summarizing go test output. Pipe friendly. CI/CD friendly. go 39.1% $0.24 8
101 kyoheiu/felix tui file manager with vim-like key mapping rs 39.0% $0.42 12
102 jonas/tig Text-mode interface for git c 38.1% $0.12 7
103 wfxr/code-minimap ๐Ÿ›ฐ A high performance code minimap render. rs 38.0% $0.25 11
104 NikolaDucak/caps-log A small TUI journaling tool. ๐Ÿ“– cpp 37.9% $0.29 16
105 facebookresearch/fastText Library for fast text representation and classification. cpp 37.5% $0.24 9
106 noborus/ov ๐ŸŽ‘Feature-rich terminal-based text viewer. It is a so-called terminal pager. go 37.5% $0.21 11
107 parcel-bundler/lightningcss An extremely fast CSS parser, transformer, bundler, and minifier written in Rust. rs 36.7% $0.25 12
108 trasta298/keifu Git genealogy, untangled. A TUI for navigating commit graphs with color and clarity. rs 36.6% $0.14 9
109 ducaale/xh Friendly and fast tool for sending HTTP requests rs 36.6% $0.25 7
110 ecumene/rust-sloth A 3D software rasterizer... for the terminal! rs 36.6% $0.41 10
111 nachoparker/dutree a tool to analyze file system usage written in Rust rs 36.5% $0.14 8
112 BurntSushi/ripgrep ripgrep recursively searches directories for a regex pattern while respecting your gitignore rs 35.5% $0.26 7
113 tukaani-project/xz XZ Utils c 35.3% $0.32 12
114 lfos/calcurse A text-based calendar and scheduling application c 35.3% $0.54 12
115 facebook/zstd Zstandard - Fast real-time compression algorithm c 35.3% $0.21 11
116 simeg/eureka ๐Ÿ’ก CLI tool to input and store your ideas without leaving the terminal rs 34.9% $0.41 15
117 rhysd/kiro-editor A small terminal UTF-8 text editor written in Rust ๐Ÿ“๐Ÿฆ€ rs 34.3% $0.13 9
118 yoav-lavi/melody Melody is a language that compiles to regular expressions and aims to be more readable and maintainable rs 34.2% $0.45 19
119 elkowar/pipr A tool to interactively write shell pipelines. rs 33.3% $0.11 11
120 direnv/direnv unclutter your .profile go 32.9% $0.22 8
121 XAMPPRocky/tokei Count your code, quickly. rs 32.0% $0.39 11
122 Epistates/treemd A (TUI/CLI) markdown navigator with tree-based structural navigation. rs 31.9% $0.21 7
123 wintermute-cell/ngrrram A TUI tool to help you type faster and learn new layouts. Includes a free cat. rs 31.0% $0.28 10
124 go-critic/go-critic The most opinionated Go source code linter for code audit. go 30.4% $0.49 14
125 antonmedv/walk Terminal file manager go 30.2% $0.24 13
126 doxygen/doxygen Official doxygen git repository c 29.7% $0.24 7
127 tree-sitter/tree-sitter An incremental parsing system for programming tools rs 29.1% $0.28 9
128 ogham/dog A command-line DNS client. rs 28.8% $0.17 7
129 jesseduffield/lazygit simple terminal UI for git commands go 28.3% $0.39 11
130 guumaster/hostctl Your dev tool to manage /etc/hosts like a pro! go 28.3% $0.66 17
131 typst/typst A markup-based typesetting system that is powerful and easy to learn. rs 28.0% $0.39 11
132 TheZoraiz/ascii-image-converter A cross-platform command-line tool to convert images into ascii art and print them on the console. Now supports braille art! go 28.0% $0.28 7
133 dundee/gdu Fast disk usage analyzer with console interface written in Go go 27.9% $0.31 8
134 agourlay/zip-password-finder Find the password of protected ZIP files. rs 27.2% $0.19 10
135 ariga/atlas Declarative schema migrations with schema-as-code workflows go 26.3% $0.43 16
136 Y2Z/monolith โฌ›๏ธ CLI tool and library for saving complete web pages as a single HTML file rs 25.8% $0.25 8
137 oppiliappan/statix lints and suggestions for the nix programming language rs 25.4% $0.30 11
138 cslarsen/jp2a Converts jpg images to ASCII c 25.2% $0.43 15
139 svenstaro/genact ๐ŸŒ€ A nonsense activity generator rs 24.1% $0.46 10
140 rochacbruno/marmite Markdown makes sites - A Static Site Generator for Blogs rs 22.5% $0.33 11
141 rcoh/angle-grinder Slice and dice logs on the command line rs 21.9% $0.49 15
142 YS-L/flamelens Flamegraph viewer in the terminal rs 21.9% $0.07 7
143 mgechev/revive ๐Ÿ”ฅ ~6x faster, stricter, configurable, extensible, and beautiful drop-in replacement for golint go 21.3% $0.38 15
144 crowdagger/crowbook Converts books written in Markdown to HTML, LaTeX/PDF and EPUB rs 20.6% $0.34 11
145 jhspetersson/fselect Find files with SQL-like queries rs 20.3% $0.43 12
146 eliukblau/pixterm Draw images in your ANSI terminal with true color go 19.5% $0.20 7
147 hairyhenderson/gomplate A flexible commandline tool for template rendering. Supports lots of local and remote datasources. go 19.3% $0.46 12
148 ivanceras/svgbob Convert your ascii diagram scribbles into happy little SVG rs 18.6% $0.15 6
149 HaliteChallenge/Halite @twosigma's first artificial intelligence programming challenge cpp 18.5% $0.20 6
150 Canop/broot A new way to see and navigate directory trees : https://dystroy.org/broot rs 18.2% $0.42 7
151 dandavison/delta A syntax-highlighting pager for git, diff, grep, rg --json, and blame output rs 17.8% $0.23 7
152 lua/lua A copy of the Lua development repository, as seen by the Lua team. Mirrored irregularly. All communication should be through the Lua mailing list https://www.lua.org/lua-l.html c 17.8% $0.20 13
153 zk-org/zk Plain text note-taking assistant go 17.0% $0.30 8
154 pls-rs/pls pls is a prettier and powerful ls(1) for the pros. rs 16.9% $0.24 9
155 arq5x/bedtools2 bedtools - the swiss army knife for genome arithmetic c 16.8% $0.34 7
156 stacked-git/stgit Stacked Git rs 16.3% $0.23 8
157 htop-dev/htop htop - an interactive process viewer c 15.3% $0.28 8
158 sharkdp/bat A cat(1) clone with wings. rs 13.4% $0.26 9
159 chirlu/sox SoX, Swiss Army knife of sound processing c 13.3% $0.23 7
160 tinycc/tinycc Unofficial mirror of mob development branch c 12.8% $0.43 12
161 BLAKE3-team/BLAKE3 the official Rust and C implementations of the BLAKE3 cryptographic hash function rs 11.9% $0.23 10
162 hooklift/gowsdl WSDL2Go code generation as well as its SOAP proxy go 11.8% $0.38 10
163 paradigmxyz/solar Blazingly fast, modular and contributor friendly Solidity compiler, written in Rust rs 11.5% $0.16 8
164 duckdb/duckdb DuckDB is an analytical in-process SQL database management system cpp 10.9% $0.43 13
165 rvben/rumdl Fast Markdown linter and formatter written in Rust rs 10.8% $0.36 7
166 sigoden/argc A Bash CLI framework, also a Bash command runner. rs 10.3% $0.55 16
167 ip7z/7zip 7-Zip cpp 9.9% $0.31 9
168 rust-embedded/svd2rust Generate Rust register maps (`struct`s) from SVD files rs 9.5% $0.09 6
169 ast-grep/ast-grep โšกA CLI tool for code structural search, lint and rewriting. Written in Rust rs 9.4% $0.41 11
170 alecthomas/chroma A general purpose syntax highlighter in pure Go go 9.3% $0.33 9
171 tomarrell/wrapcheck A Go linter to check that errors from external packages are wrapped go 9.2% $0.31 13
172 konradsz/igrep Interactive Grep rs 8.8% $0.21 14
173 hpjansson/chafa ๐Ÿ“บ๐Ÿ—ฟ Terminal graphics for the 21st century. c 8.7% $0.16 7
174 FiloSottile/age A simple, modern and secure encryption tool (and Go library) with small explicit keys, no config options, and UNIX-style composability. go 8.6% $0.47 14
175 johnkerl/miller Miller is like awk, sed, cut, join, and sort for name-indexed data such as CSV, TSV, and tabular JSON go 8.0% $0.32 11
176 tstack/lnav Log file navigator cpp 7.9% $0.23 9
177 LuaJIT/LuaJIT Mirror of the LuaJIT git repository c 7.7% $0.18 16
178 samtools/samtools Tools (written in C using htslib) for manipulating next-generation sequencing data c 7.1% $0.33 10
179 mikefarah/yq yq is a portable command-line YAML, JSON, XML, CSV, TOML, HCL and properties processor go 6.9% $0.76 20
180 hush-shell/hush Hush is a unix shell based on the Lua programming language rs 6.2% $0.23 10
181 OSGeo/gdal GDAL is an open source MIT licensed translator library for raster and vector geospatial data formats. cpp 5.3% $0.36 9
182 jgm/pandoc Universal markup converter hs 5.2% $0.24 8
183 FFmpeg/FFmpeg Mirror of https://git.ffmpeg.org/ffmpeg.git c 4.6% $0.39 12
184 boyter/scc Sloc, Cloc and Code: scc is a very fast accurate code counter with complexity calculations and COCOMO estimates written in pure Go go 4.5% $0.26 9
185 Nukesor/pueue :stars: Manage your shell commands. rs 3.9% $0.16 7
186 eudoxia0/hashcards A plain text-based spaced repetition system. rs 3.7% $0.48 15
187 universal-ctags/ctags A maintained ctags implementation c 3.6% $0.20 8
188 ninja-build/ninja a small build system with a focus on speed cpp 3.5% $0.20 9
189 gromacs/gromacs Public/backup repository of the GROMACS molecular simulation toolkit. Please do not mine the metadata blindly; we use https://gitlab.com/gromacs/gromacs for code review and issue tracking. cpp 3.2% $0.32 9
190 skeema/skeema Declarative pure-SQL schema management for MySQL and MariaDB go 3.1% $0.41 16
191 rbakbashev/elfcat ELF visualizer. Generates HTML files from ELF binaries. rs 2.5% $0.19 10
192 danmar/cppcheck static analysis of C/C++ code cpp 2.4% $0.24 9
193 php/php-src The PHP Interpreter c 2.2% $0.24 7
194 Lymphatus/caesium-clt Caesium Command Line Tools - Lossy/lossless image compression tool rs 1.6% $0.61 17
195 brocode/fblog Small command-line JSON Log viewer rs 1.2% $0.24 8
196 bellard/quickjs Public repository of the QuickJS Javascript Engine. c 1.2% $0.21 13
197 google/brotli Brotli compression format c 0.9% $0.52 18
198 stathissideris/ditaa ditaa is a small command-line utility that can convert diagrams drawn using ascii art ('drawings' that contain characters that resemble lines like | / - ), into proper bitmap graphics. java 0.7% $0.46 12
199 sqlite/sqlite Official Git mirror of the SQLite source tree c 0.6% $0.20 9
200 Stranger6667/jsonschema A high-performance JSON Schema validator for Rust rs 0.2% $0.18 10

Click row to see task details