Mostly nerdless

Calling jcmd Commands Programmatically

Johannes Bechberger — Mon, 20 Apr 2026 10:42:40 +0000

JCmd allows you to quickly get information on an existing JVM. This is helpful for getting things like thread dumps (see jstall for a tool that heavily relies on this). In this blog post, you’ll learn to send JCmd diagnostic commands programmatically. You can find the whole code of this blog post on GitHub.

Let’s get a sample application running:

> java Loop.java &
[1] 23462

Now we want to obtain the VM arguments and the Java command. That’s easy on the command line via jcmd:

> jcmd 23462 VM.command_line
23462:
VM Arguments:
jvm_args: --add-modules=ALL-DEFAULT 
java_command: jdk.compiler/com.sun.tools.javac.launcher.SourceLauncher Loop.java
java_class_path (initial): .
Launcher Type: SUN_STANDARD

But how can we do it programmatically? Calling jcmd always starts a new JVM, which we might not want.

This is where the Diagnostic MBean comes into play.

But first, how does all this work in the observed JVM? This is all based on the Java Management Extensions (JMX) technology, the built-in management tools of the JVM. This is essentially a native agent that runs alongside your application and opens a port that other tools like jcmd can talk to. The agent can be configured using system properties, e.g., to set the port or password.

The Java runtime library provides us with methods to connect to a JVM properly and then work with the JMX agent.

First, we attach to the JVM and obtain a VirtualMachine mirror object (please be aware that the JVM also contains another VirtualMachine class for the debugger, so don’t confuse the two):

var vm = VirtualMachine.attach(String.valueOf(pid));
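If you don’t know the pid up front, the same Attach API can enumerate the JVMs it is able to see. The following is a minimal sketch, assuming it runs on a JDK that ships the jdk.attach module; the class name ListJvms and the method describeJvms are made up for this example:

```java
import com.sun.tools.attach.VirtualMachine;
import com.sun.tools.attach.VirtualMachineDescriptor;
import java.util.ArrayList;
import java.util.List;

public class ListJvms {
    /** Returns one "pid displayName" entry per JVM visible to the Attach API. */
    public static List<String> describeJvms() {
        List<String> result = new ArrayList<>();
        for (VirtualMachineDescriptor descriptor : VirtualMachine.list()) {
            // id() is the pid as a string, displayName() is usually the main class or JAR
            result.add(descriptor.id() + " " + descriptor.displayName());
        }
        return result;
    }

    public static void main(String[] args) {
        describeJvms().forEach(System.out::println);
    }
}
```

The id() of a descriptor is the same string that VirtualMachine.attach expects.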

We can use the VM object to e.g. attach an agent or get the system properties. But what
is important in our context: We can also start the JMX agent (with and without custom properties)
using the startLocalManagementAgent method:

var serviceUrl = new JMXServiceURL(vm.startLocalManagementAgent());

This method returns a string that represents the local JMX service URL, which has the following format:

service:jmx:protocol:sap

Which in our example looks something like:

service:jmx:rmi://127.0.0.1/stub/rO...g=

The base64 string is a remote method invocation stub that contains a serialized RMI endpoint. When we decode the stub, it looks something like:

... sr.javax.management.remote.rmi.RMIServerImpl_Stubxrjava.rmi.server.RemoteStub���ɋ�exrjava.rmi.server.RemoteObject ...

This essentially encodes how to connect to the JMX RMIServer.
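To make the URL format a bit more tangible, here is a small sketch that takes such a URL apart with JMXServiceURL and checks that the decoded stub starts like a serialized Java object. Note the assumptions: the URL below is a placeholder whose "stub" consists only of the four-byte Java serialization header (it is not a real RMI stub), and the class and method names are made up for this example:

```java
import java.io.ObjectStreamConstants;
import java.net.MalformedURLException;
import java.util.Base64;
import javax.management.remote.JMXServiceURL;

public class ServiceUrlParts {
    // Placeholder URL (assumption): the "stub" is just the 4-byte Java
    // serialization header, not a real RMI stub.
    public static final String EXAMPLE = "service:jmx:rmi://127.0.0.1/stub/rO0ABQ==";

    /** Parses the string, turning the checked exception into an unchecked one. */
    public static JMXServiceURL parse(String url) {
        try {
            return new JMXServiceURL(url);
        } catch (MalformedURLException e) {
            throw new IllegalArgumentException(e);
        }
    }

    /** Decodes the base64 text that follows "/stub/" in the URL path. */
    public static byte[] decodeStub(JMXServiceURL url) {
        String path = url.getURLPath();
        return Base64.getDecoder().decode(path.substring("/stub/".length()));
    }

    /** Checks for the serialization magic number 0xACED at the start. */
    public static boolean looksSerialized(byte[] bytes) {
        if (bytes.length < 2) {
            return false;
        }
        short magic = (short) (((bytes[0] & 0xFF) << 8) | (bytes[1] & 0xFF));
        return magic == ObjectStreamConstants.STREAM_MAGIC;
    }

    public static void main(String[] args) {
        JMXServiceURL url = parse(EXAMPLE);
        System.out.println("protocol: " + url.getProtocol());
        System.out.println("host:     " + url.getHost());
        System.out.println("path:     " + url.getURLPath());
        System.out.println("serialized Java object: " + looksSerialized(decodeStub(url)));
    }
}
```

The leading "rO0" that you see in real stub URLs is simply the base64 form of that serialization header.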

Anyway: Now we connect to this endpoint via the JMXConnectorFactory
and the JMXConnector to get a proper MBeanServerConnection:

var connector = JMXConnectorFactory.connect(serviceUrl);
var mbeanServer = connector.getMBeanServerConnection();
var diagnosticCommand =  new ObjectName("com.sun.management:type=DiagnosticCommand");

This allows us to interact with JMX and the diagnostic commands. We’re almost there.

In principle, we can invoke the individual JCmd commands via the MBeanServerConnection:

Object invoke(ObjectName name,
 String operationName,
 Object[] params,
 String[] signature)

The only problem is that the operation name is not the name used with jcmd (and defined in the code for every command class):

String[] cmdArgs = new String[] { }; // currently no command arguments passed
Object[] params = new Object[] { cmdArgs };
String[] signature = new String[] { "[Ljava.lang.String;" };
var res = mbeanServer.invoke(diagnosticCommand, "vmCommandLine", params, signature);
System.out.println(res);

Instead, it is a name that adheres to the Java method name guidelines, as it’s an MBean method name:
VM.command_line becomes vmCommandLine. This transformation is implemented both in the JMXExecutor and in the DiagnosticCommandImpl, with two different implementations (the JVM handles the MBean names by iterating over all jcmd commands and comparing the transformed name to the name incoming from the JMXExecutor).

But because the OpenJDK is GPLv2 licensed, I had to create my own
MIT licensed version:

private static String transformJcmdToMBeanName(String cmd) {
    StringBuilder out = new StringBuilder();
    boolean inFirstSegment = true;
    boolean capitalizeNext = false;
    
    for (int i = 0; i < cmd.length(); i++) {
        char c = cmd.charAt(i);
        if (c == '.' || c == '_') {
            // separators are removed and next character is capitalized
            inFirstSegment = false;
            capitalizeNext = true;
            continue;
        }
    
        if (capitalizeNext) {
            out.append(Character.toUpperCase(c));
            capitalizeNext = false;
        } else if (inFirstSegment) {
            out.append(Character.toLowerCase(c));
        } else {
            out.append(c);
        }
    }
    
    return out.toString();
}
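To double-check transformed names, one option is to list the operation names that the DiagnosticCommand MBean actually advertises and compare them with the output of the method above. The following is a minimal sketch, assuming a HotSpot-based JDK. For simplicity, it queries the platform MBean server of the current JVM instead of a remote connection, and OperationNames and advertisedOperations are names made up for this example:

```java
import java.lang.management.ManagementFactory;
import java.util.Set;
import java.util.TreeSet;
import javax.management.MBeanOperationInfo;
import javax.management.MBeanServer;
import javax.management.ObjectName;

public class OperationNames {
    /** Sorted operation names advertised by the DiagnosticCommand MBean of this JVM. */
    public static Set<String> advertisedOperations() {
        try {
            MBeanServer server = ManagementFactory.getPlatformMBeanServer();
            ObjectName name = new ObjectName("com.sun.management:type=DiagnosticCommand");
            Set<String> operations = new TreeSet<>();
            for (MBeanOperationInfo operation : server.getMBeanInfo(name).getOperations()) {
                operations.add(operation.getName());
            }
            return operations;
        } catch (Exception e) {
            // e.g. the MBean is missing on a non-HotSpot JVM
            throw new IllegalStateException(e);
        }
    }

    public static void main(String[] args) {
        advertisedOperations().forEach(System.out::println);
    }
}
```

A transformed name that does not show up in this list will most likely be rejected by invoke.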

The overall code to call jcmd commands programmatically is then:

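import com.sun.tools.attach.VirtualMachine;
import javax.management.ObjectName;
import javax.management.remote.JMXConnectorFactory;
import javax.management.remote.JMXServiceURL;

public static void main(String[] args) throws Exception {
    int pid = Integer.parseInt(args[0]);
    String cmd = args[1];

    // attach to the target JVM and start (or reuse) its local JMX agent
    var vm = VirtualMachine.attach(String.valueOf(pid));
    try {
        var serviceUrl = new JMXServiceURL(vm.startLocalManagementAgent());

        // the connector is closed automatically at the end of the try block
        try (var connector = JMXConnectorFactory.connect(serviceUrl)) {
            var mbeanServer = connector.getMBeanServerConnection();
            var diagnosticCommand = new ObjectName("com.sun.management:type=DiagnosticCommand");
            String[] cmdArgs = new String[] { }; // currently no command arguments supported
            Object[] params = new Object[] { cmdArgs };
            String[] signature = new String[] { "[Ljava.lang.String;" };
            var res = mbeanServer.invoke(diagnosticCommand, transformJcmdToMBeanName(cmd), params, signature);
            System.out.println(res);
        }
    } finally {
        vm.detach(); // release the attach connection again
    }
}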
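The listing above always passes an empty argument array. As a sketch of how command arguments might be passed, the example below hands each option over as a separate string, the way you would type it after the command name on the jcmd command line. This rests on assumptions: it targets a HotSpot-based JDK, it talks to the platform MBean server of the current JVM to stay self-contained (the invoke call on a remote MBeanServerConnection has the same shape), and DiagnosticArgs and run are names made up for this example:

```java
import java.lang.management.ManagementFactory;
import javax.management.MBeanServer;
import javax.management.ObjectName;

public class DiagnosticArgs {
    /** Invokes a DiagnosticCommand operation of the current JVM with the given arguments. */
    public static String run(String operation, String... cmdArgs) {
        try {
            MBeanServer server = ManagementFactory.getPlatformMBeanServer();
            ObjectName name = new ObjectName("com.sun.management:type=DiagnosticCommand");
            Object[] params = { cmdArgs };                      // one parameter: the String[]
            String[] signature = { String[].class.getName() };  // "[Ljava.lang.String;"
            return (String) server.invoke(name, operation, params, signature);
        } catch (Exception e) {
            throw new IllegalStateException(e);
        }
    }

    public static void main(String[] args) {
        // corresponds to "jcmd <pid> Thread.print -l"
        System.out.println(run("threadPrint", "-l"));
    }
}
```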

That’s all. It took me some time to figure it out, hence this blog post. I hope it’s helpful to you. See you in a week or two for something bytecode related.

This blog post is part of my work in the SapMachine team at SAP, making profiling easier for everyone.

The post Calling jcmd Commands Programmatically appeared first on Mostly nerdless.

On HotSpot Error Files and Useful Tools

Johannes Bechberger — Thu, 16 Apr 2026 07:39:53 +0000

Ever crashed a JVM? Then you typically get a HotSpot error file, or hs_err for short. These files contain information about which crash occurred and what the JVM’s state was at the time of the crash. In this short blog post, we’ll be diving into the components of a typical hs_err file and showcase a few custom-built tools to work with them.

But before we start: A crashing JVM is normal when you develop the JVM, e.g., adding a new CPU-time profiler, but normally it should not happen (except maybe when your JVM runs out of memory), so the hs_err files are not a common sight. Still, they are important to me and my colleagues at SapMachine, which is why I’m writing this blog post.

TL;DR: I wrote an online tool for redacting hs_err files and also a syntax highlighter extension for VSCode, all as part of the jhserr project on GitHub.

Header and Summary

The initial part of such an error file is most important, as it provides the error, the error location, and information on the JVM and underlying OS versions. An example is the following:

#
# A fatal error has been detected by the Java Runtime Environment:
#
#  SIGSEGV (0xb) at pc=0x000000013000f588, pid=16540, tid=8451
#
# JRE version: OpenJDK Runtime Environment (22.0) (slowdebug build 22-internal-adhoc.jbechberger.jdk)
# Java VM: OpenJDK 64-Bit Server VM (slowdebug 22-internal-adhoc.jbechberger.jdk, mixed mode, sharing, tiered, compressed oops, compressed class ptrs, g1 gc, bsd-aarch64)
# Problematic frame:
# C  [libSmallProfiler.so+0xf588]  signalHandler(int, __siginfo*, void*)+0x464
#
# No core dump will be written. Core dumps have been disabled. To enable core dumping, try "ulimit -c unlimited" before starting Java again
#
# If you would like to submit a bug report, please visit:
#   https://bugreport.java.com/bugreport/crash.jsp
#

---------------  S U M M A R Y ------------

Command Line: -agentpath:libSmallProfiler.so=interval=0.1s math.MathParser

Host: HKJHJKLHJ, "MacBookPro17,1" arm64, 8 cores, 16G, Darwin 22.6.0, macOS 13.5.1 (22G90)
Time: Thu Aug 31 17:31:03 2023 CEST elapsed time: 1.376730 seconds (0d 0h 0m 1s)

Here you can clearly see that a segmentation fault (accessing memory it should not have) caused the JVM version 22 built by me to crash after it ran for 1.38s on an ARM MacBook. Furthermore, it shows you how the JVM was invoked, so the JVM probably crashed during some of my profiler experiments.
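If you want to pull these values out of the header programmatically, a regular expression goes a long way. The following is a minimal sketch for the signal line only, not how jhserr does it; the class name CrashHeader and the record Signal are made up for this example, and the pattern is only derived from the sample header above:

```java
import java.util.Optional;
import java.util.regex.Matcher;
import java.util.regex.Pattern;

public class CrashHeader {
    /** Fields of a signal line such as "#  SIGSEGV (0xb) at pc=..., pid=..., tid=...". */
    public record Signal(String name, String pc, long pid, long tid) {}

    // Pattern based on the sample header above; other crash kinds look different.
    private static final Pattern SIGNAL_LINE = Pattern.compile(
            "^#\\s+(\\w+) \\(0x[0-9a-fA-F]+\\) at pc=(0x[0-9a-fA-F]+), pid=(\\d+), tid=(\\d+)");

    /** Returns the parsed signal line, or empty if the line has a different shape. */
    public static Optional<Signal> parse(String line) {
        Matcher matcher = SIGNAL_LINE.matcher(line);
        if (!matcher.find()) {
            return Optional.empty();
        }
        return Optional.of(new Signal(
                matcher.group(1),
                matcher.group(2),
                Long.parseLong(matcher.group(3)),
                Long.parseLong(matcher.group(4))));
    }

    public static void main(String[] args) {
        String line = "#  SIGSEGV (0xb) at pc=0x000000013000f588, pid=16540, tid=8451";
        System.out.println(parse(line));
    }
}
```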

When a crash is caused by a failing assertion in a debug build, then you also see it:

#
# A fatal error has been detected by the Java Runtime Environment:
#
#  Internal Error (../../src/hotspot/os/posix/os_posix.cpp:803), pid=10064, tid=25091
#  assert(ms < MILLIUNITS) failed: Un-interruptable sleep, short time use only
# ...

Which is pretty neat, as it even includes the code location where the assertion failed. But what if you want to know the whole stack trace of the crashing thread? This is where the next section of the hs_err file comes to our aid:

Thread Section

This section gives you more information on the crashing thread, including the native as well as the Java frames on the stack:

---------------  T H R E A D  ---------------

Current thread (0x0000000127f09120):  JfrThreadSampler "JFR Thread Sampler" [stack: 0x00000001720f8000,0x00000001722fb000] [id=25091]

Stack: [0x00000001720f8000,0x00000001722fb000],  sp=0x00000001722fae10,  free space=2059k
Native frames: (J=compiled Java code, j=interpreted, Vv=VM code, C=native code)
V  [libjvm.dylib+0x121c6a0]  VMError::report_and_die(int, char const*, char const*, char*, Thread*, unsigned char*, void*, void*, char const*, int, unsigned long)+0x608
V  [libjvm.dylib+0x121cde0]  VMError::report_and_die(Thread*, void*, char const*, int, char const*, char const*, char*)+0x40
V  [libjvm.dylib+0x5de794]  report_vm_error(char const*, int, char const*, char const*, ...)+0x94
V  [libjvm.dylib+0xe439fc]  os::naked_short_sleep(long)+0x3c
V  [libjvm.dylib+0x979bb4]  JfrThreadSampler::run()+0x10c
V  [libjvm.dylib+0x116c118]  Thread::call_run()+0x220
V  [libjvm.dylib+0xe3c0c8]  thread_native_entry(Thread*)+0x160
C  [libsystem_pthread.dylib+0x726c]  _pthread_start+0x94

This might also include registers and their values, instructions, and the stack-to-memory mapping, helping you investigate issues with the current frame.
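The native frame lines follow a regular shape: a frame kind, the library with an offset in brackets, and the symbol. As a sketch (again not the jhserr implementation, and with the made-up names NativeFrames and Frame), a single frame line could be split like this:

```java
import java.util.Optional;
import java.util.regex.Matcher;
import java.util.regex.Pattern;

public class NativeFrames {
    /** One native frame: kind (V, C, ...), library, offset into the library, symbol text. */
    public record Frame(String kind, String library, long offset, String symbol) {}

    // Shape taken from the sample above: KIND  [library+0xOFFSET]  symbol
    private static final Pattern FRAME = Pattern.compile(
            "^([A-Za-z]+)\\s+\\[(.+?)\\+0x([0-9a-fA-F]+)\\]\\s+(.*)$");

    /** Returns the parsed frame, or empty for lines that are not native frames. */
    public static Optional<Frame> parse(String line) {
        Matcher matcher = FRAME.matcher(line);
        if (!matcher.matches()) {
            return Optional.empty();
        }
        return Optional.of(new Frame(
                matcher.group(1),
                matcher.group(2),
                Long.parseLong(matcher.group(3), 16), // offset is hexadecimal
                matcher.group(4)));
    }

    public static void main(String[] args) {
        System.out.println(parse("V  [libjvm.dylib+0xe439fc]  os::naked_short_sleep(long)+0x3c"));
        System.out.println(parse("C  [libsystem_pthread.dylib+0x726c]  _pthread_start+0x94"));
    }
}
```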

This section is followed by information on the whole JVM process:

Process Section

The process section starts with the user and user group id, followed by some internal state, every Java and every native thread with name, thread id and status, allowing you a glimpse at all currently running threads:

---------------  P R O C E S S  ---------------

uid  : 3670 euid : 3670 gid  : 15 egid : 15

umask: 0022 (----w--w-)

Threads class SMR info:
_java_thread_list=0x0000eb2e804febf0, length=14, elements={ ...
}
_java_thread_list_alloc_cnt=17, _java_thread_list_free_cnt=15, _java_thread_list_max=15, _nested_thread_list_max=0
_tlh_cnt=64, _tlh_times=17, avg_tlh_time=0.27, _tlh_time_max=8
_deleted_thread_cnt=1, _deleted_thread_times=0, avg_deleted_thread_time=0.00, _deleted_thread_time_max=0
_delete_lock_wait_cnt=0, _delete_lock_wait_max=0
_to_delete_list_cnt=0, _to_delete_list_max=1

Java Threads: ( => current thread )
  0x0000eb2e800beb80 JavaThread "main"                              [_thread_blocked, id=643836, stack(0x0000eb2e845e2000,0x0000eb2e847e0000) (2040K)]
  0x0000eb2e803711e0 JavaThread "Reference Handler"          daemon [_thread_blocked, id=643920, stack(0x0000eb2e600dc000,0x0000eb2e602da000) (2040K)]
...

Other Threads:
  0x0000eb2e803380e0 VMThread "VM Thread"                           [id=643918, stack(0x0000eb2e60706000,0x0000eb2e60904000) (2040K)]
  0x0000eb2e8016a7e0 WatcherThread "VM Periodic Task Thread"        [id=643917, stack(0x0000eb2e604f8000,0x0000eb2e606f6000) (2040K)]
  0x0000eb2e80143df0 ConcurrentGCThread "ShenControl"               [id=643911, stack(0x0000eb2e60b62000,0x0000eb2e60d60000) (2040K)]
...
Threads with active compile tasks:
Total: 0
VM state: not at safepoint (normal execution)

This is then followed by internal heap and class space information, as well as information on every heap region in a large table (lots of internal information):

Heap Regions:
Region state: EU=empty-uncommitted, EC=empty-committed, R=regular, H=humongous start, HP=pinned humongous start
              HC=humongous continuation, CS=collection set, TR=trash, P=pinned, CSP=pinned collection set
BTE=bottom/top/end, TAMS=top-at-mark-start
UWM=update watermark, U=used
T=TLAB allocs, G=GCLAB allocs
S=shared allocs, L=live data
CP=critical pins
|    0|R  |Y|BTE     c0000000,     c0080000,     c0080000|TAMS     c0080000|UWM     c0080000|U   512K|T     0B|G     0B|S   512K|L   511K|CP   0
|    1|R  |Y|BTE     c0080000,     c0100000,     c0100000|TAMS     c0100000|UWM     c0100000|U   512K|T     0B|G     0B|S   512K|L   502K|CP   0
|    2|H  |Y|BTE     c0100000,     c0180000,     c0180000|TAMS     c0100000|UWM     c0100000|U   512K|T     0B|G     0B|S   51
...

Events Subsection

After the heap information, we see many internal events, such as compilation events:

Compilation events (250 events):
Event: 1.821 Thread 0x0000eb2e8037d7e0 nmethod 95 0x0000eb2e67612ec8 code [0x0000eb2e67612fc0, 0x0000eb2e67613160]
Event: 1.822 Thread 0x0000eb2e8049fb40   96       1       java.lang.invoke.MethodType::returnType (5 bytes)
Event: 1.822 Thread 0x0000bd9f33b3d7a0 nmethod 94 0x0000eb2e67613188 code [0x0000eb2e67613280, 0x0000eb2e67613b28]
Event: 1.822 Thread 0x0000eb2e8049fb40 nmethod 96 0x0000eb2e6f1a5848 code [0x0000eb2e6f1a5940, 0x0000eb2e6f1a5ac8]

Or some GC and Metaspace history events:

GC Heap Usage History (1 events):
Event: 2.262 {heap Before GC invocations=0 (full 0):
 Shenandoah Heap
   1024M max, 1024M soft max, 1024M committed, 105M used
  2048 x 512K regions
 Status: not cancelled
 Reserved region:
  - [0x00000000c0000000, 0x0000000100000000) 
 Collection set:
  - map (vanilla): 0x0000000000009800
  - map (biased):  0x0000000000008000

}

Metaspace Usage History (1 events):
Event: 2.262 {metaspace Before GC invocations=0 (full 0):
 Metaspace       used 794K, committed 1024K, reserved 1114112K
  class space    used 82K, committed 192K, reserved 1048576K
}

Or dynamic library-related events:

Dll operation events (8 events):
Event: 0.011 Attempting to load shared library /testfolder/output_openjdk27_dev_dbgU_linuxaarch64/testee-vm/lib/libjava.so
Event: 0.011 Loaded shared library /testfolder/output_openjdk27_dev_dbgU_linuxaarch64/testee-vm/lib/libjava.so
Event: 1.667 Attempting to load shared library /testfolder/output_openjdk27_dev_dbgU_linuxaarch64/testee-vm/lib/libnio.so
Event: 1.668 Loaded shared library /testfolder/output_openjdk27_dev_dbgU_linuxaarch64/testee-vm/lib/libnio.so

Or deoptimization and class loading events:

Deoptimization events (4 events):
Event: 2.229 Thread 0x0000eb2e8054ca90 Uncommon trap: trap_request=0xffffff45 fr.pc=0x0000eb2e6f1ab46c relative=0x000000000000016c
Event: 2.229 Thread 0x0000eb2e8054ca90 Uncommon trap: reason=unstable_if action=reinterpret pc=0x0000eb2e6f1ab46c method=java.lang.String.isLatin1()Z @ 10 c2
Event: 2.229 Thread 0x0000eb2e8054ca90 DEOPT PACKING pc=0x0000eb2e6f1ab46c sp=0x0000eb2e5a6075b0
Event: 2.229 Thread 0x0000eb2e8054ca90 DEOPT UNPACKING pc=0x0000eb2e6eaa151c sp=0x0000eb2e5a6074f0 mode 2

Classes loaded (92 events):
Event: 1.505 Loading class java/lang/invoke/MethodHandleNatives$Constants
Event: 1.505 Loading class java/lang/invoke/MethodHandleNatives$Constants done
Event: 1.557 Loading class java/lang/invoke/DirectMethodHandle$StaticAccessor
Event: 1.557 Loading class java/lang/invoke/DirectMethodHandle$StaticAccessor done
Event

And many more. You want to know what the JVM did before the crash? This is the section for you.

Dynamic Libraries Subsection

This is followed by information on the loaded dynamic libraries (and where they are loaded):

Dynamic libraries:
00008000-0000a000 rw-p 00000000 00:00 0 
c0000000-100000000 rw-p 00000000 00:00 0 
400000000-400010000 ---p 00000000 00:00 0 
400010000-400ea0000 rw-p 00010000 fc:00 3622183                          /testfolder/output_openjdk27_dev_dbgU_linuxaarch64/testee-vm/lib/server/classes_coh.jsa
400ea0000-401000000 ---p 00000000 00:00 0 
401000000-401020000 rw-p 00000000 00:00 0 
401020000-401040000 ---p 00000000 00:00 0 
401040000-401050000 rw-p 00000000 00:00 0 
401050000-441000000 ---p 00000000 00:00 0 
bd9eff9a0000-bd9eff9a2000 r-xp 00000000 fc:00 3622338                    /testfolder/output_openjdk27_dev_dbgU_linuxaarch64/testee-vm/bin/java

JVM Subsection

At the end of the file, you get information on the exact JVM arguments, all JVM options, signal handlers, and in-depth CPU information. I’m skipping these here for brevity.

Syntax Highlighting

One problem when working with the hs_err files is that there is (was) no syntax highlighting in the editor that many people use for JVM C++ development: VSCode (IntelliJ apparently already has a plugin).

So I created one:

This also includes proper section handling, an overview structure, and highlighting of the important information.

Redaction

Sometimes you want to redact sensitive information such as user, folder, and host names, environment variables, or dynamic libraries. For this reason, I created the jhserr tool, which also supports converting hs_err files to JSON and generating them back. This way, jhserr can be used by others to develop their own small hs_err-related tools.
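To give a rough idea of what redaction means at the text level, here is a deliberately naive sketch. It is not how jhserr works; it only assumes the "Host:" line format from the summary shown earlier and Unix-style home folders under /home/ or /Users/. The class name RedactSketch and the placeholder text are made up for this example:

```java
import java.util.regex.Pattern;

public class RedactSketch {
    // Host name: everything between "Host: " and the first comma of that line
    private static final Pattern HOST = Pattern.compile("(?m)^Host: [^,\\n]+");
    // User name: the path element right after /home/ or /Users/
    private static final Pattern HOME = Pattern.compile("(/home/|/Users/)[^/\\s]+");

    /** Replaces the host name and user folder names with a placeholder. */
    public static String redact(String text) {
        String withoutHost = HOST.matcher(text).replaceAll("Host: <redacted>");
        return HOME.matcher(withoutHost).replaceAll("$1<redacted>");
    }

    public static void main(String[] args) {
        System.out.println(redact("Host: HKJHJKLHJ, \"MacBookPro17,1\" arm64, 8 cores"));
        System.out.println(redact("/home/alice/app/lib/libjava.so"));
    }
}
```

A real redaction tool has to cover far more places (environment variables, command lines, library paths), which is why working on a parsed representation is the more robust route.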

But command-line tools are cumbersome when you just want to process a single hs_err file, so I used GraalVM Web Image to compile the jhserr library to WebAssembly and run it in the browser (find it here):

Conclusion

I hope you liked learning about the hs_err files and the new tools I wrote. These tools have been requested by my colleagues developing the JVM and have already proved useful. Looking into GraalVM’s Web Image technology was also interesting, as it allows me to run Java applications on the web, without additional backend servers, which is great for data privacy in the case of a redaction tool.

See you next week (hopefully) for something JCmd-related.

This blog post is part of my work in the SapMachine team at SAP, making profiling easier for everyone.


Java 26 is boring, and that’s a good thing

Johannes Bechberger — Tue, 17 Mar 2026 07:18:39 +0000

A joint article with Lutske de Leeuw.

When people hear “boring tech”, they usually mean old, slow, or not innovative. But in production, boring implies something very different. Boring means predictable, with no surprises. Boring means your system still works at 3 a.m. when nobody wants to debug a memory leak. And boring also means that you can understand your system years after you wrote it.

Many platforms try to impress developers with significant changes, shiny rewrites, or breaking updates. Java took another path. Java optimizes for trust.

That choice has a cost. Java can look conservative next to languages that ship new syntax, stronger type-system guarantees, or more ambitious standard-library features much faster. From the outside, that can make Java seem like it is standing still, even as the platform improves underneath.

If a Java release feels boring, that usually means:

Your code still compiles
Your APIs will still work
Your upgrade does not turn into a rewrite project

And that’s not a weakness. That’s why Java has survived for decades.

TL;DR: Java 26 is usefully boring

If you only remember one thing: Java 26 is not flashy, but it quietly improves the runtime and platform where it matters.

JEP 522 (G1 throughput): apps can get faster (often 5-15%) without code changes.
JEP 516 (AOT object caching with any GC): better startup behavior, especially for microservices.
JEP 517 (HTTP/3): modern protocol support in the standard client, with fallback.
JEP 500 (final means final): fewer reflection hacks, more predictable behavior.
JEP 504 (remove applets): old, dead tech finally cleaned up.
JEP 524, JEP 525, JEP 526, JEP 529, JEP 530: previews/incubator maturing steadily.

Java is Boring by Design

This “boring” story is not new. It is close to Java’s original DNA from 1995:

Original design goal	Why it still matters
Simple, object-oriented, and familiar	Teams can keep large systems understandable for years
Robust and secure	Enterprise software needs predictable failure modes
Architecture-neutral and portable	Java still runs nearly everywhere
High performance	Modern JVM work keeps improving performance release after release
Interpreted, threaded, and dynamic	Concurrency and runtime behavior were core concerns from day one

If you look back at the different Java versions, the language changes since Java 8 have mostly been evolutionary rather than revolutionary.

var, records, text blocks, switch expressions, and pattern matching make Java easier to use, but they do not fundamentally change how most teams write software. That can make Java look less ambitious than newer languages.

At the same time, many newer platform features have their biggest immediate impact in libraries and frameworks, even if some, such as virtual threads, can also matter directly to application developers.

When library authors get better tools, everyone else gets better foundations for their applications. That is part of why Java can afford to stay boring.

A lot of Java’s practical innovation happens first in libraries, frameworks, build tools, and community spaces rather than in the language grammar itself. Java User Groups, conference talks, blog posts, and open-source experiments often act as the platform’s proving ground. Ideas get tested in the community long before they are standardized in the JDK. If you want to understand why Java keeps moving without constantly reinventing itself, look not just at the language, but also at the ecosystem around it, the global JUG network, and all the many conferences.

Even old Java looks Modern

For example, take:

import java.util.*;

class Example {
    Map<String, List<Map<String, Integer>>> complex;
    List<? extends Number> wildcardExtends;
    List<? super Integer> wildcardSuper;
}

Can you guess the Java version? It’s Java 5.

Or take this later snippet:

class Example {
    record Pair<T, U>(T first, U second) {}

    Pair<String, Integer> pair = new Pair<>("test", 42);
}

It’s Java 16, released in 2021. Most developers are surprised by how old common Java syntax really is. That is exactly the point. Johannes built the Java Version Game so you can test this yourself: try to guess which Java version introduced a piece of syntax, and you’ll quickly see how much of the language has been “boring” for far longer than you’d expect.

How to upgrade

For many projects, upgrading from Java 25 to Java 26 is straightforward at the language level. In the simplest case, it starts with updating the Java release version in your build configuration:

In Maven, you most often just have to change the release property:

<maven.compiler.release>26</maven.compiler.release>

Or in Gradle:

java {
    toolchain {
        languageVersion = JavaLanguageVersion.of(26)
    }
}

Modern IDEs typically highlight deprecated or removed APIs and suggest alternatives where needed. Thanks to Java’s strong backward compatibility guarantees, many applications compile and run without source changes, making the upgrade feel closer to routine maintenance than to a migration project.

That said, real upgrades are rarely just about the compiler. Teams also need to check framework support, build plugins, annotation processors, CI images, container base images, and the versions of libraries they depend on. In other words, the Java code may upgrade cleanly while the surrounding delivery stack still needs attention.
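One cheap guard against a stale CI or container image is to let the application report, or check, the Java feature version it actually runs on. A minimal sketch using Runtime.version() (the class name RuntimeCheck and the method runsOnAtLeast are made up for this example):

```java
public class RuntimeCheck {
    /** True if the running JVM has at least the given feature version (e.g. 26). */
    public static boolean runsOnAtLeast(int featureVersion) {
        return Runtime.version().feature() >= featureVersion;
    }

    public static void main(String[] args) {
        System.out.println("Running on Java " + Runtime.version().feature());
        System.out.println("At least Java 26: " + runsOnAtLeast(26));
    }
}
```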

Backward compatibility should not be an excuse never to upgrade. “It still works on Java 8” is often technical debt with a smiley face. You might avoid short-term work, but you pay with missed security updates, missed runtime performance gains, and an increasingly fragile dependency graph.

Johan Vos put it well:

“There is a small but immediate cost of upgrading. There is a huge, potentially catastrophic but not immediate cost of staying on old versions. Until the disasters become visible, people don’t want to invest. Sounds a bit like climate change.”

If your codebase is large, tools like OpenRewrite can automate repetitive migration tasks, significantly reducing upgrade effort.

Important: Only use non-LTS versions of your JDK if you are aware of the risks and willing to upgrade twice a year. Never use these releases in production. But please still try them out, give feedback to the JDK developers, and report bugs.

What Java 26 changes

Java 26 introduces many JEPs (Java Enhancement Proposals), but not all have the same impact on day-to-day development. Some changes affect how Java behaves at runtime or how you interact with core APIs. Others are previews or incubating features that are intentionally not final yet.

Honestly, the most important practical improvement in Java 26 is not in the language or its APIs. It is the improved throughput with the G1 garbage collector. In other words, an upgrade can just give you better performance. This G1 improvement is the best kind of optimization: the kind where you do almost nothing.

In this section, we focus on the changes most likely to matter to Java developers today. These are features that are enabled by default, influence performance or behavior, or represent a clear step forward for the platform.

Preview and incubating features are treated differently. They are still evolving, require explicit opt-in, and may change or even be removed in future releases. Because of that, they are summarized later in this article, with a short explanation for each JEP. The goal there is awareness, not immediate adoption.

The following paragraphs highlight the most concrete changes in Java 26. After that, we take a short look at the preview and incubating features.

G1 GC: Improve Throughput by Reducing Synchronization (JEP 522)

The Garbage-First collector (G1) is the default garbage collector, so its performance is critical. Ivan Walulya and Thomas Schatzl improved G1 by reducing the synchronization required between application and GC threads. The main result is better throughput, with possible latency benefits depending on the workload.

In the JEP’s benchmarked workloads, throughput improved by 5-15% without application changes (source). Real applications will vary, but this is still the best kind of improvement: the platform gets faster while your code stays the same. It also reflects a platform that spends significant engineering effort on the JVM itself, not just on surface-level language changes.

This is arguably the most important change in Java 26, and also the most “boring” one: meaningful performance gains without changing your application code.

Ahead-of-Time Object Caching with Any GC (JEP 516)

JEP 516 is another practical runtime improvement that is easy to miss. The idea is that the JVM can cache startup objects from a training run and reuse them on future starts. In Java 26, this is available with any garbage collector, not only G1.

# Training run: record startup-created objects
java -XX:AOTCacheOutput=app.aot -jar myapp.jar

# Later starts: load prebuilt cache
java -XX:AOTCache=app.aot -jar myapp.jar

This can improve startup behavior for microservices and containerized workloads, and it does so without requiring application code changes.

Prepare to Make Final Mean Final (JEP 500)

JEP 500 addresses a long-standing inconsistency: final fields that can still be mutated via reflection hacks. Java 26 starts warning about this behavior and prepares the ecosystem for stricter enforcement in a future release.

// This works in Java 25, but will warn in Java 26 and eventually be disallowed
class Config {
    private final String secret = "original";
}

var config = new Config();
var field = Config.class.getDeclaredField("secret");
field.setAccessible(true);
field.set(config, "hacked"); // <- final? Not really.

Example of the kind of warning you can expect in Java 26 (the exact wording may differ):

WARNING: Final field mutation via reflection is deprecated and will be disallowed in a future release
WARNING: Attempted to mutate final field Config.secret using java.lang.reflect.Field::set

Historically, libraries used tricks like this for serialization, proxies, or framework internals. The platform is now moving toward stronger guarantees, which improve predictability and optimization opportunities.
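A side effect that makes such mutation hard to reason about: javac inlines reads of final fields that are initialized with a compile-time constant. The sketch below wraps the Config example in a runnable class to show this, assuming a JDK on which reflective mutation of final fields is still permitted (Java 26 is expected to print warnings, and a future release is expected to reject the call). The class name FinalNotFinal and the method mutate are made up for this example:

```java
import java.lang.reflect.Field;

public class FinalNotFinal {
    static class Config {
        private final String secret = "original";

        String secret() {
            return secret; // compiled as the constant "original", not as a field read
        }
    }

    /** Returns { value seen by the getter, value seen via reflection } after the mutation. */
    public static String[] mutate() {
        try {
            Config config = new Config();
            Field field = Config.class.getDeclaredField("secret");
            field.setAccessible(true);
            field.set(config, "hacked");
            return new String[] { config.secret(), (String) field.get(config) };
        } catch (ReflectiveOperationException e) {
            // expected once final field mutation is disallowed
            throw new IllegalStateException(e);
        }
    }

    public static void main(String[] args) {
        String[] values = mutate();
        System.out.println("getter:     " + values[0]);
        System.out.println("reflection: " + values[1]);
    }
}
```

The getter and the reflective read disagree after the mutation, which is exactly the kind of surprise that stricter enforcement removes.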

HTTP/3 in the standard HTTP client (JEP 517)

Java 26 adds support for HTTP/3 to the standard java.net.http.HttpClient API. This allows Java applications to communicate with HTTP/3 servers using the same client that was introduced in Java 11, without requiring a new API or a rewrite. Support for HTTP/3 is explicitly opt-in and does not change the default protocol behavior.

Why does HTTP/3 matter today?

HTTP/3 is the successor to HTTP/2 and is built on top of the QUIC transport protocol rather than TCP. QUIC is a modern transport protocol designed for web traffic and runs over UDP, with TLS 1.3 integrated by default. This design reduces connection setup time, avoids head-of-line blocking, and improves performance on unreliable or high-latency networks. HTTP/3 is already widely used by modern browsers and supported by a large portion of public web infrastructure, making it increasingly relevant for client-side applications.

What does it mean that HTTP/3 lives in JDK 26?

By including HTTP/3 support in the standard HTTP Client API, Java removes the need for third-party networking libraries to access modern web protocols. Existing application code remains essentially unchanged, and developers can opt in to HTTP/3 on a per-client or per-request basis. If a server does not support HTTP/3, the client transparently falls back to HTTP/2 or HTTP/1.1. This approach preserves backward compatibility while allowing applications to adopt newer protocols at their own pace.

If you want to try it, the API usage is intentionally simple:

var client = HttpClient.newBuilder()
    .version(HttpClient.Version.HTTP_3)
    .build();

var request = HttpRequest.newBuilder()
    .uri(URI.create("https://example.com/api"))
    .build();

var response = client.send(request, HttpResponse.BodyHandlers.ofString());
System.out.println(response.statusCode());

Do you need HTTP/3?

I asked a Netty developer, and his answer essentially was: “You’ll be fine without it. HTTP/2 is good enough.”

HTTP is often not the bottleneck, and HTTP/2 is usually good enough. It is still useful for Java to keep pace with newer protocols, even if many teams will only use HTTP/3 once their frameworks and infrastructure adopt it.

The end of Java Applets (JEP 504)

Java applets were once the way you implemented interactive web applications if you didn’t want to use Flash. As an example:

import java.applet.Applet;
import java.awt.Button;
import java.awt.Graphics;
import java.awt.event.ActionEvent;
import java.awt.event.ActionListener;

public class HelloApplet extends Applet {
    private String message = "Hello from an Applet";
    private int clicks = 0;

    @Override
    public void init() {
        Button button = new Button("Click me");
        button.addActionListener(new ActionListener() {
            @Override
            public void actionPerformed(ActionEvent e) {
                clicks++;
                message = "Clicks: " + clicks;
                repaint(); // trigger redraw
            }
        });
        add(button);
    }

    @Override
    public void paint(Graphics g) {
        g.drawString(message, 20, 40);
    }
}

You could embed this via:

<html>
  <body>
    <applet code="HelloApplet.class" width="300" height="100">
      Your browser does not support Java applets.
    </applet>
  </body>
</html>

Java 26 removes the Applet API. This is not a breaking change in practice, but the final step of a very long deprecation process.

Applets have been obsolete for years. Browser support had already collapsed after NPAPI disappeared, and Java followed with a long, explicit off-ramp: the Applet API was deprecated in Java 9 in 2017, the appletviewer tool was removed in Java 11, the API was deprecated for removal in Java 17, and Java 26 now removes it entirely. With the Security Manager permanently disabled, there is no longer any technical basis for running applets safely.

As a result, Java 26 removes the java.applet package, along with related classes such as JApplet. For most developers, this has zero impact. Code that was still dependent on applets was already tied to older Java versions. And modern alternatives have existed for a long, long time.

This change is a good example of Java’s careful evolution: applets were once a core feature, and the Java platform phased them out gradually, with years of notice, clear migration paths, and a quiet removal once the API had lost its relevance. This is how boring works.

Preview and incubating features

Beyond PEM, Java 26 includes several preview and incubating features. The short version is simple: they are interesting, they are opt-in, and most teams can safely watch them mature from a distance for now.

If you want to try them, you must enable previews explicitly:

javac --enable-preview --release 26 PrimitivePatternExample.java
java --enable-preview PrimitivePatternExample

The short version

JEP	Feature	Why it matters
JEP 530	Primitive patterns	Makes pattern matching more consistent by covering primitive types too.
JEP 526	Lazy constants	Standardizes a common lazy-initialization pattern with proper safety guarantees.
JEP 529	Vector API	Gives performance-sensitive libraries explicit SIMD access.
JEP 525	Structured concurrency	Treats groups of related concurrent tasks as a single unit of work, which simplifies error handling and cancellation.
JEP 524	PEM	Support for the PEM encoding of cryptographic objects.

The common theme is maturity over speed. Java is still exploring these ideas publicly with feedback, rather than forcing them into production too early.

Who these features are really for

Many of the newest platform capabilities primarily help library and framework developers first. That is often how Java improves safely.

Library authors use	Typical effect for application teams
Vector API	Faster search, crypto, and data-processing libraries
Structured concurrency	Cleaner, safer framework-level parallel orchestration
Memory/FFI improvements	Better native integration in infrastructure libraries
Virtual-thread ecosystem support	Higher throughput with less thread-management pain
PEM	Direct support for the PEM encoding in the standard library, so less boilerplate and fewer external dependencies.

When those libraries improve, most teams benefit without having to rewrite business logic.

You don’t need these yet

Preview features are opt-in for a reason. They are there so developers can experiment, give feedback, and help shape the platform before these APIs become permanent. You can ignore them today and still get plenty of value from Java 26.

The same is true, more broadly, for the short release cadence. Non-LTS releases like Java 26 mean you do not have to wait years for useful features, runtime improvements, and ecosystem feedback to arrive. Even if you do not run every short-term release in production, they still help move the platform forward faster and make the eventual LTS releases better.

And if something here looks interesting, try it. That is part of how Java improves. Preview features exist so developers can kick the tires, find rough edges, and tell the platform engineers what works and what does not.

That community loop matters. Java is not built in a vacuum, and it is not shaped only by a small group behind closed doors. It is shaped by users, library authors, framework maintainers, JUGs, conference speakers, and people who try new things early enough to say, “this works,” “this is awkward,” or “please do not ship it like this.” That participatory culture is one of the reasons Java can move carefully without standing still.

When boring is a liability

The argument for boring tech becomes much weaker when boring turns into visible lag.

Java still lacks or only partially supports several features that are standard elsewhere: built-in string interpolation, first-class null-safety, built-in JSON support, and simpler day-to-day concurrency ergonomics. Other languages often introduce these ideas to developers sooner, and developers benefit from a more modern look as a result.

That matters because language design is not only about raw capability. It also shapes how quickly new ideas spread, how much boilerplate teams accept, and whether new developers see a language as growing or merely enduring. Java’s cautious pace can therefore cost it mindshare, especially among developers comparing it with Kotlin, Rust, TypeScript, or newer JVM-adjacent tools.

Still, Java’s counterargument is stronger than “we do not need new things.” It is that some features are more valuable once they are proven, specified carefully, and integrated in a way that does not destabilize millions of existing systems. Java often arrives later, but it also tries to arrive with fewer surprises. That trade-off can be frustrating, but it is not irrational.

Java 26 improves a lot, but some long-requested ergonomics are still missing or only partially addressed:

built-in string interpolation, especially compared with the long-settled ergonomics of C# string interpolation,
first-class null-safety in the type system, as Kotlin demonstrates on the JVM,
built-in JSON support in the standard library, which many developers now expect out of the box,
simpler concurrency APIs and primitives for everyday code, where other platforms often expose structured concurrency more directly,
and probably many more.

Java is not perfect. We’re not here to pretend it is.

But none of this makes Java unusable. It does mean that Java sometimes pays a real price for its caution: Kotlin makes null-safety feel normal on the JVM, C# made string interpolation mundane years ago, many developers expect JSON handling to be built in rather than delegated immediately to Jackson or Gson, and structured concurrency often feels easier to reach for in ecosystems that expose it more directly. The upside is that Java usually adopts change with far more concern for compatibility than trendiness.

Conclusion

Well, Java 26 is boring, except for the JVM. So upgrading to the next JVM can still be worthwhile, even if the language itself has changed little.

One of Java’s great strengths is that you can still write code that looks a lot like Java 8 and nobody will notice. You can focus on your actual application instead of constantly rewriting your mental model whenever language fashions change.

Evolution over revolution is a big part of what kept Java relevant for 30 years. Careful language design and steady runtime work will likely keep it relevant for much longer.

Can Java be improved? Yes. Are some features still awkward? Yes. Does it miss things, such as string interpolation, that other languages already have? Also yes.

But Java has a vibrant community, its runtime is shaped by more than one company, and it still runs almost everywhere.

For a platform that is supposedly boring, that is a remarkably strong place to be. Java is still resilient, still improving, and still one of the safest bets you can make if you care about software that has to last.

It has never been a better time to be a Java developer. See you in another 30 years.

P.S.: If you want to see the talk version of this article, come to VoxxedDays Amsterdam 2026.

The post Java 26 is boring, and that’s a good thing appeared first on Mostly nerdless.

Writing a tiny JSON Parser

Johannes Bechberger — Tue, 10 Mar 2026 13:58:59 +0000

JSON is one of the foundational data formats, especially with modern REST APIs, but many existing libraries are either large or small and hard to comprehend. In this blog post, I’ll show you how to implement a tiny JSON parser that follows the official JSON grammar directly. This parser will not be the fastest or the most featureful, but it will be good enough for many use cases, where you need to parse simple JSON structures returned by a REST API. The resulting library is femtojson.

TLDR: I built femtojson, a tiny JSON library.

Usage

This library is as minimal as possible, only emitting existing List, Map, and primitive types, instead of custom wrapper classes:

import me.bechberger.util.json.JSONParser;
import java.io.IOException;
import java.util.Map;
import java.util.List;

public class Example {
    public static void main(String[] args) throws IOException {
        // Parse a simple object
        String jsonObject = "{\"name\": \"Alice\", \"age\": 30}";
        Map<String, Object> obj = (Map<String, Object>) JSONParser.parse(jsonObject);
        
        System.out.println(obj.get("name"));  // Output: Alice
        System.out.println(obj.get("age"));   // Output: 30
        
        // Parse an array
        String jsonArray = "[1, 2, 3, 4, 5]";
        List<Object> numbers = (List<Object>) JSONParser.parse(jsonArray);
        
        System.out.println(numbers.get(0)); // Output: 1
        
        // Parse nested structures
        String complexJson = "{\"items\": [1, \"two\", 3.14], \"active\": true}";
        Map<String, Object> complex = (Map<String, Object>) JSONParser.parse(complexJson);
        
        List<Object> items = (List<Object>) complex.get("items");
        System.out.println(items.get(1)); // Output: two
        
        Boolean active = (Boolean) complex.get("active");
        System.out.println(active); // Output: true
    }
}

The library also contains a pretty printer and tests to check that all edge cases are handled properly.
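Because the result consists only of standard collections, reaching into nested data is just a matter of casts. The following is a minimal sketch of a helper that walks a path of keys and indices through such a structure. The class JsonWalk and its get method are made up for this illustration and are not part of femtojson; the nested structure is built by hand to mirror what the parser is described to return:

```java
import java.util.List;
import java.util.Map;

class JsonWalk {
    /** Follows string keys through maps and integer indices through lists. */
    static Object get(Object root, Object... path) {
        Object current = root;
        for (Object step : path) {
            if (step instanceof String key && current instanceof Map<?, ?> map) {
                current = map.get(key);
            } else if (step instanceof Integer index && current instanceof List<?> list) {
                current = list.get(index);
            } else {
                throw new IllegalArgumentException("Cannot apply " + step + " to " + current);
            }
        }
        return current;
    }

    public static void main(String[] args) {
        // hand-built stand-in for a parsed {"items": [1, "two", 3.14], "active": true}
        Object parsed = Map.of("items", List.of(1, "two", 3.14), "active", true);
        System.out.println(get(parsed, "items", 1)); // prints "two"
        System.out.println(get(parsed, "active"));   // prints "true"
    }
}
```

The trade-off of this design is that type errors only show up at runtime, which is acceptable for small tools that consume a known JSON shape.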

We start this blog post by looking at the grammar:

JSON Grammar

The grammar in McKeeman form with my own annotations and slightly reordered to make it more readable:

json   # this is the entry point to the grammar
   element

element # a JSON element is a JSON value surrounded by optional whitespace
        # e.g. '  1  ' is also valid JSON
    ws value ws

value   # a JSON value is either a primitive, 
        # an array, or an object
   object
   array
   string
   number
   "true"
   "false"
   "null"

object  # a JSON object is either empty ('{ }') or has members
    '{' ws '}'
    '{' members '}'

members # members is a comma separated list
    member
    member ',' members

member  # a member is '"key": value', with arbitrary whitespace 
    ws string ws ':' element

array   # an array is either empty or has elements
    '[' ws ']'
    '[' elements ']'

elements # elements is like members
    element
    element ',' elements

string  # a string is characters inside '"'
    '"' characters '"'

characters
    ""
    character characters

character
    '0020' . '10FFFF' - '"' - '\'
    '\' escape

escape
    '"'
    '\'
    '/'
    'b'
    'f'
    'n'
    'r'
    't'
    'u' hex hex hex hex

hex
    digit
    'A' . 'F'
    'a' . 'f'

number
    integer fraction exponent

integer
    digit
    onenine digits
    '-' digit
    '-' onenine digits

digits
    digit
    digit digits

digit
    '0'
    onenine

onenine
    '1' . '9'

fraction
    ""
    '.' digits

exponent
    ""
    'E' sign digits
    'e' sign digits

sign
    ""
    '+'
    '-'

ws   # all supported whitespace characters (as hex codepoints)
    ""
    '0020' ws
    '000A' ws
    '000D' ws
    '0009' ws

Transforming the Grammar

We could start using this grammar directly to implement the parser, but we want to simplify it. Ideally, we want a grammar where, for every rule, we can decide between the options using as few characters as possible. With JSON, we can achieve this with only one character of so-called lookahead.
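To make the one-character lookahead concrete: for the value rule, the first character alone tells us which alternative applies. A minimal sketch (the class and method names are invented for this illustration and are not taken from femtojson):

```java
class Lookahead {
    /** Returns the name of the 'value' alternative selected by the first character. */
    static String ruleFor(char c) {
        return switch (c) {
            case '{' -> "object";
            case '[' -> "array";
            case '"' -> "string";
            case 't' -> "true";
            case 'f' -> "false";
            case 'n' -> "null";
            default -> {
                // numbers start with a minus sign or a digit
                if (c == '-' || (c >= '0' && c <= '9')) yield "number";
                throw new IllegalArgumentException("Unexpected character: " + c);
            }
        };
    }

    public static void main(String[] args) {
        for (String json : new String[]{"{\"a\": 1}", "[1, 2]", "\"hi\"", "-3.5e2", "true", "null"}) {
            System.out.println(json + " -> " + ruleFor(json.charAt(0)));
        }
    }
}
```

No alternative shares its first character with another one, which is why a single character suffices here.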

The only problematic bits are the rules that repeat themselves, sometimes with a character in between. Let’s rewrite the grammar, introducing the star operator (zero or more repetitions) into our previously pure McKeeman form:

json
   element

element
    ws value ws

value
   object
   array
   string
   number
   "true"
   "false"
   "null"

object  # a JSON object is either empty ('{ }') or has members
    '{' ws '}'
    '{' member (',' member)* '}'

member  # a member is '"key": value', with arbitrary whitespace 
    ws string ws ':' element

array   # an array is either empty or has elements
    '[' ws ']'
    '[' element (',' element)* ']'

string  # a string is characters inside '"'
    '"' character* '"'

character # essentially all non control characters excluding '"' and '\'
    '0020' . '10FFFF' - '"' - '\'
    '\' escape

escape   # the characters that can be escaped + special characters
    '"'
    '\'
    '/'
    'b'
    'f'
    'n'
    'r'
    't'
    'u' hex hex hex hex

hex     # valid hexadecimal character
    digit
    'A' . 'F'
    'a' . 'f'

number  # numbers are floating-point values with optional fraction and exponent
    integer fraction exponent

integer
    digit
    onenine digits
    '-' digit
    '-' onenine digits

digits
    digit digit*

digit
    '0'
    onenine

onenine
    '1' . '9'

fraction
    ""
    '.' digits

exponent
    ""
    'E' sign digits
    'e' sign digits

sign
    ""
    '+'
    '-'

ws   # all supported whitespace characters (as hex codepoints)
    ""
    '0020' ws
    '000A' ws
    '000D' ws
    '0009' ws

Implementing the Rules

This grammar can now be easily implemented by creating a method for every rule:

/*
R
  'a' A
  'b' B
*/
void parseR() {
  if (current == 'a') {
    advance(); // consume 'a'
    parseA();
  } else if (current == 'b') {
    advance(); // consume 'b'
    parseB();
  } else {
    // neither alternative matches the lookahead character
    throw new ParseException(...);
  }
}

Star expressions can be expressed as loops:

/*
R
  A (',' A)*
*/
... parseR() {
  parseA();
  while (current == ',') {
    advance(); // consume ','
    parseA();
  }
}

Of course, one would collect the results of the parse steps. You can read more about these so-called recursive descent parsers on Wikipedia.
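To show how the pieces fit together, here is a small, self-contained sketch of such a parser for a deliberately reduced grammar: only arrays and integers. It is not the femtojson implementation; the class name and the simplified integer rule (which, unlike JSON, accepts leading zeros) are assumptions made for this illustration:

```java
import java.util.ArrayList;
import java.util.List;

class TinyArrayParser {
    private final String input;
    private int pos = 0;

    private TinyArrayParser(String input) { this.input = input; }

    static Object parse(String input) {
        TinyArrayParser p = new TinyArrayParser(input);
        Object result = p.parseElement();
        if (p.pos < input.length()) throw p.error("Trailing characters");
        return result;
    }

    // the one character of lookahead; '\0' marks the end of the input
    private char current() { return pos < input.length() ? input.charAt(pos) : '\0'; }

    private static boolean isDigit(char c) { return c >= '0' && c <= '9'; }

    private IllegalArgumentException error(String message) {
        return new IllegalArgumentException(message + " at position " + pos);
    }

    private void expect(char c) {
        if (current() != c) throw error("Expected '" + c + "'");
        pos++;
    }

    private void skipWs() {
        while (current() == ' ' || current() == '\n' || current() == '\r' || current() == '\t') pos++;
    }

    // element: ws value ws
    private Object parseElement() {
        skipWs();
        Object value = parseValue();
        skipWs();
        return value;
    }

    // value: array | integer (reduced on purpose)
    private Object parseValue() {
        char c = current();
        if (c == '[') return parseArray();
        if (c == '-' || isDigit(c)) return parseInteger();
        throw error("Unexpected character");
    }

    // array: '[' ws ']' | '[' element (',' element)* ']'
    private List<Object> parseArray() {
        expect('[');
        List<Object> result = new ArrayList<>();
        skipWs();
        if (current() == ']') { pos++; return result; }
        result.add(parseElement());
        while (current() == ',') {
            pos++; // consume ','
            result.add(parseElement());
        }
        expect(']');
        return result;
    }

    // integer: optional '-' followed by one or more digits
    private Long parseInteger() {
        int start = pos;
        if (current() == '-') pos++;
        if (!isDigit(current())) throw error("Expected digit");
        while (isDigit(current())) pos++;
        return Long.parseLong(input.substring(start, pos));
    }

    public static void main(String[] args) {
        System.out.println(parse(" [1, [2, -3], [ ], 4] ")); // prints [1, [2, -3], [], 4]
    }
}
```

Every method maps to one grammar rule, and each decision is made by looking only at current().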

A great thing about a parser that exposes individual rules is that we can use the same parser to parse subsets of the JSON grammar, e.g., only allow objects at the top level, which makes the API simpler to use.

Conclusion

With some grammar engineering, we can build a grammar that is straightforward to implement as a recursive descent parser. The resulting parser library with a pretty printer is only 12KB in size.

See you next week with something JDK specific on JCmd and JMX Diagnostic Beans.

This blog post is part of my work in the SapMachine team at SAP, making profiling easier for everyone.

The post Writing a tiny JSON Parser appeared first on Mostly nerdless.

Redacting Data from Heap Dumps via hprof-redact

Johannes Bechberger — Tue, 24 Feb 2026 12:53:43 +0000

Two weeks ago, I showed you how Redacting Sensitive Data from Java Flight Recorder Files is possible using my new jfr-redact tool. This tool also supports redacting information from hs-error files, but it doesn’t handle heap dumps. Sadly, there is currently no support in OpenJDK for redacting these files directly, to quote Volker Simonis’ comment under my last blog post:

There’s also “JDK-8337517: Redacted Heap Dumps” (https://bugs.openjdk.org/browse/JDK-8337517) which unfortunately didn’t receive enough support from upstream

Well, there is now the tool hprof-redact that allows you to easily null all primitives and strings in the heap dump and even implement your own basic redactions when using it as a library. It’s a small tool (written with femtocli, of course) under MIT license, which we’ll cover in this blog post. Please be aware that it is still an early prototype, but it might already be useful:

./hprof-redact source.hprof output.hprof

But first, what are heap dumps?

Heap Dumps

Heap dumps are snapshots of a Java application’s heap. A heap dump essentially contains all live objects, along with optional thread stacks. You can obtain a heap dump via jmap:

jmap -dump:file=file.hprof PID

This will result in a large binary file. The jmap tool also supports directly compressing the file while writing via -dump:gz=[1 to 9, 1 fastest compression, 9 highest].

You can work with the heap dump using a couple of open-source tools:

hprof-slurp: a heap-dump analyzer written in Rust that supports showing a summary of the instances per class
Eclipse Memory Analyzer Tool: A powerful UI tool to analyze heap dumps. If you like JMC, you’ll like this tool.

Apparently, there’s also support for viewing heap dumps in the Ultimate version of IntelliJ, but I haven’t used it in a while.

The great thing about the OpenJDK heap dump format, compared to the JFR format, is that the heap dump format is somewhat formally specified in a comment in the heapDumper.cpp implementation.
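As a small taste of that format: according to that comment, a dump starts with a zero-terminated version string, followed by a four-byte identifier size and an eight-byte timestamp (high word first), all big-endian. The following sketch reads just this header. It is not code from hprof-redact; the class name and the in-memory sample are made up for this illustration:

```java
import java.nio.ByteBuffer;
import java.nio.charset.StandardCharsets;

class HprofHeaderSketch {
    record Header(String version, int idSize, long timestampMillis) {}

    /** Reads the file header from a buffer positioned at the start of a dump. */
    static Header read(ByteBuffer buf) {
        StringBuilder version = new StringBuilder();
        // zero-terminated ASCII string, e.g. "JAVA PROFILE 1.0.2"
        for (byte b = buf.get(); b != 0; b = buf.get()) version.append((char) b);
        int idSize = buf.getInt();     // size of object identifiers in bytes
        long timestamp = buf.getLong(); // milliseconds since the epoch (ByteBuffer is big-endian by default)
        return new Header(version.toString(), idSize, timestamp);
    }

    /** Builds a fake header in memory so the sketch runs without a real dump. */
    static ByteBuffer sample() {
        byte[] version = "JAVA PROFILE 1.0.2".getBytes(StandardCharsets.US_ASCII);
        ByteBuffer buf = ByteBuffer.allocate(version.length + 1 + 4 + 8);
        buf.put(version).put((byte) 0).putInt(8).putLong(1_700_000_000_000L);
        return buf.flip();
    }

    public static void main(String[] args) {
        System.out.println(read(sample()));
    }
}
```

To inspect a real dump, it should be enough to wrap the first few dozen bytes of the file in a ByteBuffer and pass them to read.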

Why do we need to redact?

Heap dumps contain everything you store on the heap. Consider the following example, where, as part of your application, you have a configuration object with a user name and a secret:

record Configuration(String user, String password) {
}
public class SecretFieldTest {

    public static void main(String[] args) 
      throws InterruptedException {
        
        Configuration config = 
            new Configuration("admin-user", "very-secret-password");

        System.out.println("Press Ctrl+C to exit...");
        Thread.sleep(Long.MAX_VALUE);
    }
}

In this case, a heap dump will clearly contain the one Configuration instance with the very secret password, which you probably do not want to leak. Also, the password itself isn’t helpful for analyzing your heap anyway.

Sadly, I was unable to get MAT to give me the actual string values, but hprof-slurp and its --listStrings option had me covered.

> java test_programs/SecretFieldTest.java &
> jmap -dump:file=file.hprof 28287
> hprof-slurp --inputFile file.hprof --listStrings | grep "very-secret"
very-secret-password

Maybe MAT not showing the primitive and string values is a sign of how unimportant the actual values are.

Using hprof-redact

Hprof-redact is built to be as simple as possible. It just does what Henry Lin wanted to do in JDK-8337517: It transforms primitive values and strings. You can download hprof-redact from GitHub releases or use it via jbang (jbang hprof-redact@parttimenerd/hprof-redact):

Usage: hprof-redact [-hV] [--transformer=<transformer>] [--verbose] <input>
                    <output>
Stream and redact HPROF heap dumps.
      <input>                        Input HPROF path.
      <output>                       Output HPROF path or '-' for stdout.
  -h, --help                         Show this help message and exit.
  -t, --transformer=<transformer>    Transformer to apply (default: zero).
                                     Options: zero (zero primitives + string
                                     contents), zero-strings (zero string
                                     contents only), drop-strings (empty string
                                     contents).
  -v, --verbose                      Log changed field values (primitive fields
                                     only) to stderr.
  -V, --version                      Print version information and exit.

The zero and zero-strings transformers replace the strings with strings of the same size, only consisting of null-bytes. This ensures that your heap instances have the same size and that you can still detect when large strings are a problem. Of course, this might leak a tiny bit of information, and replacing all strings with empty strings also drastically reduces the size of the heap dump.
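The difference between the two strategies can be sketched on raw UTF-8 bytes. This only illustrates the idea and is not hprof-redact’s actual code; the class and method names are made up:

```java
import java.nio.charset.StandardCharsets;

class StringRedactionSketch {
    /** Same length as the input, but only null bytes (the idea behind zero and zero-strings). */
    static byte[] zero(byte[] utf8) {
        return new byte[utf8.length]; // Java arrays are zero-initialized
    }

    /** Empty replacement (the idea behind drop-strings). */
    static byte[] drop(byte[] utf8) {
        return new byte[0];
    }

    public static void main(String[] args) {
        byte[] secret = "very-secret-password".getBytes(StandardCharsets.UTF_8);
        System.out.println("original: " + secret.length + " bytes");
        System.out.println("zeroed:   " + zero(secret).length + " bytes");
        System.out.println("dropped:  " + drop(secret).length + " bytes");
    }
}
```

Zeroing keeps the size information (and thereby leaks the length of the secret), while dropping hides the length but changes the sizes an analysis tool will report.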

When looking at the previous example, we can zero all strings, including the secret of our file.hprof via

target/hprof-redact file.hprof redacted.hprof

Running hprof-slurp on the redacted file shows no secret value.

hprof-slurp --listStrings still lists many other strings, including those related to class names, methods, and more, which are not redacted by default. I’m mainly focusing on redacting primitives and strings with hprof-redact after all.

hprof-redact tries to be fast and to save memory by parsing the file in two passes: the first pass scans for metadata records to build a mapping of ID to name kind, and the second pass applies transformations to strings and primitive values based on their kind and to heap dump records based on class and field information. Heap dumps can get large, so this is important.

Implementing your own redaction

Having only three simple redaction transformers (at the time of writing) might seem limiting. Still, first of all, you’re always welcome to contribute your own transformers to the project: just open a pull request in the GitHub repository.

But you can also use hprof-redact as a library:


<dependency>
    <groupId>me.bechberger</groupId>
    <artifactId>hprof-redact</artifactId>
    <version>0.1.1</version>
</dependency>

Make sure to check for the latest version of the library.

Just implement the HprofTransformer interface. As an example, let’s write a transformer that replaces every string value with "REDACTED" and every integer value with 42:

import me.bechberger.hprof.transformer.HprofTransformer;

public class MyTransformer implements HprofTransformer {
    @Override
    public String transformUtf8String(String value) {
        return "REDACTED";
    }
    
    @Override
    public int transformInt(int value) {
        return 42;
    }
}

You can use it on a heap dump file as follows:

import me.bechberger.hprof.HprofRedact;

void main() throws IOException {
    HprofRedact.process(
        Path.of("input.hprof"),
        Path.of("output.hprof"),
        new MyTransformer());
}

Conclusion

hprof-redact is a simple tool and library that solves a minor pain point in working with heap dumps, featuring a custom, fast heap dump parser and a simple command-line interface.

Thanks for coming this far, I hope you find hprof-redact useful too. See you in another week, probably with something on JSON and parsers.

This blog post is part of my work in the SapMachine team at SAP, making profiling easier for everyone.

The post Redacting Data from Heap Dumps via hprof-redact appeared first on Mostly nerdless.

Femtocli: A small but mighty CLI library for small CLI tools in < 45KB

Johannes Bechberger — Mon, 16 Feb 2026 08:20:02 +0000

TL;DR: I built femtocli, a small command-line parsing library with sub-commands, an annotation-based API, and support for parsing Java agent arguments.

Every command-line tool that I, and probably you, write needs a command-line interface (CLI). Parsing one or two options manually is possible, but it becomes tedious fast, especially when the tool grows and we add more options and subcommands.

Usually, you would add a library like the fantastic picocli to your project and be fine. It’s a really easy-to-use CLI library with all the features that you would want. So you can write a simple CLI quite quickly and declaratively:

import picocli.CommandLine;
import picocli.CommandLine.Command;
import picocli.CommandLine.Option;

@Command(name = "demo", description = "A demo application")
public class Main implements Runnable {

    @CommandLine.Parameters(index = "0", description = "Some parameter")
    String parameter;

    @Option(names = "--some-flag")
    boolean someFlag;

    @Override
    public void run() { /* print */ }

    public static void main(String[] args) {
        new CommandLine(new Main()).execute(args);
    }
}

The only problem? While the application JAR itself is only around 3KB for these tiny examples, the application JAR with the Picocli 4.7.7 dependency is 407KB.

Is this a problem? It might not be for you, but it certainly can be for small tools like jstall, where the Picocli dependency accounted for over half the JAR size in the minimal version, which is kind of a lot just for the option parsing. Of course, in many situations you don’t care, as there are other libraries, e.g., for JSON parsing, that can easily add multiple megabytes. But for tiny CLI helper tools, it can matter.

I couldn’t find any maintained CLI library with a picocli-like declarative approach and sub-commands under 50KB, so I built one. And the best part: I was able to add all the features I liked (converters, custom footers, …) and ignore the ones I don’t usually need (GraalVM support, ANSI colors, …).

The new library is called femtocli; it requires Java 17 and is MIT licensed:

package me.bechberger.femtocli.examples;

import me.bechberger.femtocli.FemtoCli;
import me.bechberger.femtocli.annotations.Command;
import me.bechberger.femtocli.annotations.Option;

import java.util.concurrent.Callable;

@Command(name = "greet", description = "Greet a person")
class GreetCommand implements Callable<Integer> {
    @Option(names = {"-n", "--name"}, description = "Name to greet", required = true)
    String name;

    @Option(names = {"-c", "--count"}, description = "Count (default: ${DEFAULT-VALUE})", defaultValue = "1")
    int count;

    @Override
    public Integer call() {
        for (int i = 0; i < count; i++) System.out.println("Hello, " + name + "!");
        return 0;
    }
}

@Command(name = "myapp", description = "My CLI application", version = "1.0.0",
        subcommands = {GreetCommand.class})
public class QuickStart implements Runnable {
    public void run() {
        System.out.println("Use 'myapp greet --help'");
    }

    public static void main(String[] args) {
        FemtoCli.run(new QuickStart(), args);
    }
}

Try it:

> ./examples/run.sh QuickStart greet --name=World --count=1
Hello, World!

As you see, I designed the library to be almost a drop-in replacement for the basic use cases. The resulting JAR is currently below 55KB, and below 45KB when using the minimal version without Java debug information.

Before I show you all the cool features of this tiny, but mighty command-line library, I want to answer the crucial question:

Should you use it?

You should definitely not use it if the JAR size is not important to you or if you develop a Java agent. Libraries like Picocli offer much more functionality, are better tested, and have better documentation. They evolved over time. Femtocli has not. You can find a good list of libraries at Tim’s list and awesome-java.

But if you are adventurous, can live with the occasional bug and frequent releases, need a small library, and still want a good set of features, then give the CLI library a try.

Also, one of the benefits of having my own command-line parsing library is that I can add features that are important to me, but probably to many other people as well. One stand-out feature, besides the small size, is the ability to parse Java agent-like arguments, where a comma separates parameters and options:

Agent Mode

This is a feature that I found in no other command-line parsing library. As the avid reader of my blog knows, I quite like to develop Java agents, e.g., for my post Who instruments the instrumenters?, but these agents are passed a single string. It is a convention to pass arguments comma-separated. Usually I would write an ad-hoc library or awkwardly parse arguments and convert them to the normal style, to be parsed with picocli.
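Such ad-hoc parsing tends to look roughly like the following sketch. It is purely illustrative (the class and record names are invented) and shows the typical weakness: without knowing the declared options, a bare flag like verbose cannot be told apart from a positional parameter.

```java
import java.util.ArrayList;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;

class AdHocAgentArgs {
    record Parsed(List<String> positionals, Map<String, String> options) {}

    /** Splits "a,b,key=value" into bare tokens and key=value pairs. */
    static Parsed parse(String agentArgs) {
        List<String> positionals = new ArrayList<>();
        Map<String, String> options = new LinkedHashMap<>();
        if (agentArgs == null || agentArgs.isBlank()) return new Parsed(positionals, options);
        for (String token : agentArgs.split(",")) {
            if (token.isEmpty()) continue; // ignore empty segments like "a,,b"
            int eq = token.indexOf('=');
            if (eq >= 0) {
                options.put(token.substring(0, eq), token.substring(eq + 1));
            } else {
                // could be a subcommand, a positional parameter, or a boolean flag
                positionals.add(token);
            }
        }
        return new Parsed(positionals, options);
    }

    public static void main(String[] args) {
        System.out.println(parse("stop,jfr,output=file.jfr,verbose"));
    }
}
```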

With femtocli, this is no longer necessary:

package me.bechberger.femtocli.examples;

import me.bechberger.femtocli.FemtoCli;
import me.bechberger.femtocli.annotations.Command;
import me.bechberger.femtocli.annotations.Option;
import me.bechberger.femtocli.annotations.Parameters;

import java.time.Duration;
import java.util.concurrent.Callable;

/**
 * Example showcasing FemtoCli agent args mode (comma-separated arguments).
 * <p>
 * Example invocations:
 * <ul>
 *   <li>{@code start,interval=1ms}</li>
 *   <li>{@code stop,output=file.jfr,verbose}</li>
 *   <li>{@code help}</li>
 *   <li>{@code version}</li>
 * </ul>
 */
@Command(
        name = "agent-cli",
        description = "Demo CLI for agent args mode",
        version = "1.0.0",
        subcommands = {AgentCli.Start.class, AgentCli.Stop.class},
        mixinStandardHelpOptions = true
)
public class AgentCli implements Runnable {

    @Override
    public void run() {
        // default action
        System.out.println("Try: start,interval=1ms or stop,output=file.jfr,verbose");
    }

    @Command(name = "start", description = "Start recording", mixinStandardHelpOptions = true)
    public static class Start implements Callable<Integer> {

        @Option(names = "--interval", defaultValue = "1ms", description = "Sampling interval")
        Duration interval;

        @Override
        public Integer call() {
            System.out.println("start: interval=" + interval);
            return 0;
        }
    }

    @Command(name = "stop", description = "Stop recording", mixinStandardHelpOptions = true)
    public static class Stop implements Callable<Integer> {
        @Parameters
        String mode;

        @Option(names = "--output", required = true, description = "Output file")
        String output;

        @Option(names = {"-v", "--verbose"}, description = "Verbose")
        boolean verbose;

        @Override
        public Integer call() {
            System.out.println("stop: mode=" + mode + ", output=" + output + ", verbose=" + verbose);
            return 0;
        }
    }

    public static void main(String[] args) {
        // Demonstrate agent mode if a single agent-args string is passed,
        // otherwise fall back to normal argv parsing.
        if (args.length == 1) {
            System.exit(FemtoCli.runAgent(new AgentCli(), args[0]));
        }
        System.exit(FemtoCli.run(new AgentCli(), args));
    }
}

> ./examples/run.sh AgentCli --help
Usage: agent-cli,[hV],[COMMAND]
Options:
  h, help         Show this help message and exit.
  V, version      Print version information and exit.
Commands:
  start  Start recording
  stop   Stop recording
> ./examples/run.sh AgentCli start,interval=1ms
start: interval=PT0.001S
> ./examples/run.sh AgentCli stop,jfr,output=file.jfr,verbose
stop: mode=jfr, output=file.jfr, verbose=true

But femtocli has many more features.

Femtocli’s Features

Femtocli has the following features:

Define commands with @Command (classes and subcommand methods)
Options via @Option (short/long names, required, default values, param labels, split, per-option converter, and verifiers)
Positional parameters via @Parameters (index, arity, paramLabel, defaultValue, …)
Mixins (reusable option groups) via @Mixin
Nested subcommands (classes and methods)
Multi-value options: arrays and List (repeat option or use split delimiter)
Built-in type conversion for primitive types, Path, Duration, enums, and support for custom converters
Automatic -h/--help and -V/--version flags
End-of-options marker (--)
Description placeholders (${DEFAULT-VALUE}, ${COMPLETION-CANDIDATES})
Custom header, customSynopsis, and footer in help output
Ability to hide commands and options from help output and to omit options defined in the class, in a parent class, or a mixin

A few more might follow, but the hard size limit of 55 KB (45 KB for the minimal build) is restrictive. But, well, I already increased it slightly once to incorporate important features. Is this a slippery slope? Maybe. But in the end, I want to have a library that solves my problems.
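To give an idea of what a built-in converter from the list above has to do, here is a sketch of turning a string like 1ms into a java.time.Duration. This is not femtocli’s implementation; the class name and the set of supported unit suffixes are assumptions for this illustration:

```java
import java.time.Duration;

class DurationSketch {
    /** Parses "<number><unit>", e.g. "1ms" or "30s" (hypothetical unit set). */
    static Duration parse(String text) {
        String s = text.trim();
        int i = 0;
        while (i < s.length() && s.charAt(i) >= '0' && s.charAt(i) <= '9') i++;
        if (i == 0) throw new IllegalArgumentException("Missing number in: " + text);
        long amount = Long.parseLong(s.substring(0, i));
        return switch (s.substring(i)) {
            case "ns" -> Duration.ofNanos(amount);
            case "us" -> Duration.ofNanos(amount * 1_000);
            case "ms" -> Duration.ofMillis(amount);
            case "s" -> Duration.ofSeconds(amount);
            case "m" -> Duration.ofMinutes(amount);
            case "h" -> Duration.ofHours(amount);
            default -> throw new IllegalArgumentException("Unknown unit in: " + text);
        };
    }

    public static void main(String[] args) {
        System.out.println(parse("1ms")); // prints PT0.001S
    }
}
```

The printed ISO-8601 form matches what the agent example shows for interval=1ms.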

Usage

Add the library as a dependency in your project (< 55KB):


<dependency>
  <groupId>me.bechberger.util</groupId>
  <artifactId>femtocli</artifactId>
  <version>0.2.0</version>
</dependency>

And for the minimal version without debug metadata (< 45KB):


<dependency>
  <groupId>me.bechberger.util</groupId>
  <artifactId>femtocli-minimal</artifactId>
  <version>0.2.0</version>
</dependency>

I would recommend using the minimal version mostly for releases, as the missing debug information makes it harder to debug the CLI library when something goes wrong.

Be aware that the library is under active development, so the version numbers mentioned here are just a snapshot at the time of writing.

In the following, I’ll show you more examples of what femtocli is capable of. You’ll find an up-to-date example section in the project’s README. All examples can be built and run in the examples folder of the main repository. So give it a try.

Subcommands as Methods

You saw in the introduction that femtocli supports subcommand classes, but it also supports subcommand methods for convenience:

package me.bechberger.femtocli.examples;

import me.bechberger.femtocli.FemtoCli;
import me.bechberger.femtocli.annotations.Command;

@Command(name = "myapp")
public class SubcommandMethod implements Runnable {
    @Command(name = "status", description = "Show status")
    int status() {
        System.out.println("OK");
        return 0;
    }

    @Override
    public void run() {
    }

    public static void main(String[] args) {
        FemtoCli.run(new SubcommandMethod(), args);
    }
}

> ./examples/run.sh SubcommandMethod status
OK

Positional Parameters

Femtocli also supports parameters identified by their position and arity, with optional parameter labels:

package me.bechberger.femtocli.examples;

import me.bechberger.femtocli.FemtoCli;
import me.bechberger.femtocli.annotations.Parameters;

import java.util.List;

/**
 * Shows how to use positional parameters.
 * Positional parameters are defined by their index and are not prefixed by an option name.
 */
public class PositionalParameters implements Runnable {
    @Parameters(index = "0", paramLabel = "FILE", description = "Input file")
    String file;

    @Parameters(index = "1..*", paramLabel = "ARGS", description = "Extra arguments")
    List<String> args;

    @Override
    public void run() {
        System.out.println("File: " + file);
        System.out.println("Args: " + args);
    }

    public static void main(String[] args) {
        FemtoCli.run(new PositionalParameters(), args);
    }
}

> ./examples/run.sh PositionalParameters in.txt arg1 arg2
File: in.txt
Args: [arg1, arg2]
> ./examples/run.sh PositionalParameters --help
Usage: positionalparameters [-hV] FILE [ARGS...]
      FILE         Input file
      [ARGS...]    Extra arguments
  -h, --help       Show this help message and exit.
  -V, --version    Print version information and exit.

Mixins

Like any good command-line parsing library, femtocli also supports mixins to share options between commands:

package me.bechberger.femtocli.examples;

import me.bechberger.femtocli.FemtoCli;
import me.bechberger.femtocli.annotations.Command;
import me.bechberger.femtocli.annotations.Mixin;
import me.bechberger.femtocli.annotations.Option;

/**
 * Shows how to use mixins to share options between subcommands. Run with "a -v" or "b -v" to see the effect.
 */
@Command(name = "mixins", subcommands = {MixinsAndSubcommands.A.class, MixinsAndSubcommands.B.class})
public class MixinsAndSubcommands implements Runnable {
    /** Example how to use mixins to share options between commands */
    static class Common {
        @Option(names = {"-v", "--verbose"})
        boolean verbose;
    }

    @Command(name = "a")
    static class A implements Runnable {
        @Mixin
        Common common;

        public void run() {
            System.out.println("Verbose: " + common.verbose);
        }
    }

    @Command(name = "b")
    static class B implements Runnable {
        @Mixin
        Common common;

        public void run() {
            System.out.println("Verbose: " + common.verbose);
        }
    }

    @Override
    public void run() {
    }

    public static void main(String[] args) {
        FemtoCli.run(new MixinsAndSubcommands(), args);
    }
}

> ./examples/run.sh MixinsAndSubcommands a
Verbose: false
> ./examples/run.sh MixinsAndSubcommands --help
Usage: mixins [-hV] [COMMAND]
  -h, --help       Show this help message and exit.
  -V, --version    Print version information and exit.
Commands:
  a  
  b  
> ./examples/run.sh MixinsAndSubcommands a --help
Usage: mixins a [-hV] [--verbose]
  -h, --help       Show this help message and exit.
  -v, --verbose
  -V, --version    Print version information and exit.

Spec Injection

Spec fields allow you to access the current CLI session at runtime, to, e.g., access the command output stream or print the usage information:

package me.bechberger.femtocli.examples;

import me.bechberger.femtocli.FemtoCli;
import me.bechberger.femtocli.Spec;
import me.bechberger.femtocli.annotations.Command;
import me.bechberger.femtocli.annotations.Option;

import java.time.Duration;

/**
 * Example showcasing injection of the {@link Spec} object.
 * 
 * The Spec object contains the configured input and output streams,
 * as well as a method to print usage help with the same formatting as the current FemtoCli run.
 */
@Command(name = "inspect", description = "Example that uses Spec", mixinStandardHelpOptions = true)
public class SpecInjection implements Runnable {
    Spec spec; // injected

    @Option(names = {"-i", "--interval"},
            defaultValue = "10ms",
            description = "Sampling interval (default: ${DEFAULT-VALUE})")
    Duration interval;

    @Override
    public void run() {
        // Use the configured streams
        spec.out.println("interval = " + interval.toMillis());
        // Print usage with the same formatting as the current FemtoCli run
        spec.usage();
    }

    public static void main(String[] args) {
        FemtoCli.run(new SpecInjection(), args);
    }
}

> ./examples/run.sh SpecInjection --interval 10ms
interval = 10
Usage: inspect [-hV] [--interval=<interval>]
Example that uses Spec
  -h, --help                   Show this help message and exit.
  -i, --interval=<interval>    Sampling interval (default: 10ms)
  -V, --version                Print version information and exit.

Custom Type Converters

Femtocli supports parsing the primitive types and their boxing wrappers, as well as Duration and Path, but if you want more, you can bring your own converters (and you can override existing ones):

package me.bechberger.femtocli.examples;

import me.bechberger.femtocli.FemtoCli;
import me.bechberger.femtocli.TypeConverter;
import me.bechberger.femtocli.annotations.Command;
import me.bechberger.femtocli.annotations.Option;

import java.time.Duration;

/**
 * Example showcasing custom type converters.
 * 
 * Example invocation:
 * <pre>{@code
 * java CustomTypeConverters --name=hello --timeout=PT30S
 * }</pre>
 */
@Command(name = "convert")
public class CustomTypeConverters implements Runnable {

    /** Custom type converter that converts a string to uppercase. */
    public static class Upper implements TypeConverter {
        public String convert(String value) {
            return value.toUpperCase();
        }
    }

    static boolean parseOnOff(String value) {
        if (value.equalsIgnoreCase("on")) return true;
        if (value.equalsIgnoreCase("off")) return false;
        throw new IllegalArgumentException("Expected 'on' or 'off'");
    }

    @Option(names = "--name", converter = Upper.class)
    String name;

    @Option(names = "--turn", converterMethod = "parseOnOff")
    boolean turn;

    @Option(names = "--timeout")
    Duration timeout;

    @Override
    public void run() {
        System.out.println("Name: " + name);
        System.out.println("Turn: " + turn);
        System.out.println("Timeout: " + timeout);
    }

    public static void main(String[] args) {
        FemtoCli.builder()
                .registerType(java.time.Duration.class, java.time.Duration::parse)
                .run(new CustomTypeConverters(), args);
    }
}

> ./examples/run.sh CustomTypeConverters --name=max --turn on --timeout=PT10S
Name: MAX
Turn: true
Timeout: PT10S

Enum Support

Of course, femtocli also supports enums and Picocli-like ${COMPLETION-CANDIDATES} placeholders in descriptions:

package me.bechberger.femtocli.examples;

import me.bechberger.femtocli.FemtoCli;
import me.bechberger.femtocli.annotations.Command;
import me.bechberger.femtocli.annotations.Option;

@Command(name = "enums")
public class EnumsAndCompletionCandidates implements Runnable {
    enum Mode { fast, safe }

    @Option(names = "--mode",
            defaultValue = "safe",
            description = "Mode (${COMPLETION-CANDIDATES}), default: ${DEFAULT-VALUE}")
    Mode mode;

    public void run() {
        System.out.println("Mode: " + mode);
    }

    public static void main(String[] args) {
        FemtoCli.run(new EnumsAndCompletionCandidates(), args);
    }
}

> ./examples/run.sh EnumsAndCompletionCandidates
Mode: safe
> ./examples/run.sh EnumsAndCompletionCandidates --mode fast
Mode: fast
> ./examples/run.sh EnumsAndCompletionCandidates --help
Usage: enums [-hV] [--mode=<mode>]
  -h, --help       Show this help message and exit.
      --mode=<mode>
                   Mode (fast, safe), default: safe
  -V, --version    Print version information and exit.
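
To make the placeholder expansion a bit more concrete, here is a minimal plain-Java sketch of how such a substitution could work. It is not femtocli's actual implementation; the class PlaceholderSketch and the helper expand are made up for this illustration:

```java
import java.util.Arrays;
import java.util.stream.Collectors;

final class PlaceholderSketch {
    enum Mode { fast, safe }

    /**
     * Hypothetical helper: replaces the two placeholders in a description.
     * The candidates are the enum constants in declaration order.
     */
    static String expand(String description,
                         Class<? extends Enum<?>> enumType,
                         String defaultValue) {
        String candidates = Arrays.stream(enumType.getEnumConstants())
                .map(constant -> constant.name())
                .collect(Collectors.joining(", "));
        return description
                .replace("${COMPLETION-CANDIDATES}", candidates)
                .replace("${DEFAULT-VALUE}", defaultValue);
    }

    public static void main(String[] args) {
        // prints "Mode (fast, safe), default: safe"
        System.out.println(expand(
                "Mode (${COMPLETION-CANDIDATES}), default: ${DEFAULT-VALUE}",
                Mode.class, "safe"));
    }
}
```

The output matches the description line in the help text above, which is all this sketch is meant to show.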

Custom Header, Footer, and Synopsis

You can customize the help messages a tiny bit, not as deeply as with other libraries, but probably deep enough:

package me.bechberger.femtocli.examples;

import me.bechberger.femtocli.FemtoCli;
import me.bechberger.femtocli.annotations.Command;
import me.bechberger.femtocli.annotations.Option;

/**
 * A command with a custom header and synopsis.
 * The header is printed above the usage message, and the synopsis replaces the default usage line.
 */
@Command(
        name = "mytool",
        header = {"My Tool", "Copyright 2026"},
        customSynopsis = {"Usage: mytool [OPTIONS] "},
        description = "Process files",
        footer = """
                Examples:
                  mytool --flag
                """
)
public class CustomHeaderAndSynopsis implements Runnable {

    @Option(names = "--flag")
    boolean flag = false;

    public void run() {
    }

    public static void main(String[] args) {
        FemtoCli.run(new CustomHeaderAndSynopsis(), args);
    }
}

> ./examples/run.sh CustomHeaderAndSynopsis --help
My Tool
Copyright 2026
Usage: mytool [OPTIONS] 
Process files
      --flag
  -h, --help       Show this help message and exit.
  -V, --version    Print version information and exit.

Examples:
  mytool --flag

Global Configuration

Femtocli allows you to configure a few settings globally, like the shown version, making it easier to maintain consistency:

package me.bechberger.femtocli.examples;

import me.bechberger.femtocli.FemtoCli;

public class GlobalConfiguration implements Runnable {

    @Override
    public void run() {
    }

    public static void main(String[] args) {
        FemtoCli.builder()
                .commandConfig(c -> {
                    c.version = "1.2.3";
                })
                .run(new GlobalConfiguration(), args);
    }
}

And a few more features…

Conclusion

Making a tailor-made command-line library for my everyday use cases as a builder of small tools is really fun and, honestly, quite a bit of work. But I’m pretty happy with the results. It’s a small library with an API not too dissimilar to Picocli, but with a few nice additions. I might not use this library for everything, but it’s good enough that I already use it in tools like jstall (Quickly Inspect your Java Application with JStall), where it serves me well. I hope this library is helpful for you, too. It’s MIT-licensed, and I’m open to any suggestions, bug reports, or pull requests.

Thanks for coming along with me, and see you next week with a blog post on redacting heap dump files.

This blog post is part of my work in the SapMachine team at SAP, making profiling easier for everyone.

P.S.: I had a great time last week at JFokus, meeting so many awesome people.

The post Femtocli: A small but mighty CLI library for small CLI tools in < 45KB appeared first on Mostly nerdless.

Redacting Sensitive Data from Java Flight Recorder Files

Johannes Bechberger — Fri, 13 Feb 2026 11:35:54 +0000

A few weeks ago, I showed you how to read and write JFR files programmatically. This week we’re using the basic-jfr-processor covered there to create a fully fledged (yet still experimental) JFR and hserr file redaction tool called jfr-redact.

TL;DR: Download jfr-redact from GitHub and redact sensitive information like user names, tokens, and keys from files via:

# Using the JAR directly
java -jar jfr-redact.jar redact recording.jfr

# Redact text files
java -jar jfr-redact.jar redact-text hs_err.log

Foundations

JFR events like jdk.InitialEnvironmentVariable make it really easy to leak information, as keys and tokens might be passed via environment variables. Additionally, socket I/O-related events might leak internal hostnames, ports, and more, to name a few.

So we want to remove specific events and redact certain event properties. The OpenJDK already provides us with a basic tool for the former: jfr scrub

jfr scrub subcommand

Use jfr scrub to remove sensitive contents from a file or to reduce its size.

The syntax is:

jfr scrub [--include-events <filter>] [--exclude-events <filter>] [--include-categories <filter>] [--exclude-categories <filter>] [--include-threads <filter>] [--exclude-threads <filter>] <input-file> [<output-file>]

--include-events <filter>
Select events matching an event name.
--exclude-events <filter>
Exclude events matching an event name.
--include-categories <filter>
Select events matching a category name.
--exclude-categories <filter>
Exclude events matching a category name.
--include-threads <filter>
Select events matching a thread name.
--exclude-threads <filter>
Exclude events matching a thread name.
<input-file>
The input file to read events from.
<output-file>
The output file to write filter events to.
Documentation for jfr scrub
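
The same kind of event-level filtering is also available from Java code: since JDK 19, jdk.jfr.consumer.RecordingFile has a write method that takes a predicate over events. The following is a minimal sketch, assuming we want to drop all jdk.InitialEnvironmentVariable events; the class ScrubSketch and its helper methods are made up for this illustration:

```java
import jdk.jfr.consumer.RecordedEvent;
import jdk.jfr.consumer.RecordingFile;

import java.io.IOException;
import java.nio.file.Path;
import java.util.Set;
import java.util.function.Predicate;

final class ScrubSketch {
    /** Keeps every event whose type name is not in the excluded set. */
    static boolean shouldKeep(String eventTypeName, Set<String> excluded) {
        return !excluded.contains(eventTypeName);
    }

    /** Copies input to output, dropping all events of the excluded types. */
    static void scrub(Path input, Path output, Set<String> excluded) throws IOException {
        Predicate<RecordedEvent> filter =
                event -> shouldKeep(event.getEventType().getName(), excluded);
        try (RecordingFile file = new RecordingFile(input)) {
            file.write(output, filter); // RecordingFile.write exists since JDK 19
        }
    }

    public static void main(String[] args) throws IOException {
        if (args.length < 2) {
            System.err.println("Usage: ScrubSketch INPUT OUTPUT");
            return;
        }
        scrub(Path.of(args[0]), Path.of(args[1]),
                Set.of("jdk.InitialEnvironmentVariable"));
    }
}
```

Like jfr scrub itself, this only keeps or drops whole events.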

But this tool cannot filter properties. There are already ideas to perform simple property filtering directly during JFR recording (see JBS issue). Still, it remains to be seen when and how it will be implemented and integrated.

So we need to implement our own.

Using Basic-JFR-Processor to Build a Simple Redactor

basic-jfr-processor makes it really easy to, for example, create a redaction tool that removes all jdk.InitialEnvironmentVariable events and redacts all properties named “token” and “port” (source):

public static void main(String[] args) {
    if (args.length < 2) {
        System.err.println(
          "Usage: SimpleRedactorExample  ");
        System.exit(2);
    }

    Path input = Path.of(args[0]);
    Path output = Path.of(args[1]);

    // Create a modifier that drops events
    JFREventModifier modifier = new JFREventModifier() {
        @Override
        public boolean shouldRemoveEvent(RecordedEvent event) {
            return event.getEventType().getName()
              .equals("jdk.InitialEnvironmentVariable");
        }

        @Override
        public String process(String fieldName, String value) {
            if (fieldName.equals("token")) {
                return "";
            }
            return value;
        }

        @Override
        public int process(String fieldName, int value) {
            if (fieldName.equals("port")) {
                return 0;
            }
            return value;
        }
    };

    JFRProcessor processor = new JFRProcessor(modifier, input);

    try (FileOutputStream out = 
           new FileOutputStream(output.toFile())) {
        // process(...) returns a RecordingImpl
        // that should be closed to finalize the file
        RecordingImpl result = processor.process(out);
        // Close the recording to flush any remaining data
        result.close();
    } catch (IOException e) {
        e.printStackTrace();
        System.exit(1);
    }
}

We can extend this to make it more configurable and to identify sensitive strings in a discovery phase and then replace them throughout the file. This is what my jfr-redact project does.

JFR-Redact

It has the following features (from the README):

Property Redaction: Redact sensitive properties in events with key and value fields
- Patterns: password, passwort, pwd, secret, token, key, … (case-insensitive)
Event Removal: Remove entire event types that could leak information
- Examples: jdk.OSInformation, SystemProcess, InitialEnvironmentVariable, ProcessStart
Event Filtering: Advanced filtering similar to jfr scrub command (docs)
- Filter by event name, category, or thread name
- Supports glob patterns (*, ?) and comma-separated lists
- Include/exclude filters with flexible combinations
String Pattern Redaction: Redact sensitive patterns in string fields
- Home folders: /Users/[^/]+, C:\Users\[a-zA-Z0-9_\-]+, /home/[^/]+
- e-mail addresses, UUIDs, IP addresses
- Configurable to exclude method names, class names, or thread names
Two-Pass Discovery: Automatically discover sensitive values and redact them everywhere
- First pass: Extract usernames, hostnames, and other values from patterns (e.g., extract johndoe from /Users/johndoe)
- Second pass: Redact discovered values wherever they appear in the file
- Configurable minimum occurrences and allowed lists to reduce false positives
- Use --discovery-mode=fast for single-pass (faster), --discovery-mode=default for two-pass (more thorough)
Words Mode: Discover and redact specific words/identifiers
- Discover all distinct words in a file: jfr-redact words discover recording.jfr
- Create rules to keep or redact specific words
- Apply rules: jfr-redact words redact app.log redacted.log -r rules.txt
Network Redaction: Redact ports and addresses from SocketRead/SocketWrite events
Path Redaction: Redact directory paths while keeping filenames (configurable)
Pseudonymization: Preserve relationships between values while protecting data
- Hash mode: Consistent mapping to pseudonyms (e.g., )
- Counter mode: Sequential numbering (value1→1, value2→2)
- Realistic mode: Generate plausible alternatives (e.g., john.doe@company.com → alice.smith@test.com)
- Custom replacements: Define specific mappings in config (e.g., johndoe → alice, /home/johndoe → /home/testuser)
- Optional, enabled via --pseudonymize flag
Text File Redaction: Apply the same redaction patterns to arbitrary text files
- Perfect for redacting Java error logs (hs_err_pid*.log), which contain system properties, environment variables, and file paths
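
The two-pass discovery from the list above can be sketched in a few lines of plain Java. This is only an illustration of the idea, not the code of jfr-redact: the class TwoPassSketch and its methods are made up, it works on text lines instead of JFR events, and it ignores minimum occurrences and allowed lists:

```java
import java.util.LinkedHashSet;
import java.util.List;
import java.util.Set;
import java.util.regex.Matcher;
import java.util.regex.Pattern;

final class TwoPassSketch {
    // Home-folder patterns from the feature list, with a capture group for the
    // user name; excluding whitespace is an assumption made for this sketch
    private static final Pattern HOME =
            Pattern.compile("(?:/Users/|/home/)([^/\\s]+)");

    /** First pass: collect user names that appear inside home-folder paths. */
    static Set<String> discover(List<String> lines) {
        Set<String> found = new LinkedHashSet<>();
        for (String line : lines) {
            Matcher m = HOME.matcher(line);
            while (m.find()) {
                found.add(m.group(1));
            }
        }
        return found;
    }

    /** Second pass: replace every discovered value wherever it appears. */
    static String redact(String line, Set<String> discovered) {
        String result = line;
        for (String value : discovered) {
            result = result.replace(value, "***");
        }
        return result;
    }

    public static void main(String[] args) {
        List<String> lines = List.of(
                "java.home = /Users/johndoe/jdk",
                "thread started by johndoe");
        Set<String> discovered = discover(lines); // contains "johndoe"
        for (String line : lines) {
            // prints "java.home = /Users/***/jdk" and "thread started by ***"
            System.out.println(redact(line, discovered));
        }
    }
}
```

The second line contains no path at all, so a single pass with the home-folder pattern alone would have missed it.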

As I mentioned earlier, it’s highly experimental, so we offer no guarantees that it works correctly; however, it should capture most sensitive information.

After downloading it, you basically just call it, using either a custom config or a predefined one:

jfr-redact redact recording.jfr redacted.jfr --config strict

It supports a superset of the features of jfr scrub. There are three additional interesting modes that I would like to showcase: the text redaction and the words mode, as well as the ability to concatenate multiple JFR files.

Text Redaction

You can configure jfr-redact with custom string redaction rules, which jfr-redact applies to strings within JFR events. One example is the redaction of IP addresses:

    # IP addresses
    ip_addresses:
      enabled: true
      patterns:
        - '\b(?:[0-9]{1,3}\.){3}[0-9]{1,3}\b'
        - '\b(?:[0-9a-fA-F]{1,4}:){7}[0-9a-fA-F]{1,4}\b'

Such rules can be used to create custom config files for basic text redaction. A common form of text that I often encounter in JDK development is error files generated when a JVM exits involuntarily, known as hserr files. This is why I created a special hserr.yaml config, which removes all information from hserr files that might be sensitive.
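
To see what the two IP address patterns from the config excerpt match, here is a minimal sketch that applies them with java.util.regex. The class IpRedactionSketch is made up for this illustration, and the *** replacement text is an assumption borrowed from the redacted excerpts in this post:

```java
import java.util.List;
import java.util.regex.Pattern;

final class IpRedactionSketch {
    // The two patterns from the config excerpt: IPv4 and full (uncompressed) IPv6
    private static final List<Pattern> PATTERNS = List.of(
            Pattern.compile("\\b(?:[0-9]{1,3}\\.){3}[0-9]{1,3}\\b"),
            Pattern.compile("\\b(?:[0-9a-fA-F]{1,4}:){7}[0-9a-fA-F]{1,4}\\b"));

    /** Replaces every match of every pattern with ***. */
    static String redact(String text) {
        String result = text;
        for (Pattern pattern : PATTERNS) {
            result = pattern.matcher(result).replaceAll("***");
        }
        return result;
    }

    public static void main(String[] args) {
        // prints "connect to ***:8080 failed"
        System.out.println(redact("connect to 192.168.1.17:8080 failed"));
    }
}
```

Note that the IPv4 pattern matches any four dot-separated groups of up to three digits, so it does not check that each group is at most 255, and the IPv6 pattern does not cover the compressed :: notation.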

You can use the text-redaction feature as follows:

# Redact a Java error log file (hs_err_pid*.log)
# Uses the preset hserr by default
java -jar jfr-redact.jar redact-text hs_err_pid12345.log hs_err_pid12345.redacted.log

# Redact any text file with pseudonymization
java -jar jfr-redact.jar redact-text debug-output.txt debug-output.redacted.txt --pseudonymize

You can also provide custom configuration files via the --config option. As an example of the result, an excerpt like

OS:
uname: Darwin FVF 22.6.0 Darwin Kernel Version 22.6.0: Wed Jul  5 22:22:52 PDT 2023; root:xnu-8796.141.3~6/RELEASE_ARM64_T8103 arm64
OS uptime: 3 days 22:33 hours
rlimit (soft/hard): STACK 8176k/65520k , CORE 0k/infinity , NPROC 2666/4000 , NOFILE 10240/infinity , AS infinity/infinity , CPU infinity/infinity , DATA infinity/infinity , FSIZE infinity/infinity , MEMLOCK infinity/infinity , RSS infinity/infinity
load average: 9.25 11.63 11.44

CPU: total 8 (initial active 8) 0x61:0x0:0x1b588bb3:0, fp, asimd, aes, pmull, sha1, sha256, crc32, lse, sha3, sha512

is replaced with:

OS:
uname: Darwin *** 22.6.0 Darwin Kernel Version 22.6.0: Wed Jul  5 22:22:52 PDT 2023; root:***~6/RELEASE_ARM64_T8103 arm64
OS uptime: ***
rlimit (soft/hard): STACK 8176k/65520k , CORE 0k/infinity , NPROC 2666/4000 , NOFILE 10240/infinity , AS infinity/infinity , CPU infinity/infinity , DATA infinity/infinity , FSIZE infinity/infinity , MEMLOCK infinity/infinity , RSS infinity/infinity
load average: 9.25 11.63 11.44

CPU: total 8 (initial active 8) 0x61:0x0:0x1b588bb3:0, fp, asimd, aes, pmull, sha1, sha256, crc32, lse, sha3, sha512

Words Mode

Perhaps you don’t trust the complex redaction engine; maybe you want to review the individual words in the file and redact specific words. jfr-redact has you covered with a mode that Götz, a colleague of mine, requested. It models his own workflow:

1. Get all words in a file, sorted alphabetically. Words must match [a-zA-Z0-9_\\-+/]+, contain at least one letter, and must not be hexadecimal numbers.
2. Read through all the words to check whether you spot something sensitive.
3. Redact all sensitive words.

This is more work than auto-redaction, but less than reading through a whole document/JFR printout. jfr-redact can’t help you with step 2, but for step 1 it has the words discover command, and for step 3 the words redact command. As always, you can read more about these commands in the README; I provide a brief overview in the following.

Consider the hserr file excerpt from the previous section. Let’s store it in a file called hserr.log and start the discovery:

> jfr-redact words discover hserr.log words.txt

Successfully wrote 47 words to:
  .../words.txt

The generated file looks as follows:

0k/infinity
10240/infinity
6/RELEASE_ARM64_T8103
8176k/65520k
AS
CORE
CPU
DATA
Darwin
FSIZE
FVF
...
xnu-8796.141.3

Consider now that we find xnu-8796.141.3 to be sensitive. We can just prefix its line with - (all non-prefixed lines are ignored):

0k/infinity
...
- xnu-8796.141.3

Now we can run the redaction of the hserr file using the rules in the words file:

> jfr-redact words redact hserr.log hserr.redacted.log -r words.txt
Loaded 1 redaction rules
Redacting text file: hserr.log
Processed 7 lines total
Redacted 1 lines
Processed 50 unique values: 1 redacted, 0 kept
Wrote redacted output to: hserr.redacted.log

The resulting file has, as expected, the string xnu-8796.141.3 replaced with ***:

OS:
uname: Darwin FVF 22.6.0 Darwin Kernel Version 22.6.0: Wed Jul  5 22:22:52 PDT 2023; root:***~6/RELEASE_ARM64_T8103 arm64
OS uptime: 3 days 22:33 hours
rlimit (soft/hard): STACK 8176k/65520k , CORE 0k/infinity , NPROC 2666/4000 , NOFILE 10240/infinity , AS infinity/infinity , CPU infinity/infinity , DATA infinity/infinity , FSIZE infinity/infinity , MEMLOCK infinity/infinity , RSS infinity/infinity
load average: 9.25 11.63 11.44

CPU: total 8 (initial active 8) 0x61:0x0:0x1b588bb3:0, fp, asimd, aes, pmull, sha1, sha256, crc32, lse, sha3, sha512

It’s essential to note that only single words are matched. The redaction rules also support glob patterns, as well as rules for keeping and replacing words. Read more about this in the help documentation of words redact.
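
The discover and redact steps can be sketched in plain Java as follows. This is a simplified illustration, not the code of jfr-redact: the class WordsSketch is made up, and the word pattern is an assumption chosen so that xnu-8796.141.3 counts as a single word; the real tool's definition may differ:

```java
import java.util.Set;
import java.util.TreeSet;
import java.util.regex.Matcher;
import java.util.regex.Pattern;

final class WordsSketch {
    // Assumed word pattern for this sketch; the real tool's definition may differ
    private static final Pattern WORD = Pattern.compile("[A-Za-z0-9_.+/-]+");
    private static final Pattern HEX = Pattern.compile("(?:0x)?[0-9a-fA-F]+");

    /** Step 1: distinct words, sorted, that contain a letter and are not hex numbers. */
    static Set<String> discover(String text) {
        Set<String> words = new TreeSet<>();
        Matcher m = WORD.matcher(text);
        while (m.find()) {
            String word = m.group();
            boolean hasLetter = word.chars().anyMatch(Character::isLetter);
            if (hasLetter && !HEX.matcher(word).matches()) {
                words.add(word);
            }
        }
        return words;
    }

    /** Step 3: replace every whole word that is listed in the rules with ***. */
    static String redact(String text, Set<String> toRedact) {
        Matcher m = WORD.matcher(text);
        StringBuilder out = new StringBuilder();
        while (m.find()) {
            String replacement = toRedact.contains(m.group()) ? "***" : m.group();
            m.appendReplacement(out, Matcher.quoteReplacement(replacement));
        }
        m.appendTail(out);
        return out.toString();
    }

    public static void main(String[] args) {
        String line = "uname: Darwin FVF 22.6.0; root:xnu-8796.141.3~6/RELEASE_ARM64_T8103 arm64";
        System.out.println(discover(line));
        // prints "uname: Darwin FVF 22.6.0; root:***~6/RELEASE_ARM64_T8103 arm64"
        System.out.println(redact(line, Set.of("xnu-8796.141.3")));
    }
}
```

Because the rules are compared against whole words, a rule for xnu would not touch xnu-8796.141.3 in this sketch.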

Concat Mode

A feature that is not directly related to redaction but to JFR processing in general is the ability to concatenate multiple JFR files without any processing:

# Concatenate two JFR files
java -jar jfr-redact.jar concat one.jfr two.jfr -o combined.jfr

This process takes a long time, especially for larger files, possibly due to the JMC writer API not prioritizing performance. Concatenating 250MB of JFR takes, for example, around 15 minutes on my MacBook Pro M4.

An interesting observation is that passing a single large file (e.g., a 242 MB one) to concat reduces its size substantially (to 182 MB in my case). So you can also use the concatenation for compression. The reason for this is that a JFR file typically consists of multiple parts, each with its own constant pool. But the JMC writer API only creates one part, so there is only one instance of any constant, which saves space.
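
A toy model shows where the saving comes from. The sketch below is not related to the real JFR file format; it just counts how many constant pool entries are needed when every part keeps its own pool versus when all parts share one. The class ConstantPoolSketch and the sample constants are made up:

```java
import java.util.HashSet;
import java.util.List;
import java.util.Set;

final class ConstantPoolSketch {
    /** Total entries when every part stores its own copy of each constant it uses. */
    static int entriesPerPart(List<List<String>> parts) {
        int total = 0;
        for (List<String> part : parts) {
            total += new HashSet<>(part).size();
        }
        return total;
    }

    /** Entries when all parts share a single constant pool. */
    static int entriesMerged(List<List<String>> parts) {
        Set<String> pool = new HashSet<>();
        parts.forEach(pool::addAll);
        return pool.size();
    }

    public static void main(String[] args) {
        List<List<String>> parts = List.of(
                List.of("java.lang.String", "main", "java.lang.String"),
                List.of("java.lang.String", "main", "worker-1"));
        System.out.println("per part: " + entriesPerPart(parts)); // 2 + 3 = 5
        System.out.println("merged:   " + entriesMerged(parts));  // 3
    }
}
```

The more constants the parts have in common, the larger the difference between the two numbers.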

Usage as a Library

You can, of course, use jfr-redact also as a library:


<dependency>
  <groupId>me.bechberger</groupId>
  <artifactId>jfr-redact</artifactId>
  <version>0.1.2</version>
</dependency>

Refer to the tests and individual command implementations to see how to use the library.

Conclusion

I always wanted to have a small tool to remove information from JFR files, and recently found the time to implement this tool, building a few smaller libraries along the way. jfr-redact enables the easy redaction of sensitive information from JFR and text files, and even supports pseudonymization.

I hope this tool is as helpful for you as it is for me. See you in another week with something different.

This blog post is part of my work in the SapMachine team at SAP, making profiling easier for everyone.

The post Redacting Sensitive Data from Java Flight Recorder Files appeared first on Mostly nerdless.

Implement a new JStall Feature with Me

Johannes Bechberger — Mon, 09 Feb 2026 12:56:43 +0000

Or: How I use GitHub Copilot to go from idea to feature

A few weeks back, I introduced you to jstall (Quickly Inspect your Java Application with JStall), a tool that analyses what your JVM is currently doing. This week is the first time I’m bringing you in on the development and letting you peek behind the curtains to see how I go from idea to implemented feature. The feature I’ll implement is the jvm-support analysis, which checks that the JVM running your application is not outdated.

This is the first time I recorded my development process, so I hope you still liked it.

See you on another day for something command-line parser- or redaction-related.

This blog post is part of my work in the SapMachine team at SAP, making profiling easier for everyone.

The post Implement a new JStall Feature with Me appeared first on Mostly nerdless.

The Java Version Quiz

Johannes Bechberger — Tue, 03 Feb 2026 07:30:35 +0000

Over the last 30 years, Java has added many features, including generics, lambdas, pattern matching, and records. You surely know that lambdas were introduced in Java 8 and records in Java 16, but can you distinguish the other Java versions?

I felt I couldn’t, so I created a tiny little Java game: The Java Version Quiz. In this quiz, you get a Java snippet and have to decide between five different Java versions. Pick the smallest Java version where the snippet is valid code (without using preview features).

The screenshot shows the alpha version of the game, which includes features introduced in Java 1.0alpha2 and 1.0alpha3, including bug fixes. A game version for only the nerdiest of Java connoisseurs. The source is a dump of the alpha2 and alpha3 packages on GitHub.

The game focuses on Java language differences and major runtime differences, which are easy to check without semantic analysis. It’s a by-product of another fun little project.

I hope you learn some new features of Java and discover that it is evolving over the years, while still keeping the syntax similar enough that it’s hard to spot the differences between versions. And if you’re unsure what a specific feature in the shown code snippet is, the quiz gives you a handy description.

If you find any issues or have new code examples, feel free to contribute to the quiz on GitHub.

See you in another week for another blog post on something JFR-related.

The post The Java Version Quiz appeared first on Mostly nerdless.

Reproducing a Tricky Bug in Minutes With a Custom Linux Scheduler Written in Java

Johannes Bechberger — Mon, 26 Jan 2026 08:24:28 +0000

Ever had a tricky bug caused by a race condition or rare concurrency condition that was really hard to reproduce? It’s great when you have a fix that should work in theory, but without a reproducer, only time will tell whether your fix really worked. In this blog post, we’ll revisit my old blog post Hello eBPF: Concurrency Testing using Custom Linux Schedulers (19), and try to use the concurrency-fuzz-scheduler to reproduce a bug I fixed a while ago in the OpenJDK.

The scheduler aims to be as chaotic as possible; hence, Jake Hillion’s Rust version is called scx_chaos. But we’ll focus on the Java version, the concurrency-fuzz-scheduler, because it’s not only implemented in Java on top of my hello-ebpf library, but it’s also optimized for fuzzing Java applications, inserting random sleeps at the scheduler level with a focus on non-VM threads.

TL;DR: The concurrency scheduler is a nice tool to provoke rare parallelism conditions and create reproducers.

The bug in question is JDK-8366486, reported by David Holmes in August 2025: A test case that checks that we can run multiple recordings with the CPU-time sampler in direct succession does work. The only problem: The test should not work, but it still worked most of the time. If you’re only interested in the actual bug, skip ahead to the end of the blog post for an explanation.

You’ll find the fixed version here and the broken version here (because the old JDK with the actual bug had compilation issues on my current system, I had to reintroduce the bug in a separate branch).

Let’s start with running the test case with the standard Linux scheduler on a large machine, so that everything can run nicely in parallel:

Running the Test Case Normally

The test case is part of the test suite of the OpenJDK, so we can run the test via make:

make CONF=linux-x86_64-server-fastdebug test \
  TEST=jtreg:test/jdk/jdk/jfr/event/profiling/TestCPUTimeSampleMultipleRecordings.java

This automatically builds the tests for us and then runs them with the jtreg runner. But we can also call jtreg directly, by using the command stored in build/linux-x86_64-server-fastdebug/test-support/jtreg_test_jdk_jdk_jfr_event_profiling_TestCPUTimeSampleMultipleRecordings_java/jtreg.cmdline. For simplicity, we store the command in a file called test.sh.

Let’s run this using hyperfine:

> hyperfine ./test.sh --runs 50
Benchmark 1: ./test.sh
  Time (mean ± σ):     25.093 s ± 13.168 s    [User: 12.819 s, System: 1.656 s]
  Range (min … max):   10.828 s … 62.806 s    50 runs

So the test usually finishes fairly quickly and only rarely takes much longer.

Running the Test Case with the Chaotic Scheduler

Let’s run the same test case using the chaotic concurrency-fuzz-scheduler:

> ./scheduler.sh ./test.sh --log --java --timeout 200 --sleep 0.1ms,10ms --run 0.1ms,10ms
...
Iteration timed out
Killing process

Iteration Count: 11
Iteration Duration: mean=67.4s+-49.9s,min=22.0s,max=200.5s

It takes only a few minutes (usually around 10) to reach 200-second runtimes, making error reproduction far faster.

With a few minutes more, we can even reach a runtime of 380 seconds (running with --timeout 400):

Iteration Count: 29
Iteration Duration: mean=84.3s+-92.5s,min=18.6s,max=384.6s

Let’s run the fixed version of the test case for comparison:

Running the Fixed Test Case

The fixed version can run in a loop for literal hours with the chaotic scheduler, without any problems:

Iteration Count: 1119
Iteration Duration: mean=13.3s+-1.2s,min=11.4s,max=20.5s

For comparison, running it without the custom scheduler shows that the performance impact of the custom scheduler is not terrible:

> hyperfine ./test.sh --runs 50
Benchmark 1: ./test.sh
  Time (mean ± σ):      5.805 s ±  0.047 s    [User: 13.321 s, System: 1.660 s]
  Range (min … max):    5.727 s …  6.003 s    50 runs

But we can surely improve it.

The Bug

Let’s break down the issue in the test case by taking a look at the code:

public class TestCPUTimeSampleMultipleRecordings {

    static volatile boolean alive = true;

    public static void main(String[] args) throws Exception {
        // start a thread that spends time on the CPU
        Thread t = new Thread(TestCPUTimeSampleMultipleRecordings::nativeMethod);
        t.start();
        for (int i = 0; i < 2; i++) {
            try (RecordingStream rs = new RecordingStream()) {
                // enable the CPU-time sampler to record a sample per thread
                // every milli-second of CPU-time
                rs.enable(EventNames.CPUTimeSample).with("throttle", "1ms");
                rs.onEvent(EventNames.CPUTimeSample, e -> {
                    // when get our first sample, we quit the recording
                    alive = false;
                    rs.close();
                });
                // we start the recording, this call only terminates when
                // the recording is stopped
                rs.start();
            }
        }
        alive = false;
    }

    public static void nativeMethod() {
        while (alive) {
            JVM.getPid();
        }
    }
}

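Do you spot the issue? Look closely at where we set the alive variable to false: the event handler of the first recording already does it. So stopping the first recording also causes nativeMethod to eventually stop consuming CPU time, and during the second recording, the test’s own thread is no longer there to be sampled.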
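The effect can be reproduced without JFR. The following is a minimal sketch, not the actual test: the class SharedFlagSketch and its run method are made up for illustration, the event handler is replaced by a plain if statement, and the extra join only exists to make the outcome deterministic. It shows that once the first round clears the shared flag, the busy thread is already gone when the second round starts. Leaving the flag alone until the end is one possible fix, not necessarily the one that went into the test.

```java
import java.util.Arrays;

final class SharedFlagSketch {

    static volatile boolean alive = true;

    /**
     * Runs two rounds (standing in for the two recordings) and reports
     * whether the busy thread was still running at the start of each round.
     */
    static boolean[] run(boolean clearFlagInCallback) throws InterruptedException {
        alive = true;
        Thread worker = new Thread(() -> {
            while (alive) {
                Thread.onSpinWait(); // burn CPU time, like nativeMethod
            }
        });
        worker.start();
        boolean[] runningAtStart = new boolean[2];
        for (int i = 0; i < 2; i++) {
            runningAtStart[i] = worker.isAlive();
            if (clearFlagInCallback) {
                // this is what the onEvent handler of the test does
                alive = false;
                // only here to make the sketch deterministic
                worker.join();
            }
        }
        alive = false; // the intended place to stop the worker
        worker.join();
        return runningAtStart;
    }

    public static void main(String[] args) throws InterruptedException {
        System.out.println("flag cleared in callback: " + Arrays.toString(run(true)));
        System.out.println("flag cleared at the end:  " + Arrays.toString(run(false)));
    }
}
```

With the flag cleared in the callback, the second round starts without a running worker, which mirrors the second recording in the test.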

Why does it not always fail?

How does this test case even normally succeed? With a simple print statement, we find that we most often get:

CPUTimeSample event received in recording 1: {
  osName = "JFR Periodic Tasks"
  osThreadId = 219953
  javaName = "JFR Periodic Tasks"
  javaThreadId = 51
  group = {
    parent = {
      parent = {
        parent = N/A
        name = "system"
      }
      name = "main"
    }
    name = "AgentVMThreadGroup"
  }
  virtual = false
}

This thread is not related to our test case at all, but to the internals of the OpenJDK JFR implementation. The thread in question runs (source):

while (true) {
    long wait = Options.getWaitInterval(); // 1000ms
    try {
        synchronized (this) {
            if (JVM.shouldRotateDisk()) {
                rotateDisk();
            }
            if (isToDisk()) {
                EventLog.update();
            }
        }
        long minDelta = PeriodicEvents.doPeriodic();
        wait = Math.min(minDelta, Options.getWaitInterval());
    } catch (Throwable t) {
        // Catch everything and log, but don't allow it to end the periodic task
        Logger.log(JFR_SYSTEM, WARN, "Error in Periodic task: " + t.getMessage());
    } finally {
        takeNap(wait);
    }
}

The inner iteration in our test case usually takes around 0.16ms (median). With our one millisecond sampling interval, we expect to catch this quite often, depending on the scheduler’s patterns. But this is still a test bug, because the test runtime can fluctuate widely by chance. Had the test worked as intended, nativeMethod would have consumed CPU time the whole time.

The chaotic scheduler increased the likelihood that the CPU-time profiler would not sample the “JFR Periodic Tasks” thread, though the exact reason is unclear to me.

If I’m honest, without the chaotic scheduler running and showing that the test’s runtime varies widely, I wouldn’t have gone down the path of trying to understand what happened. I would have put the test timeouts down to nothing more than synchronization issues. Heck, the first draft of this blog post even mentioned race conditions in the title.

Conclusion

Of course, the chaotic scheduler is still a prototype. Still, I hope to have shown you that it’s worth exploring for testing related to race conditions, tricky concurrency issues, and creating fast reproducers. The scheduler is written directly in Java, so it would be interesting to integrate with OpenJDK-related tooling. We’re essentially helping to debug a JVM test using a Java library in the kernel.

After Jake Hillion reproduced a kernel bug with his Rust version of the scheduler, this is now the second real-world bug where chaotic scheduling helped. I hope it’s not the last one.

This article is part of my work in the SapMachine team at SAP, making profiling and debugging easier for everyone.

The post Reproducing a Tricky Bug in Minutes With a Custom Linux Scheduler Written in Java appeared first on Mostly nerdless.

Reading and Writing JFR Files Programmatically

Johannes Bechberger — Mon, 19 Jan 2026 07:41:15 +0000

Last week, I showed you the Fastest Way to get the Version of a Java Installation. This week, I’ll show you something completely different: how to interact with JFR data programmatically, showcasing a new library called basic-jfr-processor in the process.

While JFR is a great tool for profiling your application and gaining insights, the file format is, on purpose, not well documented or specified. One of the best sources of information is Gunnar Morling’s blog post on the topic, and of course, the OpenJDK source code.

But of course, there are ready-made APIs for reading JFR files and OpenJDK-adjacent libraries to write them. In this overview blog post, I’ll showcase the built-in Java JFR API, Jaroslav Bachorik’s jafar API, and the JMC JFR writer API, as well as my own basic-jfr-processor library based on the latter.

We start with the built-in API:

Reading JFR Files using Java’s API

The API is pretty simple: it consists of a RecordingFile class that allows us to parse the events in the file as RecordedEvent instances with properties that can be a primitive, a special type like RecordedStackTrace or RecordedClass, or a complex RecordedObject.

The following is an example from the documentation. It prints a histogram of all sampled methods in a file:

public static void main(String[] args) throws IOException {
    if (args.length != 1) {
        System.err.println("Must specify a recording file.");
        return;
    }

    RecordingFile.readAllEvents(Path.of(args[0])).stream()
        .filter(e -> e.getEventType().getName().equals("jdk.ExecutionSample"))
        .map(e -> e.getStackTrace())
        .filter(s -> s != null)
        .map(s -> s.getFrames().getFirst())
        .filter(f -> f.isJavaFrame())
        .map(f -> f.getMethod())
        .collect(
            Collectors.groupingBy(m -> m.getType().getName() + "." + m.getName() + " " + m.getDescriptor(),
            Collectors.counting()))
        .entrySet()
        .stream()
        .sorted((a, b) -> b.getValue().compareTo(a.getValue()))
        .forEach(e -> System.out.printf("%8d %s\n", e.getValue(), e.getKey()));
    // there is also an iterator-like API via new RecordingFile(outputFile)
}

You can access fields of an event or object by using one of the many accessor functions, like getBoolean(String name). The main advantage of this API is that it’s already built in and maintained by the JFR authors themselves. The main disadvantage is that it’s a pull-based API (we ask JFR for the next event) and that it’s rather slow.
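To make the iterator-like API and the accessor functions more concrete, here is a small sketch that only uses the built-in jdk.jfr classes. The event example.Hello, its message field, and the class JfrIteratorSketch are invented for this illustration. The sketch records a single custom event into a file and reads it back with hasMoreEvents() and readEvent(), using getString to access the field:

```java
import java.io.IOException;
import java.nio.file.Files;
import java.nio.file.Path;
import java.util.ArrayList;
import java.util.List;
import jdk.jfr.Event;
import jdk.jfr.Name;
import jdk.jfr.Recording;
import jdk.jfr.consumer.RecordedEvent;
import jdk.jfr.consumer.RecordingFile;

final class JfrIteratorSketch {

    // hypothetical custom event, only used in this sketch
    @Name("example.Hello")
    static class HelloEvent extends Event {
        String message;
    }

    /** Records one HelloEvent into the file and returns all messages read back. */
    static List<String> writeAndRead(Path file) throws IOException {
        try (Recording recording = new Recording()) {
            recording.enable(HelloEvent.class);
            recording.start();
            HelloEvent event = new HelloEvent();
            event.message = "hello";
            event.commit();
            recording.stop();
            recording.dump(file);
        }
        List<String> messages = new ArrayList<>();
        // pull-based reading: we ask the file for one event after another
        try (RecordingFile recordingFile = new RecordingFile(file)) {
            while (recordingFile.hasMoreEvents()) {
                RecordedEvent event = recordingFile.readEvent();
                if (event.getEventType().getName().equals("example.Hello")) {
                    messages.add(event.getString("message"));
                }
            }
        }
        return messages;
    }

    public static void main(String[] args) throws IOException {
        Path dir = Files.createTempDirectory("jfr-sketch");
        System.out.println(writeAndRead(dir.resolve("sketch.jfr")));
    }
}
```

Compared to readAllEvents, this variant never holds all events in memory at once, which matters for larger files.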

An alternative API is jafar by Jaroslav Bachorik:

Reading JFR Files using Jafar

Jafar allows us to directly parse JFR files into objects, removing the need for slow object-getter requests like in the previous API and making the code far more readable.

The following is an example from the documentation:

import io.jafar.parser.api.*;
import java.nio.file.Paths;

@JfrType("custom.MyEvent")
public interface MyEvent { // no base interface required
  String myfield();
}

try (TypedJafarParser p = JafarParser.newTypedParser(Paths.get("/path/to/recording.jfr"))) {
  HandlerRegistration reg = p.handle(MyEvent.class, (e, ctl) -> {
    System.out.println(e.myfield());
    long pos = ctl.stream().position(); // current byte position while in handler
    // ctl.abort(); // optionally stop parsing immediately without throwing
  });
  p.run();
  reg.destroy(p); // deregister
}

This API is especially great when parsing large JFR files with known events, as jafar uses compile-time code generation to improve speed and drastically reduce memory footprint (more on this in the excellent README). But jafar also has an untyped API (again, the example is from the documentation):

import io.jafar.parser.api.*;
import java.nio.file.Paths;

try (UntypedJafarParser p = JafarParser.newUntypedParser(Paths.get("/path/to/recording.jfr"))) {
  HandlerRegistration reg = p.handle((type, value) -> {
    if ("jdk.ExecutionSample".equals(type.getName())) {
      // You can retrieve the value by providing 'path' -> "eventThread", "javaThreadId"
      Object threadId = Values.get(value, "eventThread", "javaThreadId");
      // You can also get the value conveniently typed - for primitive values you need to use the boxed type in the call
      long threadIdLong = Values.as(value, Long.class, "eventThread", "javaThreadId");
      // use threadId ...
    }
  });
  p.run();
  reg.destroy(p);
}

It’s impressive what Jaroslav built, and I recommend that you take a closer look at this relatively new library.

Being able to read JFR files is great, but you might wonder how to write them yourself.

Writing JFR Events

Sadly, there is no built-in writer API in Java, but there is the JMC JFR writer API, which is developed by the OpenJDK project, albeit not by the current primary JFR file format maintainers.

To start writing a file, we first create a Recording object using the Recordings class:

Recording recording = Recordings.newRecording(outputStream, 
       r -> { /* configure settings */ });

The Recording class allows us to register event types and object types, and write events.

The following example writes two instances of the custom event MyEvent from above into a file and reads them back using the built-in JFR reader API (GitHub):

import jdk.jfr.consumer.RecordingFile;
import org.openjdk.jmc.flightrecorder.writer.TypesImpl;
import org.openjdk.jmc.flightrecorder.writer.api.*;

import java.io.FileOutputStream;
import java.io.IOException;
import java.nio.file.Path;

/**
 * Example demonstrating how to write a JFR file using JMC Writer API.
 */
public class JMCWriterExample {

    public static void main(String[] args) throws IOException {
        Path outputFile = Path.of("example.jfr");

        final long startTicks = 1;

        try (FileOutputStream fos = 
                     new FileOutputStream(outputFile.toFile())) {
            // Initialize a new recording
            Recording recording = Recordings.newRecording(fos, r -> {
                // Optional: configure recording settings
                // Ensure JDK type initialization
                r.withJdkTypeInitialization();
                // Set start ticks for timestamp consistency in nanoseconds
                // 1 seems to work best
                r.withStartTicks(startTicks);
            });

            // Register the event type
            Type myEventType = recording.registerType(
                "com.example.MyEvent",
                "jdk.jfr.Event",
                typeBuilder -> {
                    // Add the implicit startTime field
                    // the implicit fields must come first
                    typeBuilder.addField("startTime", 
                            TypesImpl.Builtin.LONG,
                            field ->
                                    field.addAnnotation(
                                            Types.JDK.ANNOTATION_TIMESTAMP, 
                                            "TICKS"));
                    // Add the custom field
                    typeBuilder.addField("myfield", 
                            Types.Builtin.STRING);
                }
            );

            // Write an event instance
            recording.writeEvent(myEventType.asValue(eventBuilder -> {
                // the fields have to be set in order of declaration
                eventBuilder.putField("startTime", System.nanoTime() - startTicks);
                eventBuilder.putField("myfield", "Hello from JMC!");
            }));

            // Write another event
            recording.writeEvent(myEventType.asValue(eventBuilder -> {
                eventBuilder.putField("startTime", System.nanoTime() - startTicks);
                eventBuilder.putField("myfield", "Another event");
            }));

            // Close the recording to finalize the file
            recording.close();
        }

        // Printing file contents for demonstration
        RecordingFile.readAllEvents(outputFile)
                .forEach(System.out::println);

        System.out.println("JFR file written to: " + outputFile);
    }
}

When we run this, we get something like:

com.example.MyEvent {
  startTime = 19:08:19.704 (2026-01-14)
  myfield = "Hello from JMC!"
}


com.example.MyEvent {
  startTime = 19:08:19.704 (2026-01-14)
  myfield = "Another event"
}


JFR file written to: example.jfr

The API is rather complicated to use, and the predefined complex types (e.g., stack traces, …) are only approximations of the real types. The RecordingImpl class is a good starting point to see how it can be used.

But what if you only want to modify existing events in an existing file?

Modifying Files with Basic-JFR-Processor

My new basic-jfr-processor library allows you to do precisely this. It’s built on top of the JMC writer API. It supports writing RecordedEvents from the built-in JFR reader API, including support for an event modifier class to allow you to drop events or modify values.

Removing a specific type of event from a file is as simple as:

// Create a modifier that drops events
JFREventModifier modifier = new JFREventModifier() {
    @Override
    public boolean shouldRemoveEvent(RecordedEvent event) {
        return event.getEventType().getName().equals("example.UserLogin");
    }
};

// Process the recording
JFRProcessor processor = new JFRProcessor(modifier, inputFile);
try (FileOutputStream out = new FileOutputStream(outputFile.toFile())) {
    processor.process(out).close();
}

The JFRProcessor code is also a good example of how to use the JMC writer API to create a file as close as possible to a JFR-generated file. Feel free to reuse the code in your own projects.

Conclusion

The JFR file format is not formally specified; however, JFR files can be read and written using both built-in and external libraries. This makes it possible to build a tool that redacts JFR files, including individual properties. In the last example of this blog post, you saw a basic version of such a tool. In the next blog post, I’ll cover a more complete version.

Thanks for coming along with me and see you next week.


The Fastest Way to get the Version of a Java Installation

Johannes Bechberger — Mon, 12 Jan 2026 08:16:06 +0000

Last week, I demonstrated that OpenJDK is faster than GraalVM Java, at least for obtaining the Java version. This even prompted the mighty Thomas Wuerthinger (creator of GraalVM) to react. But the measured ~20ms for the OpenJDK is still too slow for applications like execjar, where it could significantly increase the runtime of short-running CLI tools. In this week’s brief blog post, I’ll show you the fastest way to access the Java version.

The main performance issue is that calling java -version creates a process with a fairly large (around 38MB) maximum resident set size and uses a full command-line parser. But do we actually need to call the java binary to get the version?

TL;DR: I created the java-version tool, which can obtain the Java version in under a millisecond.

Basic Idea

No, we don’t: most Java installations have a release file that contains the relevant information in a machine-readable format. You can find this file in the main folder of the installation (./release when java is in ./bin).

For my SapMachine 25 installation, the file looks like:

IMPLEMENTOR="SAP SE"
IMPLEMENTOR_VERSION="SapMachine"
JAVA_RUNTIME_VERSION="25+36-LTS"
JAVA_VERSION="25"
JAVA_VERSION_DATE="2025-09-16"
LIBC="default"
...
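If you are already inside a JVM, reading this file takes only a few lines. The following is a sketch under a few assumptions: the class ReleaseFileSketch and its methods are made up for illustration, and every relevant line is assumed to have the KEY="value" shape shown above. Lines without an equals sign are simply skipped.

```java
import java.io.IOException;
import java.nio.file.Files;
import java.nio.file.Path;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;

final class ReleaseFileSketch {

    /** Maps .../bin/java to .../release, resolving symbolic links first. */
    static Path releaseFileFor(Path javaBinary) throws IOException {
        Path real = javaBinary.toRealPath();
        return real.getParent().getParent().resolve("release");
    }

    /** Parses KEY="value" lines into a map, stripping the surrounding quotes. */
    static Map<String, String> parse(List<String> lines) {
        Map<String, String> entries = new LinkedHashMap<>();
        for (String line : lines) {
            int eq = line.indexOf('=');
            if (eq < 0) {
                continue; // not a key-value line
            }
            String value = line.substring(eq + 1).strip();
            if (value.length() >= 2 && value.startsWith("\"") && value.endsWith("\"")) {
                value = value.substring(1, value.length() - 1);
            }
            entries.put(line.substring(0, eq).strip(), value);
        }
        return entries;
    }

    public static void main(String[] args) throws IOException {
        // expects the path to a java binary as the only argument
        Path release = releaseFileFor(Path.of(args[0]));
        System.out.println(parse(Files.readAllLines(release)).get("JAVA_VERSION"));
    }
}
```

Of course, starting a JVM just to run this would defeat the purpose, so it is mainly useful for tools that are already written in Java.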

The file is generated in the OpenJDK at build time via (source):

define create-info-file
  $(if $(JDK_ARCH_ABI_PROP_NAME), \
    $(call info-file-item, "SUN_ARCH_ABI", "$(JDK_ARCH_ABI_PROP_NAME)"))
  $(call info-file-item, "SOURCE", "$(strip $(SOURCE_REVISION))")
  $(call info-file-item, "IMPLEMENTOR", "$(COMPANY_NAME)")
  $(if $(VENDOR_VERSION_STRING), \
    $(call info-file-item, "IMPLEMENTOR_VERSION", "$(VENDOR_VERSION_STRING)"))
  $(call info-file-item, "JAVA_VERSION_DATE", "$(VERSION_DATE)")
  $(call info-file-item, "JAVA_RUNTIME_VERSION", "$(VERSION_STRING)")
  $(call info-file-item, "OS_NAME", "$(RELEASE_FILE_OS_NAME)")
  $(call info-file-item, "OS_ARCH", "$(RELEASE_FILE_OS_ARCH)")
  $(call info-file-item, "LIBC", "$(RELEASE_FILE_LIBC)")
endef

For GraalVM, the file looks similar.

Let’s implement a java-version tool with this knowledge.

Implementation

I chose to implement the tool in C++ to avoid the overhead of any runtime. The implementation is fairly straightforward, especially with the modern filesystem API (C++ 17):

#include <filesystem>
#include <fstream>
#include <iostream>
#include <string>

namespace fs = std::filesystem;

int main(int argc, char* argv[]) {
    if (argc != 2) {
        std::cerr << "Usage: " << argv[0] << " <path-to-java>\n";
        return 1;
    }

    // Resolve the path, following any symbolic links
    fs::path javaPath;
    try {
        javaPath = fs::canonical(argv[1]);
    } catch (const fs::filesystem_error& e) {
        std::cerr << "Failed to resolve path: " << e.what() << "\n";
        return 1;
    }

    // Expect .../bin/java
    if (javaPath.filename() != "java" ||
        javaPath.parent_path().filename() != "bin") {
        std::cerr << "Path does not end with /bin/java\n";
        return 1;
    }

    // ../release
    fs::path releasePath = javaPath.parent_path().parent_path() / "release";

    // Try to open the release file
    std::ifstream file(releasePath);
    if (!file) {
        std::cerr << "Failed to open " << releasePath << "\n";
        return 1;
    }

    const std::string key = "JAVA_VERSION=\"";
    std::string line;

    // Look for the JAVA_VERSION line
    while (std::getline(file, line)) {
        if (line.rfind(key, 0) == 0) { // starts with key
            auto start = key.size();
            auto end = line.find('"', start);
            if (end != std::string::npos) {
                std::cout << line.substr(start, end - start) << "\n";
                return 0;
            }
        }
    }

    std::cerr << "JAVA_VERSION not found\n";
    return 1;
}

You can find the whole file on GitHub and build it via:

g++ -std=c++17 java_version.cpp -o java-version

Usage

The usage is pretty simple:

./java-version `which java`
# or directly
./java-version java/25-sapmchn/bin/java

Benchmarks

The most important part of this blog post is, of course, the comparison with java -version (although you already saw it in the blog post OpenJDK is faster than GraalVM Java*). For the benchmark, I used hyperfine on my MacBook Pro M5:

Benchmark 1: java -version
  Time (mean ± σ):      19.8 ms ±   1.5 ms    [User: 11.1 ms, System: 10.7 ms]
  Range (min … max):    17.1 ms …  31.1 ms    1000 runs
 
Benchmark 2: java-version java/25-sapmchn/bin/java
  Time (mean ± σ):     925.1 µs ± 418.2 µs    [User: 466.3 µs, System: 373.8 µs]
  Range (min … max):   425.3 µs … 5137.1 µs    1000 runs
 
Summary
  java-version java/25-sapmchn/bin/java ran
   21.41 ± 9.81 times faster than java -version

It’s probably as fast as you can get.
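The large uncertainty in the summary line (± 9.81) may look surprising. It is consistent with standard error propagation for a ratio of two means, which I assume is what hyperfine applies here. The following sketch recomputes the summary from the rounded numbers in the output above; the class and method names are made up for illustration:

```java
import java.util.Locale;

final class SpeedupSketch {

    /** How many times faster the fast command is. */
    static double ratio(double slowMean, double fastMean) {
        return slowMean / fastMean;
    }

    /** Uncertainty of the ratio, combining both relative standard deviations. */
    static double ratioUncertainty(double slowMean, double slowSd,
                                   double fastMean, double fastSd) {
        double relative = Math.sqrt(
                Math.pow(slowSd / slowMean, 2) + Math.pow(fastSd / fastMean, 2));
        return ratio(slowMean, fastMean) * relative;
    }

    public static void main(String[] args) {
        // all values in milliseconds, taken from the hyperfine output
        double javaMean = 19.8, javaSd = 1.5;
        double toolMean = 0.9251, toolSd = 0.4182;
        System.out.printf(Locale.ROOT, "%.2f ± %.2f%n",
                ratio(javaMean, toolMean),
                ratioUncertainty(javaMean, javaSd, toolMean, toolSd));
    }
}
```

The result is roughly 21.4 ± 9.8. Almost all of the uncertainty stems from the relative spread of the java-version runs (418.2 µs on a mean of 925.1 µs), which is typical for measurements below a millisecond.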

Conclusion

Sometimes the fastest solution is to find the information you need in a different place. I will admit that the tool solves a problem that most people don’t have, but solving it helped improve the startup time of my tools.

I hope you still found the second part of my mini-series on java -version interesting, and I can assure you it was the last. The next blog post will be on reading and writing JFR files programmatically.

P.S.: Stay warm in the cold and let the flow guide you.


OpenJDK is faster than GraalVM Java*

Johannes Bechberger — Fri, 09 Jan 2026 08:08:31 +0000

Well, we all know that the most crucial feature of the JVM runtime is the -version output. So how does the OpenJDK (in the form of SapMachine) compare with GraalVM? It’s significantly faster. Using hyperfine, we can see that GraalVM 25 CE takes almost twice as long to emit the version number as a regular SapMachine 25 on my MacBook Pro M5:

The slowness of java -version was actually one of the performance issues of the tool I showcased in How to Build an Executable from a JAR using ExecJAR, as it originally used java -version a lot to check the Java version constraint.

Is this relevant? Not really. But the same holds for most microbenchmarks, and for benchmarks in general when they are taken out of context. You should not generalize from small benchmarks; modern systems are complex.

You can find some bigger, non-version-related benchmarks comparing the different JVMs, for example, at https://ionutbalosin.com/2024/02/jvm-performance-comparison-for-jdk-21/.

Join me next week for a blog post on something different and learn how to check the version of a Java installation even faster in under one millisecond:

P.S.: I just ran some more benchmarks: OpenJDK 25 is 18% faster than OpenJDK 17 and 21 and a whopping 84% faster than OpenJDK 11. Upgrade now!

P.P.S.: As many people (Thomas Wuerthinger, Fabio Niebhaus, Volker Simonis, and multiple of my SapMachine colleagues) pointed out, the differences between OpenJDK and GraalVM are due to the GraalVM initializing the JVM Compiler Interface (JVMCI). The difference between the two becomes negligible when running OpenJDK with enabled JVMCI (initialize the JIT at the beginning):


How to Build an Executable from a JAR using ExecJAR

Johannes Bechberger — Mon, 05 Jan 2026 09:44:21 +0000

In my last blog post, I covered a new tool called jstall, which enables you to quickly check on a Java application. Because it was tiresome to always call the tool via java -jar jstall, I looked for a way to create executables directly from JARs, inspired by async-profiler’s build system. And I, of course, went down a rabbit hole. In this blog post, I’ll show you how to use execjar to easily create your own executable JARs that you can execute directly on the command line while still being valid JARs.

TL;DR: execjar is a CLI and Maven plugin that enables you to create executables from JARs by just adding a few lines to your Maven file:


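<plugin>
  <groupId>me.bechberger</groupId>
  <artifactId>execjar</artifactId>
  <version>0.1.1</version>
  <executions>
    <execution>
      <goals>
        <goal>execjar</goal>
      </goals>
    </execution>
  </executions>
</plugin>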

When your project is called jstall, this creates an executable with the same name that you can execute directly via ./jstall.

Important: The resulting executable is compatible only with UNIX (Linux and macOS) environments.

However, before I delve into the in-depth configuration options of my new tool, I’d like to provide some background on its implementation.

Idea

Yes, we could use GraalVM’s native-image to create binaries from JARs. While this has some benefits, like reduced startup time and runtime memory usage, it also has some problems:

- It creates platform-dependent binaries
  - Platforms like PowerPC (which is essential for my SapMachine team) are not supported
  - Building and shipping the binaries is cumbersome
- The binaries get fairly large and take time to build
- Native-image requires special configuration to support Java features like reflection, and might not support all features
  - I had to use fallbacks for discovering the available JVMs on the system (fallback is jps) or getting a thread dump via a JMX connection (fallback is jstack)

So it’s not really suitable for small tools like jstall or async-profiler’s jfrconv. But what is the alternative? We can simply prepend a launcher script to the JAR file, which then executes the JAR file via a JVM found on the current machine (automatically located by searching a few predefined locations). So literally:

cat launcher.sh > jstall
cat jstall.jar >> jstall
# later (and omitting some flags)
java -jar jstall

But why does this work?

Background

It works because Java’s JARs are just ZIP files:

JAR file is a file format based on the popular ZIP file format and is used for aggregating many files into one. A JAR file is essentially a zip file that contains an optional META-INF directory. A JAR file can be created by the command-line jar tool, or by using the java.util.jar API in the Java platform. There is no restriction on the name of a JAR file, it can be any legal file name on a particular platform.
JAR File Specification

ZIP files have an interesting property: their central directory, which lists all the files with their relative offsets, is placed at the end of the file:

ZIP-64 Internal Layout by Niklaus Aeschbache on Wikipedia

This is really good, because shell files are read from the beginning, allowing us to simply prepend a shell script.
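You can check this property with plain java.util.zip. The following sketch (class and method names are made up, and the tiny script is only a stand-in for a real launcher) writes a shell script followed by a ZIP archive into one file and then reads an entry back via ZipFile, which locates the central directory from the end of the file:

```java
import java.io.ByteArrayOutputStream;
import java.io.IOException;
import java.io.InputStream;
import java.io.OutputStream;
import java.nio.charset.StandardCharsets;
import java.nio.file.Files;
import java.nio.file.Path;
import java.util.zip.ZipEntry;
import java.util.zip.ZipFile;
import java.util.zip.ZipOutputStream;

final class PrependLauncherSketch {

    /** Creates an in-memory ZIP archive with a single text entry. */
    static byte[] zipWithEntry(String name, String content) throws IOException {
        ByteArrayOutputStream bytes = new ByteArrayOutputStream();
        try (ZipOutputStream zip = new ZipOutputStream(bytes)) {
            zip.putNextEntry(new ZipEntry(name));
            zip.write(content.getBytes(StandardCharsets.UTF_8));
            zip.closeEntry();
        }
        return bytes.toByteArray();
    }

    /** Writes script + ZIP into one file and reads the entry back from it. */
    static String readFromPrefixedZip(String script, String name, String content)
            throws IOException {
        Path file = Files.createTempFile("prefixed", ".zip");
        try {
            try (OutputStream out = Files.newOutputStream(file)) {
                out.write(script.getBytes(StandardCharsets.UTF_8)); // launcher first
                out.write(zipWithEntry(name, content));             // archive after it
            }
            try (ZipFile zip = new ZipFile(file.toFile());
                 InputStream in = zip.getInputStream(zip.getEntry(name))) {
                return new String(in.readAllBytes(), StandardCharsets.UTF_8);
            }
        } finally {
            Files.deleteIfExists(file);
        }
    }

    public static void main(String[] args) throws IOException {
        System.out.println(readFromPrefixedZip("#!/bin/sh\nexit 0\n", "hello.txt", "hi"));
    }
}
```

Note that this only works with readers that start from the central directory; a streaming reader like ZipInputStream begins at the first byte and would stumble over the script.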

While the ZIP specification doesn’t explicitly tell us, many ZIP implementations assume that a file starts with a local file header and its magic number 0x04034b50. The OpenJDK does the same (source):

if (JAR_CHECKING_ENABLED && !zipAccess.startsWithLocHeader(jar)){
    IOException x = new IOException("Invalid Jar file");
    // ...
}

But luckily, we can disable this by passing the sun.misc.URLClassPath.disableJarChecking property via the command-line argument -Dsun.misc.URLClassPath.disableJarChecking.
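What such a check looks for is easy to sketch: the magic number 0x04034b50 is stored in little-endian byte order, so a regular archive starts with the bytes 'P', 'K', 3, 4. The following is an illustration with made-up names, not the OpenJDK implementation. It shows that the check passes for a plain archive and fails as soon as a script is prepended:

```java
import java.io.ByteArrayOutputStream;
import java.io.IOException;
import java.nio.charset.StandardCharsets;
import java.util.zip.ZipEntry;
import java.util.zip.ZipOutputStream;

final class LocHeaderSketch {

    /** Checks for "PK\003\004", the little-endian form of 0x04034b50. */
    static boolean startsWithLocHeader(byte[] data) {
        return data.length >= 4
                && data[0] == 'P' && data[1] == 'K' && data[2] == 3 && data[3] == 4;
    }

    /** Creates a small in-memory ZIP archive with one entry. */
    static byte[] smallZip() throws IOException {
        ByteArrayOutputStream bytes = new ByteArrayOutputStream();
        try (ZipOutputStream zip = new ZipOutputStream(bytes)) {
            zip.putNextEntry(new ZipEntry("hello.txt"));
            zip.write("hi".getBytes(StandardCharsets.UTF_8));
            zip.closeEntry();
        }
        return bytes.toByteArray();
    }

    /** Returns the archive with the given script placed in front of it. */
    static byte[] withPrefix(String script, byte[] zip) throws IOException {
        ByteArrayOutputStream bytes = new ByteArrayOutputStream();
        bytes.write(script.getBytes(StandardCharsets.UTF_8));
        bytes.write(zip);
        return bytes.toByteArray();
    }

    public static void main(String[] args) throws IOException {
        byte[] zip = smallZip();
        System.out.println("plain archive:   " + startsWithLocHeader(zip));
        System.out.println("with a launcher: "
                + startsWithLocHeader(withPrefix("#!/bin/sh\n", zip)));
    }
}
```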

There is only one problem left: how do we call java -jar ... in a way that prevents the shell from trying to parse the JAR portion of the file? We use the shell-builtin exec command:

Many Unix shells also offer a builtin exec command that replaces the shell process with the specified program.^[1][7] Wrapper scripts often use this command to run a program (either directly or through an interpreter or virtual machine) after setting environment variables or other configuration. By using exec, the resources used by the shell program do not need to stay in use after the program is started.
WIKIPEDIA

Features of execjar

The interesting part of the execjar project is not the combination of a shell script and a JAR, but the auto-generated shell script itself. The execjar tool generates a shell script that

- finds the Java binary in various locations,
- honors version constraints, allowing you to explicitly specify a minimal and maximum version of the Java binary. The shell script then attempts to find a suitable JVM on the system. Did you know that calling java -version takes more than 100ms? You can instead just read the release file that ships with any JRE.
- sets custom environment variables and system properties,
- prepends and appends custom arguments to the list of passed arguments.
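To illustrate the version constraint check: the real check lives in the generated shell script, so the following Java code is only a sketch of the idea with made-up names. It assumes that the version string comes from the JAVA_VERSION entry of the release file and that comparing the feature version (8, 17, 25, ...) is sufficient:

```java
final class VersionConstraintSketch {

    /** Extracts the feature version: "25" -> 25, "17.0.9" -> 17, "1.8.0_392" -> 8. */
    static int featureVersion(String javaVersion) {
        String[] parts = javaVersion.split("[._+-]");
        int first = Integer.parseInt(parts[0]);
        // versions before Java 9 use the 1.x scheme
        return (first == 1 && parts.length > 1) ? Integer.parseInt(parts[1]) : first;
    }

    /** Checks whether the version lies within the inclusive [min, max] range. */
    static boolean satisfies(String javaVersion, int min, int max) {
        int feature = featureVersion(javaVersion);
        return feature >= min && feature <= max;
    }

    public static void main(String[] args) {
        System.out.println(satisfies("25", 17, 25));        // within the range
        System.out.println(satisfies("1.8.0_392", 11, 25)); // too old
    }
}
```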

And, of course, it’s really easy to use.

Usage of the Maven Plugin

It is essential to note that the packaged JARs must include all dependencies to be executable. Therefore, you must use the Maven plugin in conjunction with other plugins that create a JAR with dependencies. There is a simple example-project in the execjar repository, whose pom.xml shows you exactly this:



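<plugin>
    <artifactId>maven-assembly-plugin</artifactId>
    <version>3.6.0</version>
    <configuration>
        <descriptorRefs>
            <descriptorRef>jar-with-dependencies</descriptorRef>
        </descriptorRefs>
        <archive>
            <manifest>
                <mainClass>com.example.HelloExecJar</mainClass>
            </manifest>
        </archive>
    </configuration>
    <executions>
        <execution>
            <id>make-assembly</id>
            <phase>package</phase>
            <goals>
                <goal>single</goal>
            </goals>
        </execution>
    </executions>
</plugin>

<plugin>
    <groupId>me.bechberger</groupId>
    <artifactId>execjar</artifactId>
    <version>0.1.1</version>
    <executions>
        <execution>
            <goals>
                <goal>execjar</goal>
            </goals>
        </execution>
    </executions>
</plugin>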

And voila, every mvn package also produces an executable. You can, of course, configure various settings, such as the minimum Java version (defaulting to the compile target version) and the JAR file to process. You can find all about these options in the README of the execjar project.

Usage of the Command Line Tool

There is also a command-line tool that you can directly download from the releases page:

# Using the script (recommended for Unix-like systems)
chmod +x execjar
./execjar myapp.jar

# Using the JAR directly
java -jar execjar.jar myapp.jar -o myapp

This tool supports the same options as the Maven plugin, just call ./execjar --help to get a list of them.

Now on to some benchmarks.

Benchmark Comparison with Native-Image

In the following benchmarks, I’m using the jstall project as an example, as it is precisely the target use case for execjar: It’s a small CLI tool that solves one specific use case.

To make it more interesting, I compare the runtime performance with native-image, the natural competitor. As you’ll see shortly, the performance difference is not as big as expected. However, I want to mention one caveat: I’m not an expert with native-image, and the numbers I obtain should in no way be generalised.

Setup

I’m using hyperfine to run multiple executions on my MacBook Pro M5. The original jstall JAR with all dependencies is 4.7MB in size. We’re using GraalVM 25 CE for both native-image and as a base JVM, to simplify running benchmarks, but this should be comparable to using OpenJDK.

The jstall JAR depends on the picocli library for command-line argument parsing and transitively depends on Jackson (via jthreaddump). Hence, the JAR includes a lot of code, despite being a rather small application.

Maybe I could remove the Jackson dependency of jthreaddump, or make it optional, but that’s for another day.

Build-Time Performance

I start by comparing the build-time, but exclude the overhead of Maven and only compare the call to native-image with the equivalent execjar call:

> hyperfine "... native-image ..." "... execjar ..." --warmup 1
Benchmark 1: native-image
  Time (mean ± σ):     22.183 s ±  1.238 s    [User: 150.302 s, System: 6.544 s]
  Range (min … max):   20.834 s … 24.466 s    10 runs
 
Benchmark 2: execjar
  Time (mean ± σ):     220.7 ms ±  62.5 ms    [User: 389.2 ms, System: 62.4 ms]
  Range (min … max):   175.5 ms … 416.9 ms    15 runs

 
Summary
  execjar ran
  100.51 ± 29.01 times faster than native-image

This is expected, as execjar doesn’t do that much and mainly uses handlebars.java to create the launcher shell script, verifying that the passed JAR has a main class and creating the final executable. Just calling execjar --help takes around 102ms.

The size difference between the generated binaries is significant: while the execjar-built executable has roughly the same size as the original JAR (4.8 MB), the native-image-built executable is 6.5 times larger, at around 31 MB.
Now to the runtime performance:

Basic Runtime Performance

Running jstall without arguments lists the supported commands and VMs on the system; this is as simple as a jstall command can be. Let’s see how both executables compare:

> hyperfine "./nativejstall" "./execjstall" --warmup 5 
Benchmark 1: ./nativejstall
  Time (mean ± σ):      66.6 ms ±   1.5 ms    [User: 97.0 ms, System: 29.6 ms]
  Range (min … max):    63.9 ms …  70.8 ms    42 runs
 
Benchmark 2: ./execjstall
  Time (mean ± σ):     143.1 ms ±   2.1 ms    [User: 253.1 ms, System: 41.3 ms]
  Range (min … max):   140.8 ms … 148.5 ms    20 runs
 
Summary
  ./nativejstall ran
    2.15 ± 0.06 times faster than ./execjstall

As expected, the native-image-built executable runs far faster once warmed up. The first run is different (which is why we use hyperfine with --warmup 5): there, the native-image-built executable takes around 240ms, while the execjar-built executable is faster at around 200ms, probably reflecting the larger file size of the former.

Is this relevant? For the CLI user, it’s probably noticeable, but 140ms is still fast for users (the reaction time of humans is higher).

JStall Deadlock Performance

A typical use case with jstall is to check whether an application is currently in a deadlock, like the deadlock example from last week’s blog post:

./jstall deadlock Dead

This uses the newly introduced application filtering feature, which allows the user to identify JVMs by a part of their label.

Here the native-image-built executable is again about twice as fast, but the absolute difference of roughly 125ms is hardly noticeable for the user:

hyperfine "./nativejstall deadlock Dead" "./execjstall deadlock Dead" -i 
Benchmark 1: ./nativejstall deadlock Dead
  Time (mean ± σ):     128.1 ms ±   4.2 ms    [User: 195.6 ms, System: 57.6 ms]
  Range (min … max):   123.2 ms … 138.7 ms    21 runs
 
Benchmark 2: ./execjstall deadlock Dead
  Time (mean ± σ):     253.9 ms ±   2.1 ms    [User: 455.2 ms, System: 79.2 ms]
  Range (min … max):   251.4 ms … 258.5 ms    11 runs
 
Summary
  ./nativejstall deadlock Dead ran
    1.98 ± 0.07 times faster than ./execjstall deadlock Dead

The most-used jstall command is status, where the picture looks different:

JStall Status Performance

This is because this command takes two thread dumps with a 5-second sleep in between, so the sleep dominates the total runtime:

> hyperfine "./nativejstall status Dead" "./execjstall status Dead" -i
Benchmark 1: ./nativejstall status Dead
  Time (mean ± σ):      5.220 s ±  0.009 s    [User: 0.319 s, System: 0.097 s]
  Range (min … max):    5.210 s …  5.241 s    10 runs
 
Benchmark 2: ./execjstall status Dead
  Time (mean ± σ):      5.366 s ±  0.006 s    [User: 0.671 s, System: 0.126 s]
  Range (min … max):    5.355 s …  5.374 s    10 runs
 
Summary
  ./nativejstall status Dead ran
    1.03 ± 0.00 times faster than ./execjstall status Dead

So what do these benchmarks tell us? Native-image creates faster binaries, but for small CLI tools like jstall, the difference is probably negligible.
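The "times faster" figures in the hyperfine summaries are simply the ratios of the mean runtimes. The following small sketch recomputes them from the means reported above; the class and method names are made up for this example.

```java
import java.util.Locale;

class Speedup {

    /** Ratio of the slower mean to the faster mean, i.e. "x times faster". */
    static double speedup(double fasterMean, double slowerMean) {
        return slowerMean / fasterMean;
    }

    /** Formats a ratio with two decimals, independent of the system locale. */
    static String format(double ratio) {
        return String.format(Locale.ROOT, "%.2f", ratio);
    }

    public static void main(String[] args) {
        // Mean times copied from the hyperfine output (native-image vs. execjar)
        System.out.println("no arguments: " + format(speedup(66.6, 143.1)));  // in ms
        System.out.println("deadlock:     " + format(speedup(128.1, 253.9))); // in ms
        System.out.println("status:       " + format(speedup(5.220, 5.366))); // in s
    }
}
```

In absolute terms, the gap stays between roughly 75 and 150 ms in all three benchmarks, which is why it almost disappears in the ratio for the status command with its 5-second sleep.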

Conclusion

The execjar project provides a small tool that enables us to exploit an interesting quirk of JARs to create executable files for Java tools. This helps us to make tools like jstall more usable, allowing us to place them directly in PATH. It’s part of my goal to build a suite of small tools that solve specific problems, improving my day-to-day developer experience.

I hope you find execjar useful. See you in the next week or two for another blog post.

This article is part of my work in the SapMachine team at SAP, making profiling and debugging easier for everyone.

P.S.: Happy New Year…

The post How to Build an Executable from a JAR using ExecJAR appeared first on Mostly nerdless.

Quickly Inspect your Java Application with JStall

Johannes Bechberger — Tue, 30 Dec 2025 11:07:00 +0000

Welcome to the last blog post of the year. Last week, I discussed the limitations of custom JFR events. This week, I’ll also be covering a profiling-related topic and showcasing a tiny tool called JStall.

I hope I’m not the only one who sometimes wonders, “What is my Java application doing right now?”, when there is no output to look at. Yes, you could perform a simple thread dump via jstack, but it is hard to understand which threads are actually consuming CPU and making any sort of progress. This is where my tiny tool called JStall comes in:

JStall is a small command-line tool for one-shot inspection of running JVMs using thread dumps and short, on-demand profiling. The tool essentially takes multiple thread dumps of your application and uses the per-thread CPU-time information to find the most CPU-time-consuming Java threads.
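The core idea, ranking threads by the CPU time they consumed between two dumps, can be sketched in a few lines. This is only an illustration under simplifying assumptions, not JStall's code: the class MostWork, the record Usage, and the input format (a map from thread name to cumulative CPU seconds per dump) are hypothetical.

```java
import java.util.ArrayList;
import java.util.Comparator;
import java.util.List;
import java.util.Map;

class MostWork {

    /** CPU time a thread consumed between two dumps, plus its share of one core. */
    record Usage(String thread, double cpuSeconds, double coreUtilizationPercent) {}

    /**
     * Ranks threads by the CPU time they consumed between two dumps.
     * Both maps go from thread name to cumulative CPU time in seconds.
     */
    static List<Usage> rank(Map<String, Double> first, Map<String, Double> second,
                            double elapsedSeconds) {
        List<Usage> result = new ArrayList<>();
        for (Map.Entry<String, Double> entry : second.entrySet()) {
            Double before = first.get(entry.getKey());
            if (before == null) {
                continue; // thread started after the first dump, skipped in this sketch
            }
            double delta = entry.getValue() - before;
            result.add(new Usage(entry.getKey(), delta, 100.0 * delta / elapsedSeconds));
        }
        result.sort(Comparator.comparingDouble(Usage::cpuSeconds).reversed());
        return result;
    }

    public static void main(String[] args) {
        // Made-up numbers, not taken from a real thread dump
        Map<String, Double> first = Map.of("worker-0", 10.0, "worker-1", 8.0, "main", 1.0);
        Map<String, Double> second = Map.of("worker-0", 14.0, "worker-1", 9.0, "main", 1.0);
        for (Usage usage : rank(first, second, 5.0)) {
            System.out.printf("%s: %.2fs CPU, %.1f%% of one core%n",
                    usage.thread(), usage.cpuSeconds(), usage.coreUtilizationPercent());
        }
    }
}
```

A thread that used 4 seconds of CPU time during a 5-second window kept one core about 80% busy, which is the kind of number a per-thread core utilization expresses.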

First, download the JStall executable from the GitHub releases page. Let us then start by finding the currently running JVMs:

> ./jstall
Usage: jstall   [options]

Available commands:
  status     - Show overall status (deadlocks + most active threads)
  deadlock   - Check for deadlocks
  most-work  - Show threads doing the most work
  flame      - Generate flame graph
  threads    - List all threads

Available JVMs:
  7153 ./jstall
  1223 
  8136 ./renaissance-gpl-0.16.0.jar
  6138 org.jetbrains.idea.maven.server.RemoteMavenServer36
  5597 DeadlockDemo
  49294 com.intellij.idea.Main

This provides us with a list of the available commands, as well as a list of JVM processes and their corresponding main classes. Let’s start by checking for deadlocks:

Checking for Deadlocks

Process 5597 is an instance of the DeadlockDemo program that we saw running above. Let’s obtain the status (a combination of deadlock and most-work analysis):

> ./jstall 5597
=== deadlock ===
Deadlock detected:

Found one Java-level deadlock:
=============================
"DeadlockThread-1":
  waiting to lock monitor 0x00006000029215f0 (object 0x000000052f817880, a java.lang.Object),
  which is held by "DeadlockThread-2"

"DeadlockThread-2":
  waiting to lock monitor 0x00006000029280d0 (object 0x000000052f817878, a java.lang.Object),
  which is held by "DeadlockThread-1"

Java stack information for the threads listed above:
===================================================
"DeadlockThread-1":
        at DeadlockDemo.lambda$main$0(scratch_2.java:15)
        - waiting to lock <0x000000052f817880> (a java.lang.Object)
        - locked <0x000000052f817878> (a java.lang.Object)
        at DeadlockDemo$$Lambda/0x000007fc01040400.run(Unknown Source)
        at java.lang.Thread.runWith(java.base@25/Thread.java:1487)
        at java.lang.Thread.run(java.base@25/Thread.java:1474)
"DeadlockThread-2":
        at DeadlockDemo.lambda$main$1(scratch_2.java:26)
        - waiting to lock <0x000000052f817878> (a java.lang.Object)
        - locked <0x000000052f817880> (a java.lang.Object)
        at DeadlockDemo$$Lambda/0x000007fc01040800.run(Unknown Source)
        at java.lang.Thread.runWith(java.base@25/Thread.java:1487)
        at java.lang.Thread.run(java.base@25/Thread.java:1474)

Found one Java-level deadlock:
=============================
"DeadlockThread-3":
  waiting to lock monitor 0x0000600002938000 (object 0x000000052f817890, a java.lang.Object),
  which is held by "DeadlockThread-4"

"DeadlockThread-4":
  waiting to lock monitor 0x0000600002934000 (object 0x000000052f817888, a java.lang.Object),
  which is held by "DeadlockThread-3"

Java stack information for the threads listed above:
===================================================
"DeadlockThread-3":
        at DeadlockDemo.lambda$main$2(scratch_2.java:38)
        - waiting to lock <0x000000052f817890> (a java.lang.Object)
        - locked <0x000000052f817888> (a java.lang.Object)
        at DeadlockDemo$$Lambda/0x000007fc01040c00.run(Unknown Source)
        at java.lang.Thread.runWith(java.base@25/Thread.java:1487)
        at java.lang.Thread.run(java.base@25/Thread.java:1474)
"DeadlockThread-4":
        at DeadlockDemo.lambda$main$3(scratch_2.java:49)
        - waiting to lock <0x000000052f817888> (a java.lang.Object)
        - locked <0x000000052f817890> (a java.lang.Object)
        at DeadlockDemo$$Lambda/0x000007fc01041000.run(Unknown Source)
        at java.lang.Thread.runWith(java.base@25/Thread.java:1487)
        at java.lang.Thread.run(java.base@25/Thread.java:1474)

Found 2 deadlocks.

=== most-work ===
Top threads by activity (2 dumps):
Combined CPU time: 0.00s, Elapsed time: 5.10s (0.0% overall utilization)

1. Monitor Deflation Thread
   CPU time: 0.00s (64.8% of total)
   Core utilization: 0.0%
   States: RUNNABLE: 100.0%
2. Service Thread
   CPU time: 0.00s (14.8% of total)
   Core utilization: 0.0%
   States: RUNNABLE: 100.0%
3. Attach Listener
   CPU time: 0.00s (13.6% of total)
   Core utilization: 0.0%
   States: RUNNABLE: 100.0%

For this, JStall obtained two thread dumps and analyzed them. JStall found two deadlocks and identified that the top CPU-time-consuming threads are JVM-related.
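For comparison, a JVM can also detect such monitor deadlocks from the inside via the standard ThreadMXBean. The sketch below is not how JStall works (it inspects other JVMs from the outside through thread dumps); it creates a two-thread deadlock and then polls findDeadlockedThreads. The class name, the helper methods, and the thread names are chosen just for this example.

```java
import java.lang.management.ManagementFactory;
import java.lang.management.ThreadMXBean;
import java.util.concurrent.CountDownLatch;

class DeadlockCheck {

    /** Starts two daemon threads that take the same two locks in opposite order. */
    static void startDeadlock() {
        Object lockA = new Object();
        Object lockB = new Object();
        CountDownLatch bothHoldFirstLock = new CountDownLatch(2);
        startThread("DeadlockThread-1", lockA, lockB, bothHoldFirstLock);
        startThread("DeadlockThread-2", lockB, lockA, bothHoldFirstLock);
    }

    private static void startThread(String name, Object first, Object second,
                                    CountDownLatch bothHoldFirstLock) {
        Thread thread = new Thread(() -> {
            synchronized (first) {
                bothHoldFirstLock.countDown();
                try {
                    bothHoldFirstLock.await(); // wait until the other thread holds its lock
                } catch (InterruptedException e) {
                    return;
                }
                synchronized (second) { // never acquired: the other thread holds it
                    System.out.println(name + " got both locks");
                }
            }
        }, name);
        thread.setDaemon(true); // let the JVM exit despite the deadlock
        thread.start();
    }

    /** Polls the JVM's own deadlock detection; returns the number of deadlocked threads. */
    static int countDeadlockedThreads(long timeoutMillis) {
        ThreadMXBean threads = ManagementFactory.getThreadMXBean();
        long deadline = System.currentTimeMillis() + timeoutMillis;
        while (System.currentTimeMillis() < deadline) {
            long[] ids = threads.findDeadlockedThreads();
            if (ids != null) {
                return ids.length;
            }
            try {
                Thread.sleep(50);
            } catch (InterruptedException e) {
                Thread.currentThread().interrupt();
                return 0;
            }
        }
        return 0;
    }

    public static void main(String[] args) {
        startDeadlock();
        System.out.println("Deadlocked threads: " + countDeadlockedThreads(5_000));
    }
}
```

The latch makes sure that both threads hold their first lock before either one tries to take the second, so the deadlock does not depend on timing.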

Listing All Threads

Using the threads command, we can take a look at all threads:

> ./jstall threads 5597
Threads (2 dumps):
Combined CPU time: 0.00s, Elapsed time: 5.10s (0.0% overall utilization)

THREAD                    CPU TIME  CPU %  STATES         TOP STACK FRAME                                   
------------------------  --------  -----  -------------  --------------------------------------------------
Monitor Deflation Thread     0.00s  65.2%  RUNNABLE                                                         
Service Thread               0.00s  18.5%  RUNNABLE                                                         
Attach Listener              0.00s   7.6%  RUNNABLE                                                         
C1 CompilerThread0           0.00s   4.3%  RUNNABLE                                                         
C2 CompilerThread0           0.00s   4.3%  RUNNABLE                                                         
Common-Cleaner               0.00s   0.0%  TIMED_WAITING  java.lang.Object.wait0                            
DeadlockThread-1             0.00s   0.0%  BLOCKED        DeadlockDemo.lambda$main$0                        
DeadlockThread-2             0.00s   0.0%  BLOCKED        DeadlockDemo.lambda$main$1                        
DeadlockThread-3             0.00s   0.0%  BLOCKED        DeadlockDemo.lambda$main$2                        
DeadlockThread-4             0.00s   0.0%  BLOCKED        DeadlockDemo.lambda$main$3                        
Finalizer                    0.00s   0.0%  WAITING        java.lang.Object.wait0                            
Notification Thread          0.00s   0.0%  RUNNABLE                                                         
Reference Handler            0.00s   0.0%  RUNNABLE       java.lang.ref.Reference.waitForReferencePending...
Signal Dispatcher            0.00s   0.0%  RUNNABLE                                                         
main                         0.00s   0.0%  WAITING        java.lang.Object.wait0

This provides a concise overview of the app’s progress.

Status of a Renaissance Instance

Let’s look at process 8136, which is an instance of the Renaissance benchmark suite.

> ./jstall 8136
=== most-work ===
Top threads by activity (2 dumps):
Combined CPU time: 13.34s, Elapsed time: 5.11s (261.4% overall utilization)

1. Executor task launch worker for task 0.0 in stage 53.0 (TID 75)
   CPU time: 4.14s (31.1% of total)
   Core utilization: 81.2%
   States: RUNNABLE: 100.0%
   Common stack prefix:
     org.apache.spark.util.collection.SizeTrackingAppendOnlyMap.changeValue(SizeTrackingAppendOnlyMap.scala:32)
     org.apache.spark.util.collection.ExternalSorter.insertAll(ExternalSorter.scala:200)
     org.apache.spark.shuffle.sort.SortShuffleWriter.write(SortShuffleWriter.scala:63)
     org.apache.spark.shuffle.ShuffleWriteProcessor.write(ShuffleWriteProcessor.scala:59)
     org.apache.spark.scheduler.ShuffleMapTask.runTask(ShuffleMapTask.scala:104)
     org.apache.spark.scheduler.ShuffleMapTask.runTask(ShuffleMapTask.scala:54)
     org.apache.spark.TaskContext.runTaskWithListeners(TaskContext.scala:166)
     org.apache.spark.scheduler.Task.run(Task.scala:141)
     org.apache.spark.executor.Executor$TaskRunner.$anonfun$run$4(Executor.scala:620)
     org.apache.spark.executor.Executor$TaskRunner$$Lambda/0x0000030001959c00.apply
   ... (8 more lines)

2. Executor task launch worker for task 1.0 in stage 53.0 (TID 76)
   CPU time: 4.12s (30.9% of total)
   Core utilization: 80.8%
   States: RUNNABLE: 100.0%
   Common stack prefix:
     org.apache.spark.util.collection.SizeTrackingAppendOnlyMap.changeValue(SizeTrackingAppendOnlyMap.scala:32)
     org.apache.spark.util.collection.ExternalSorter.insertAll(ExternalSorter.scala:200)
     org.apache.spark.shuffle.sort.SortShuffleWriter.write(SortShuffleWriter.scala:63)
     org.apache.spark.shuffle.ShuffleWriteProcessor.write(ShuffleWriteProcessor.scala:59)
     org.apache.spark.scheduler.ShuffleMapTask.runTask(ShuffleMapTask.scala:104)
     org.apache.spark.scheduler.ShuffleMapTask.runTask(ShuffleMapTask.scala:54)
     org.apache.spark.TaskContext.runTaskWithListeners(TaskContext.scala:166)
     org.apache.spark.scheduler.Task.run(Task.scala:141)
     org.apache.spark.executor.Executor$TaskRunner.$anonfun$run$4(Executor.scala:620)
     org.apache.spark.executor.Executor$TaskRunner$$Lambda/0x0000030001959c00.apply
   ... (8 more lines)

3. Executor task launch worker for task 2.0 in stage 53.0 (TID 77)
   CPU time: 4.06s (30.4% of total)
   Core utilization: 79.5%
   States: RUNNABLE: 100.0%
   Common stack prefix:
     org.apache.spark.util.collection.SizeTrackingAppendOnlyMap.changeValue(SizeTrackingAppendOnlyMap.scala:32)
     org.apache.spark.util.collection.ExternalSorter.insertAll(ExternalSorter.scala:200)
     org.apache.spark.shuffle.sort.SortShuffleWriter.write(SortShuffleWriter.scala:63)
     org.apache.spark.shuffle.ShuffleWriteProcessor.write(ShuffleWriteProcessor.scala:59)
     org.apache.spark.scheduler.ShuffleMapTask.runTask(ShuffleMapTask.scala:104)
     org.apache.spark.scheduler.ShuffleMapTask.runTask(ShuffleMapTask.scala:54)
     org.apache.spark.TaskContext.runTaskWithListeners(TaskContext.scala:166)
     org.apache.spark.scheduler.Task.run(Task.scala:141)
     org.apache.spark.executor.Executor$TaskRunner.$anonfun$run$4(Executor.scala:620)
     org.apache.spark.executor.Executor$TaskRunner$$Lambda/0x0000030001959c00.apply
   ... (8 more lines)

Here, you see the top three (configurable) CPU-time-consuming threads over the roughly 5 seconds between the two thread dumps, along with the stack trace parts common to both thread dumps.

Using the threads command, you can again see all threads:

> ./jstall threads 8136
Threads (2 dumps):
Combined CPU time: 13.12s, Elapsed time: 5.11s (256.8% overall utilization)

THREAD                                              CPU TIME  CPU %  STATES                       TOP STACK FRAME                                   
--------------------------------------------------  --------  -----  ---------------------------  --------------------------------------------------
Executor task launch worker for task 1.0 in sta...     4.16s  31.7%  RUNNABLE: 50%, BLOCKED: 50%  java.lang.ClassValue$ClassValueMap.readAccess     
Executor task launch worker for task 2.0 in sta...     4.16s  31.7%  RUNNABLE                     scala.runtime.ClassValueCompat.get                
Executor task launch worker for task 0.0 in sta...     4.05s  30.9%  RUNNABLE                     org.apache.spark.util.collection.SizeTrackingAp...
main                                                   0.20s   1.5%  WAITING                      jdk.internal.misc.Unsafe.park                     
C2 CompilerThread0                                     0.16s   1.2%  RUNNABLE                                                                       
task-result-getter-1                                   0.07s   0.6%  WAITING                      jdk.internal.misc.Unsafe.park                     
task-result-getter-0                                   0.07s   0.5%  WAITING                      jdk.internal.misc.Unsafe.park                     
...

But thread dumps aren’t the only way to peek at a running Java application; you can also take a short profile and generate a flamegraph:

Generating Flamegraphs

For this purpose, I integrated async-profiler using my ap-loader library:

> ./jstall flame 8136  
Starting flamegraph generation for PID 8136...
Event: cpu
Duration: 10s
Interval: 10ms
Output: flame.html

Executing: asprof -d 10 -e cpu -i 10000000 -f flame.html 8136

Profiling for 10 seconds
Done

✓ Flamegraph successfully generated!
Output file: .../flame.html
File size: 349576 bytes

Which results in this rather unwieldy flamegraph, courtesy of the Apache Spark benchmark:

Adding Analyses

Do you have an idea for another thread dump analysis? Just implement the Analyzer interface, create a CLI command class, and submit a pull request on GitHub. The Analyzer interface is pretty simple:

public interface Analyzer {

    /**
     * Returns the name of this analyzer.
     */
    String name();

    /**
     * Returns the set of options this analyzer supports.
     *
     * Common options: "dumps", "interval", "keep", "top"
     */
    Set<String> supportedOptions();

    /**
     * Returns the dump requirement for this analyzer.
     */
    DumpRequirement dumpRequirement();

    default AnalyzerResult analyze(List<ThreadDumpWithRaw> dumpsWithRaw, 
      Map options) {
        List dumps = dumpsWithRaw.stream().map(ThreadDumpWithRaw::parsed).toList();
        return analyzeThreadDumps(dumps, options);
    }

    default AnalyzerResult analyzeThreadDumps(List dumps, 
     Map options) {
        throw new UnsupportedOperationException(
            "Analyzer must implement either analyze(List, Map) " + 
            "or analyzeThreadDumps(List, Map)");
    }
}

Additionally, there are helper classes that make writing new analyses even easier. So feel free to submit your own. Ideas for new analyses could be:

A lock view that emits a lock graph
A thread-state analysis
A thread blockage analysis that takes the top stack frames and determines whether the threads are waiting for IO, user input, or something else
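As an illustration of the second idea, the core of a thread-state analysis can be sketched independently of JStall's types. Everything here is hypothetical: the class ThreadStateSummary and its methods are not part of JStall, and a real analysis would implement the Analyzer interface and take the states from the parsed thread dumps.

```java
import java.util.EnumMap;
import java.util.List;
import java.util.Locale;
import java.util.Map;
import java.util.stream.Collectors;

class ThreadStateSummary {

    /** Share of dumps (in percent) in which a thread was observed in each state. */
    static Map<Thread.State, Double> distribution(List<Thread.State> observedStates) {
        Map<Thread.State, Double> shares = new EnumMap<>(Thread.State.class);
        for (Thread.State state : observedStates) {
            shares.merge(state, 100.0 / observedStates.size(), Double::sum);
        }
        return shares;
    }

    /** Formats the distribution in the order of the Thread.State enum constants. */
    static String format(Map<Thread.State, Double> shares) {
        return shares.entrySet().stream()
                .map(e -> String.format(Locale.ROOT, "%s: %.0f%%", e.getKey(), e.getValue()))
                .collect(Collectors.joining(", "));
    }

    public static void main(String[] args) {
        // One observed state per thread dump for a single thread
        List<Thread.State> states = List.of(Thread.State.RUNNABLE, Thread.State.BLOCKED);
        System.out.println(format(distribution(states)));
    }
}
```

For a thread that was RUNNABLE in one dump and BLOCKED in the other, this yields a summary in the style of the STATES column shown earlier.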

The thread dump analyses use the jthreaddump parsing library, which you can also use for your own projects.

Conclusion

I sometimes get the time to write tools that solve minor annoyances, in this case, finding the status of a Java app. I hope you find JStall as helpful as I do, and make this open-source project part of your daily toolbox. Feel free to contribute via GitHub issues or pull requests.

Thank you for reading some (at least one) of my blog posts this year. I hope you will join me again next year.

This article is part of my work in the SapMachine team at SAP, making profiling and debugging easier for everyone.

P.S.: Never stop exploring…

P.P.S.: You can, of course, use the JStall tool via jbang too: jbang jstall@parttimenerd/jstall

P.P.P.S.: Abschalten (German for “switch off”)

Don’t use Arrays or other Complex Types in Custom JFR Events

Johannes Bechberger — Fri, 12 Dec 2025 07:45:58 +0000

JDK Flight Recorder (JFR), the profiler built into the JDK, supports custom events. Around two years ago, I wrote a blog post on this very topic: Custom JFR Events: A Short Introduction. These custom events are beneficial because they enable us to record additional project-specific information alongside the standard JFR events, all in the same file. We can then view and process this information with the JFR tools. You can freely specify these events in Java.

There is only one tiny problem nobody talks about: Array support (and, more generally, the support of complex types).

Take the following event:

class ArrayEvent extends Event {
    
    @Label("String Array")
    String[] stringArray;
    
    @Label("Int Array")
    int[] intArray;
    
    @Label("Long Array")
    long[] longArray;
    
    @Label("Non array field")
    String nonArrayField = "default";
}

What would you expect to happen when we create the event using the following code?

ArrayEvent event = new ArrayEvent();
event.stringArray = new String[]{"one", "two", "three"};
event.intArray = new int[]{1, 2, 3, 4, 5};
event.longArray = new long[]{100L, 200L, 300L};
event.commit();

You probably expect the event on disk to look like this (via jfr print):

ArrayEvent {
  startTime = 11:52:28.250 (2025-12-11)
  stringArray = ["one", "two", "three"]
  intArray = [1, 2, 3, 4, 5]
  longArray = [100, 200, 300]
  nonArrayField = "default"
  eventThread = "main" (javaThreadId = 3)
  stackTrace = [
    JFRArrayTest.main(String[]) line: 30
  ]
}

Then you’re wrong. It actually looks like this:

ArrayEvent {
  startTime = 11:52:28.250 (2025-12-11)
  nonArrayField = "default"
  eventThread = "main" (javaThreadId = 3)
  stackTrace = [
    JFRArrayTest.main(String[]) line: 30
  ]
}

Don’t trust me? Here is the source code, so that you can run it yourself. So essentially JFR hates arrays (kind of):

And no, the JVM doesn’t emit any warnings; it simply omits the fields.

Which Types are Supported?

The metadata.xml file, which defines many of the built-in JFR events, like GCHeapSummary, already provides some hints by listing the types at the end. I expected that arrays of supported types would be supported as well; however, as you saw above, this isn’t the case.

When the JVM writes the fields of events, it checks that every explicit field has a valid type (source):

for (FieldModel field : classModel.fields()) {
    if (!foundFields.contains(field.fieldName().stringValue()) && 
      isValidField(field.flags().flagsMask(), field.fieldTypeSymbol())) {
        fieldDescs.add(FieldDesc.of(field.fieldTypeSymbol(), 
             field.fieldName().stringValue()));
        foundFields.add(field.fieldName().stringValue());
    }
}

A slight tangent: The lines below this snippet show us that the parent class fields of our event are also parsed so that we can have hierarchies of custom events.

The isValidField method checks that fields are neither transient (marked not to be serialized) nor static (source):

static boolean isValidField(int access, String className) {
    if (Modifier.isTransient(access) || Modifier.isStatic(access)) {
        return false;
    }
    return Type.isValidJavaFieldType(className);
}

Skipping transient fields can be helpful when storing additional information in custom JFR events, when we reuse the event instances elsewhere.

The Type#isValidJavaFieldType method then checks that the type is one of: boolean, char, float, double, byte, short, int, long, Class, String, Thread, StackTrace (source)

As Erik Gahlin noted: From the jdk.jfr.Event documentation: Supported field types are the Java primitives: boolean, char, byte, short, int, long, float, and double. Supported reference types are: String, Thread and Class. Arrays, enums, and other reference types are silently ignored and not included. (not mentioning StackTrace)
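Since the JVM gives no warning, a small reflective check can at least flag the problematic fields before an event class is used. The following is a sketch under assumptions, not part of JFR: the class JfrFieldCheck and its method are made up, and the set of supported types is taken from the jdk.jfr.Event documentation quoted above.

```java
import java.lang.reflect.Field;
import java.lang.reflect.Modifier;
import java.util.List;
import java.util.Set;
import java.util.TreeSet;
import jdk.jfr.Event;

class JfrFieldCheck {

    // Field types listed as supported in the jdk.jfr.Event documentation
    private static final Set<Class<?>> SUPPORTED = Set.of(
            boolean.class, char.class, byte.class, short.class, int.class,
            long.class, float.class, double.class,
            String.class, Thread.class, Class.class);

    /** Names of the declared instance fields that JFR would silently drop, sorted. */
    static List<String> unsupportedFields(Class<? extends Event> eventClass) {
        Set<String> names = new TreeSet<>();
        for (Field field : eventClass.getDeclaredFields()) {
            int modifiers = field.getModifiers();
            if (Modifier.isStatic(modifiers) || Modifier.isTransient(modifiers)
                    || field.isSynthetic()) {
                continue; // static and transient fields are not written anyway
            }
            if (!SUPPORTED.contains(field.getType())) {
                names.add(field.getName());
            }
        }
        return List.copyOf(names);
    }

    /** Reduced version of the event from the beginning of this post. */
    static class ArrayEvent extends Event {
        String[] stringArray;
        int[] intArray;
        String nonArrayField = "default";
    }

    public static void main(String[] args) {
        System.out.println("Ignored by JFR: " + unsupportedFields(ArrayEvent.class));
    }
}
```

Such a check could run in a unit test for every custom event class, so that an unsupported field fails the build instead of silently vanishing from the recording.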

So a custom field can have a non-primitive type, as long as it is one of the few supported reference types. Therefore, the following event is processed as expected:

class ClassEvent extends Event {
    Class klass;
    StackTraceElement[] stack;
}

A sample event on disk may look like:

ArrayEvent {
  startTime = 12:30:36.469 (2025-12-11)
  nonArrayField = "default"
  classField = java.lang.String (classLoader = bootstrap)
  eventThread = "main" (javaThreadId = 3)
  stackTrace = [
    JFRArrayTest.main(String[]) line: 54
    jdk.internal.reflect.DirectMethodHandleAccessor.invoke(Object, Object[]) line: 104
    java.lang.reflect.Method.invoke(Object, Object[]) line: 565
    com.sun.tools.javac.launcher.SourceLauncher.execute(MemoryContext, String[]) line: 258
    com.sun.tools.javac.launcher.SourceLauncher.run(String[], String[]) line: 138
  ]
}
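If the array contents still need to end up in the recording, one possible workaround (not taken from this post's source code, so treat it as an assumption) is to flatten each array into a String field, since String is a supported type. The class names and the encode helpers below are hypothetical:

```java
import java.util.Arrays;
import jdk.jfr.Event;
import jdk.jfr.Label;

class ArrayWorkaround {

    /** Event that stores the array contents as text instead of as arrays. */
    static class FlattenedArrayEvent extends Event {
        @Label("String Array")
        String stringArray;

        @Label("Int Array")
        String intArray;
    }

    /** Encodes an int array as text, using the Arrays.toString format. */
    static String encode(int[] values) {
        return Arrays.toString(values);
    }

    /** Joins the strings with commas; ambiguous if a string itself contains a comma. */
    static String encode(String[] values) {
        return String.join(",", values);
    }

    public static void main(String[] args) {
        FlattenedArrayEvent event = new FlattenedArrayEvent();
        event.stringArray = encode(new String[]{"one", "two", "three"});
        event.intArray = encode(new int[]{1, 2, 3, 4, 5});
        event.commit(); // only recorded if a JFR recording is running
        System.out.println(event.stringArray + " " + event.intArray);
    }
}
```

The price is that consumers of the recording have to parse the text again, and the comma-separated encoding is lossy for strings that contain commas.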

Conclusion

Custom JFR events are a valuable tool, but they have subtle limitations, as illustrated in this blog post. It’s a pity that there are no warnings for unsupported types, which makes the issue harder to catch. I found this while writing custom JFR events using the JMC JFR writer API and reading them back with the standard OpenJDK JFR API. Why did I do this?

Come back next week to learn about redacting sensitive information from JFR files.

This article is part of my work in the SapMachine team at SAP, making profiling and debugging easier for everyone.

Creating a Gridfinity Chocolate Advent Calendar

Johannes Bechberger — Fri, 05 Dec 2025 09:18:10 +0000

This week is a bit different as I’m working on a fun year-end blog and doing my regular work, so I’ll share with you how to create an advent calendar:

Normal Advent calendars are boring: so let’s make our own! We’ll combine all our favorite technologies: Gridfinity (for the grid system), 3D printing (for the grid), vacuum molding (for the chocolate), laser cutting (for the frame), and automated paper cutting to create advent calendars that are both beautiful and functional.

Along the way, we’ll cover practical food safety considerations and show how these techniques come together to produce something tasty, nerdy, and gift-worthy:

You can find the positive for the chocolate mold on MakerWorld:

I also created a 2×2 Gridfinity grid that fits Ritter Sport:

You can learn how to create QR codes out of chocolate in another video of mine:

I usually post computer/Java-related blog posts, but I hope you also liked this blog post, in which I shared my hobbies.

Who instruments the native instrumenters?

Johannes Bechberger — Thu, 20 Nov 2025 08:45:13 +0000

Hot-patching the JVM to hook native Java agents

Over a year ago, I wrote a blog post called Who instruments the instrumenters? together with Mikaël Francoeur on how we debugged the Java instrumentation code. In the meantime, I gave a more detailed talk on this topic at VoxxedDays Amsterdam. The meta-agent that I developed for this worked well for Java agents/instrumenters, but what about native agents? Marco Sussitz found my agent and asked exactly this question. Native agents are agents that utilize the JVMTI API to, for example, modify class bytecode; however, they are not written in Java. With this blog post, I’m proud to announce that the meta-agent now supports instrumenting native agents.

TL;DR: Meta-agent allows you to see how an agent, native or Java, transforms bytecode.

There are many examples of native agents, like Dynatrace’s monitoring agent or async-profiler’s method tracer. I’m using the latter in my example here, as it’s open-source and readily available. The method tracer instruments the Java bytecode to trace the execution time of specific methods. You can find more about it in the async-profiler forum.

As a sample program, we use Loop.java:

public class Loop {
    public static void main(String[] args) 
      throws InterruptedException {
        while (true) Thread.sleep(1000);
    }
}

Let’s trace the Thread.sleep method and use the meta-agent to see what async-profiler does with the bytecode:

java -agentpath:native/libnative_agent.dylib \
     -javaagent:target/meta-agent.jar=server \
     -agentpath:libasyncProfiler.dylib=start,trace=java.lang.Thread.sleep,file=duration.html \
     Loop.java

This opens a server at localhost:7071 and we check how async-profiler modified the Thread class:

So we can now instrument native agents just like Java agents. And the best part: as all Java agents are built on top of the libinstrument native agent, we can also see what any Java agent is doing. For example, we can see that the Java instrumentation agent instruments itself:

So I finally built an instrumenter that can essentially instrument my instrumentation agent, which in turn instruments other instrumentation agents. Another benefit is that the instrumenter can find every modification of any Java agent.

Background

Before I show you how I implemented the new meta-agent features, I want to show you how a typical native agent is implemented. In my blog post Instrumenting Java Code to Find and Handle Unused Classes, I showed how to develop your own instrumenting agent in Java. It’s pretty simple: we just create an implementation of the ClassFileTransformer interface and implement its transform method to transform classes when they are loaded or when a retransformation is triggered.
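As a reminder of what the Java side looks like, here is a minimal sketch of such a transformer. It only logs class names and leaves the bytecode untouched; the class name LoggingTransformer and the install helper are made up for this example, and a real agent would call install from its premain method.

```java
import java.lang.instrument.ClassFileTransformer;
import java.lang.instrument.Instrumentation;
import java.security.ProtectionDomain;

class LoggingTransformer implements ClassFileTransformer {

    @Override
    public byte[] transform(ClassLoader loader, String className,
                            Class<?> classBeingRedefined,
                            ProtectionDomain protectionDomain,
                            byte[] classfileBuffer) {
        System.out.println("[Agent] Class loaded: "
                + (className != null ? className : "<unnamed>"));
        return null; // null tells the JVM that the bytecode is unchanged
    }

    /** Registers the transformer; the second argument allows retransformation. */
    static void install(Instrumentation instrumentation) {
        instrumentation.addTransformer(new LoggingTransformer(), true);
    }
}
```

Returning a modified byte array instead of null is all it takes to change a class, which is what makes the Java API so convenient compared to its native counterpart.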

But how would you do this in C/C++ using the native API? We have the JVM Tool Interface (JVMTI) at our disposal. This interface allows the creation of native agents. Commonly used native agents include, for example, the JDWP agent for debugging or the libinstrument agent, which triggers all the Java agents.

In the following, I’ll show you how to implement a tiny native agent; you can find the full source code on GitHub. In contrast to the Java agent, where we implement the ClassFileTransformer#transform method, here we implement a hook for the ClassFileLoadHook JVMTI event:

void JNICALL ClassFileLoadHook(
    jvmtiEnv *jvmti,
    JNIEnv *jni,
    jclass class_being_redefined,
    jobject loader,
    const char *name,
    jobject protection_domain,
    jint class_data_len,
    const unsigned char *class_data,
    jint *new_class_data_len,
    unsigned char **new_class_data
) {
    if (name) {
        printf("[Agent] Class loaded: %s\n", name);
    } else {
        printf("[Agent] Anonymous class loaded.\n");
    }
}

We don’t transform any of the bytecode here, as there is no proper native bytecode library, and all agents have to essentially reimplement their own (some are based on the old java_crw_demo.c).

Now we just need to create an agent and register the hook. For this, we implement the Agent_OnLoad method that is called when the agent is first loaded:

JNIEXPORT jint JNICALL Agent_OnLoad(JavaVM *vm, char *options, void *reserved) {
    // We define some variables where JVMTI environment
    // and the error state is later stored
    jvmtiEnv *jvmti;
    jvmtiError err;

    printf("[Agent] Agent_OnLoad called.\n");

    // We get the JVMTI environment and fail if we had a problem
    if ((*vm)->GetEnv(vm, (void **)&jvmti, JVMTI_VERSION_1_2) != JNI_OK) {
        printf("[Agent] Unable to get JVMTI environment.\n");
        return JNI_ERR;
    }

    // We set capabilities of this agent, requesting access to ClassFileLoadHooks
    jvmtiCapabilities caps;
    memset(&caps, 0, sizeof(caps));
    caps.can_generate_all_class_hook_events = 1;
    err = (*jvmti)->AddCapabilities(jvmti, &caps);
    if (err != JVMTI_ERROR_NONE) {
        printf("[Agent] AddCapabilities failed: %d\n", err);
        return JNI_ERR;
    }

    // We register the ClassFileLoadHook
    jvmtiEventCallbacks callbacks;
    memset(&callbacks, 0, sizeof(callbacks));
    callbacks.ClassFileLoadHook = &ClassFileLoadHook;
    err = (*jvmti)->SetEventCallbacks(jvmti, &callbacks, sizeof(callbacks));
    if (err != JVMTI_ERROR_NONE) {
        printf("[Agent] SetEventCallbacks failed: %d\n", err);
        return JNI_ERR;
    }

    // We finally enable the ClassFileLoadHook event
    err = (*jvmti)->SetEventNotificationMode(jvmti, JVMTI_ENABLE, JVMTI_EVENT_CLASS_FILE_LOAD_HOOK, NULL);
    if (err != JVMTI_ERROR_NONE) {
        printf("[Agent] SetEventNotificationMode failed: %d\n", err);
        return JNI_ERR;
    }

    // and return
    printf("[Agent] ClassFileLoadHook registered.\n");
    return JNI_OK;
}

We now have a simple native agent that informs us whenever a class is loaded or retransformed. After we built it, we can use it with the Loop.java file from before:

> java -agentpath:native/libagent_minimal_cfh.dylib Loop.java               
[Agent] Agent_OnLoad called.
[Agent] ClassFileLoadHook registered.
[Agent] Class loaded: jdk/internal/vm/ContinuationSupport
[Agent] Class loaded: jdk/internal/vm/Continuation$Pinned
[Agent] Class loaded: sun/launcher/LauncherHelper
[Agent] Class loaded: jdk/internal/loader/BuiltinClassLoader$2
...

It is important to note that the JVM calls the Agent_OnLoad method of every native agent before the JVM starts loading our application (or the compiler in our example) and before the JVM starts any Java agent.

Now the question is: How can we instrument the native agent? In the Java case, we just transformed every call to Instrumentation#addTransformer to call a meta-agent method instead, which wrapped that Transformer before adding it.
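The wrapping idea from the Java case can be sketched as follows. This is a simplified illustration, not the meta-agent's actual code: the class RecordingTransformer, its modifiedClasses method, and the demo helper are hypothetical. The wrapper delegates to the original transformer and records which classes it actually changed.

```java
import java.lang.instrument.ClassFileTransformer;
import java.lang.instrument.IllegalClassFormatException;
import java.security.ProtectionDomain;
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;

class RecordingTransformer implements ClassFileTransformer {

    private final ClassFileTransformer wrapped;
    private final List<String> modifiedClasses = new ArrayList<>();

    RecordingTransformer(ClassFileTransformer wrapped) {
        this.wrapped = wrapped;
    }

    // Overriding the module-aware variant covers both transform methods,
    // as its default implementation calls the variant without a module.
    @Override
    public byte[] transform(Module module, ClassLoader loader, String className,
                            Class<?> classBeingRedefined,
                            ProtectionDomain protectionDomain,
                            byte[] classfileBuffer) throws IllegalClassFormatException {
        byte[] result = wrapped.transform(module, loader, className,
                classBeingRedefined, protectionDomain, classfileBuffer);
        if (result != null && !Arrays.equals(result, classfileBuffer)) {
            synchronized (modifiedClasses) {
                modifiedClasses.add(className); // the wrapped agent changed this class
            }
        }
        return result;
    }

    /** Names of all classes that the wrapped transformer modified so far. */
    List<String> modifiedClasses() {
        synchronized (modifiedClasses) {
            return List.copyOf(modifiedClasses);
        }
    }

    /** Wraps a toy transformer that appends one byte to classes named Loop*. */
    static List<String> demo() {
        ClassFileTransformer toy = new ClassFileTransformer() {
            @Override
            public byte[] transform(ClassLoader loader, String className,
                                    Class<?> classBeingRedefined,
                                    ProtectionDomain protectionDomain,
                                    byte[] classfileBuffer) {
                return className.startsWith("Loop")
                        ? Arrays.copyOf(classfileBuffer, classfileBuffer.length + 1)
                        : null;
            }
        };
        RecordingTransformer recorder = new RecordingTransformer(toy);
        try {
            recorder.transform(null, null, "Loop", null, null, new byte[]{1});
            recorder.transform(null, null, "Other", null, null, new byte[]{1});
        } catch (IllegalClassFormatException e) {
            throw new IllegalStateException(e);
        }
        return recorder.modifiedClasses();
    }

    public static void main(String[] args) {
        System.out.println("Modified classes: " + demo());
    }
}
```

Because the wrapper sees both the input and the output bytecode, it could just as well store both versions and show a diff of them.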

Our goal now is to achieve a similar outcome with native agents. In an ideal world, we would just transform all calls to SetEventCallbacks. But can we? Without any vtable hacks or binary trickery? This is where an observation comes in handy:

Patching JVMTI

The SetEventCallbacks method belongs to the JVMTI environment struct that our agent obtained from the JVM by calling GetEnv. This struct is defined in the generated JVMTI header as follows (this is the C version; the C++ version differs):

typedef struct jvmtiInterface_1_ {

  /*   1 :  RESERVED */
  void *reserved1;

  /*   2 : Set Event Notification Mode */
  jvmtiError (JNICALL *SetEventNotificationMode) (jvmtiEnv* env,
    jvmtiEventMode mode,
    jvmtiEvent event_type,
    jthread event_thread,
     ...);
  // ...
  
  /* 122 : Set Event Callbacks */
  jvmtiError (JNICALL *SetEventCallbacks) (jvmtiEnv* env,
    const jvmtiEventCallbacks* callbacks,
    jint size_of_callbacks);
  // ...
}

Using a tiny agent that prints the address of (*jvmti)->SetEventCallbacks, we can confirm that the method pointers for all agents point to the same address. So we can just override this method pointer via

*(void**)&((*jvmti)->SetEventCallbacks) = &SetEventCallbacks;

And now our wrapper

jvmtiError
SetEventCallbacks(jvmtiEnv* env, 
  jvmtiEventCallbacks* callbacks, 
  jint size_of_callbacks) { ... }

is called whenever a follow-up agent calls this method. Therefore, our native wrapping agent must always be the first agent. So we’re essentially hot-patching the JVM.

In the most basic case of only supporting wrapping one agent, we can implement a basic wrapper method as follows (GitHub):

// Original ClassFileLoadHook callback from the wrapped agent
static void (*original_ClassFileLoadHook)(jvmtiEnv *jvmti, JNIEnv *jni, 
                                          jclass class_being_redefined, jobject loader, 
                                          const char *name, jobject protection_domain, 
                                          jint class_data_len, const unsigned char *class_data, 
                                          jint *new_class_data_len, unsigned char **new_class_data) = NULL;

// Our wrapper for ClassFileLoadHook
static void JNICALL
wrapped_ClassFileLoadHook(jvmtiEnv *jvmti, JNIEnv *jni, jclass class_being_redefined,
                          jobject loader, const char *name, jobject protection_domain,
                          jint class_data_len, const unsigned char *class_data,
                          jint *new_class_data_len, unsigned char **new_class_data) {
    printf("[WRAPPER] ClassFileLoadHook called for class: %s\n", name ? name : "NULL");
    // Call the original ClassFileLoadHook
    original_ClassFileLoadHook(jvmti, jni, class_being_redefined, loader, name,
                                protection_domain, class_data_len, class_data,
                                new_class_data_len, new_class_data);
}

// Original SetEventCallbacks; assumed to be saved before the function table
// is patched (that part is not shown in this excerpt)
static jvmtiError (JNICALL *original_SetEventCallbacks)(jvmtiEnv *env,
    const jvmtiEventCallbacks *callbacks, jint size_of_callbacks) = NULL;

// Our wrapper for SetEventCallbacks
jvmtiError
SetEventCallbacks(jvmtiEnv* env, jvmtiEventCallbacks* callbacks, jint size_of_callbacks) {
    printf("[WRAPPER] SetEventCallbacks called\n");

    if (callbacks != NULL && callbacks->ClassFileLoadHook != NULL) {
        printf("[WRAPPER] Intercepting ClassFileLoadHook callback\n");

        // Store the original ClassFileLoadHook callback
        original_ClassFileLoadHook = callbacks->ClassFileLoadHook;

        // Replace with our wrapped version
        callbacks->ClassFileLoadHook = wrapped_ClassFileLoadHook;
    }

    // Register the (possibly modified) callbacks with the JVM;
    // without a ClassFileLoadHook this is a plain pass-through
    return original_SetEventCallbacks(env, callbacks, size_of_callbacks);
}

And that’s all. The implementation of this native agent instrumentation is relatively simple, yet it took me the better part of a morning, somewhere in Bulgaria, to come up with it.

Supporting Multiple Agents

Now we just need to support multiple agents. We could, in theory, create function objects on the heap, but this would require us to write or copy a small amount of custom assembly code onto a part of the heap that we would have to mark as executable. However, there is a more straightforward, albeit less elegant solution: we can limit the number of supported agent loadings to a large, fixed number, such as 4096, and probably be fine.

Now we start by defining an array of a ClassFileLoadHookInfo struct which includes both the wrapped load hook and the name of the agent:

typedef struct {
    void (*callback)(jvmtiEnv *jvmti, JNIEnv *jni, jclass class_being_redefined, 
                     jobject loader, const char *name, jobject protection_domain, 
                     jint class_data_len, const unsigned char *class_data, 
                     jint *new_class_data_len, unsigned char **new_class_data);
    char name[MAX_AGENT_NAME_LEN];
} ClassFileLoadHookInfo;

static ClassFileLoadHookInfo agent_info[MAX_AGENTS];
static int next_agent_slot = 0;

This is where we store all the wrapped load hooks. The idea is then to create 4096 functions, where function 0 wraps the load hook from agent_info[0], function 1 wraps the one from agent_info[1], and so on. All these functions are created using macros, and their addresses are stored in a static WrapperFunc wrapper_functions[MAX_AGENTS] array so that we can later easily get the address of the nth function.
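The macro idea can be sketched in a few lines. This is a simplified illustration, not the meta-agent's actual code: it uses four slots instead of 4096, a hypothetical Hook type with an int (*)(int) signature instead of the real ClassFileLoadHook signature, and the invented names DEFINE_WRAPPER, register_hook, add_one, and times_two.

```c
#include <assert.h>
#include <stddef.h>

#define MAX_AGENTS 4   /* the post uses 4096; kept tiny for the sketch */

/* Simplified stand-in for the ClassFileLoadHook function pointer type. */
typedef int (*Hook)(int);

static Hook hooks[MAX_AGENTS];   /* slot n holds the hook of the nth agent */
static int calls[MAX_AGENTS];    /* per-slot call counter, standing in for the recording */
static int next_slot = 0;

/* Generates wrapper_N, which is hard-wired to slot N. A plain C function
   pointer cannot carry extra state, so every slot needs its own function. */
#define DEFINE_WRAPPER(N) \
    static int wrapper_##N(int x) { calls[N]++; return hooks[N](x); }

DEFINE_WRAPPER(0)
DEFINE_WRAPPER(1)
DEFINE_WRAPPER(2)
DEFINE_WRAPPER(3)

/* Lookup table: the nth entry is the wrapper bound to slot n. */
static const Hook wrapper_functions[MAX_AGENTS] = {
    wrapper_0, wrapper_1, wrapper_2, wrapper_3
};

/* Stores the hook in the next free slot and returns the matching wrapper,
   or NULL once all slots are used up. */
Hook register_hook(Hook original) {
    assert(original != NULL);
    if (next_slot >= MAX_AGENTS) return NULL;
    hooks[next_slot] = original;
    return wrapper_functions[next_slot++];
}

/* Two example hooks standing in for the hooks of two different agents. */
int add_one(int x)   { return x + 1; }
int times_two(int x) { return x * 2; }
```

Calling register_hook(add_one) and register_hook(times_two) yields two distinct wrapper addresses, each of which forwards to its own hook and counts its own calls. With 4096 slots, the wrapper definitions and the table would presumably be produced by nested macros rather than written out by hand.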

But how does the wrapping function then communicate with our meta-agent?

Communicating with the meta-agent

Unfortunately, we cannot simply use JVMTI to invoke the meta-agent’s Java code and record the collected bytecode transformations within the meta-agent. The problem is that we would need to do this within a ClassFileLoadHook, which would easily create circular class loading errors.

Instead, we create a temporary folder /tmp/njvm and atomically create numbered files in there for every recorded transformation. The files have the following format:

Line 1: agent_name (e.g., “agent_minimal_cfh”)
Line 2: class_name (e.g., “java/lang/String” or “unknown”)
Line 3: old_len (decimal number, e.g., “1234”)
Line 4: new_len (decimal number, e.g., “1456”)
Binary data: old_len bytes of original class data
Binary data: new_len bytes of transformed class data

The meta-agent runs a loop that regularly iterates over the folder, parses all files in it, and removes duplicates.
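The record format above is simple enough to sketch a writer for it. The following is a hedged illustration, not the code from the repository: the function name write_record is invented, and the write-to-temporary-file-then-rename step is just one common way to make a file appear atomically (on POSIX systems, rename replaces the target in one step), which may differ from what the native agent actually does.

```c
#include <assert.h>
#include <stdio.h>
#include <string.h>

/* Writes one record in the format described above to `path`.
   Assumption: the file is first written to "<path>.tmp" and then renamed,
   so that a reader scanning the folder never sees a half-written record. */
int write_record(const char *path, const char *agent_name, const char *class_name,
                 const unsigned char *old_data, size_t old_len,
                 const unsigned char *new_data, size_t new_len) {
    assert(path != NULL && agent_name != NULL);

    char tmp[512];
    if (strlen(path) + 5 > sizeof tmp) return -1;   /* ".tmp" plus terminator */
    snprintf(tmp, sizeof tmp, "%s.tmp", path);

    FILE *f = fopen(tmp, "wb");
    if (f == NULL) return -1;

    /* Four text lines: agent name, class name, old length, new length. */
    int ok = fprintf(f, "%s\n%s\n%zu\n%zu\n", agent_name,
                     class_name ? class_name : "unknown", old_len, new_len) > 0;
    /* Followed by the raw class bytes before and after the transformation. */
    ok = ok && fwrite(old_data, 1, old_len, f) == old_len;
    ok = ok && fwrite(new_data, 1, new_len, f) == new_len;
    /* Always close the file, even if a write failed. */
    ok = (fclose(f) == 0) && ok;

    if (!ok || rename(tmp, path) != 0) {
        remove(tmp);
        return -1;
    }
    return 0;
}
```

Because the lengths are stored as text lines before the binary data, a reader can parse the four header lines first and then read exactly old_len and new_len bytes.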

Using the Native Agent

But first: Is this safe? Yes, but don’t use it in production: the implementation itself probably still contains some errors. However, it’s platform-independent and doesn’t use undefined behavior in C, nor does it use any assembly code.

To use it, just clone the meta-agent repository and build the native agent via make:

git clone https://github.com/parttimenerd/meta-agent
cd meta-agent
(cd native; make)
# for good measure also build the Java meta-agent
mvn package -DskipTests
# Now you can use it via
java -agentpath:native/libnative_agent.dylib=log=verbose \
     -javaagent:target/meta-agent.jar=server ...

Conclusion

In this blog post, I showcased the new support for native agents in the meta-agent project and how I implemented the feature using JVM hot patching. This is particularly beneficial for debugging native agents and examining what existing agents, such as async-profiler’s method tracer, do. Of course, we could extend this to also wrap other JVMTI methods, but for now, this is out of scope.

Thanks for coming this far. I hope to see you in two weeks with something else.

P.S.: Here is a picture of a cat from the Pompeii museum.

This blog post is part of my work in the SapMachine team at SAP, making profiling easier for everyone. Thanks to Marco Sussitz for the idea of instrumenting native agents and for showcasing my meta-agent at JavaZone.

The post Who instruments the native instrumenters? appeared first on Mostly nerdless.

Running an LLM on an Android Phone

Johannes Bechberger — Wed, 05 Nov 2025 06:25:39 +0000

In my last blog post, I showed you how to work with JFR files using DuckDB, which started a blog series that I surely will continue. Just not this week. Instead, I want to showcase a tiny app to run AI models using the MediaPipe API directly on your phone. I created the app for another purpose (perhaps described in a future blog post) earlier this year, but never wrote anything about it. So here we are.

TL;DR: I built an Android app that offers AI models via a server

The app is open-source and available on GitHub; it’s experimental, but maybe it can help you build your own apps. You can download it from the releases page of the repo and install it.

The LLM API endpoint, writing a poem on a backyard scene

The Android App

As already described, you can just download the app, but to fully use it, you need to install some AI models. For models like Google’s Gemma, which require authentication for download, you must click “…” to access the download link and then download the files from HuggingFace after agreeing to the license terms. After downloading, load the model file into the app using the “Load” button. The app can download other models directly. Please note that you may need to refresh the page manually. After installation, you can test the model directly with a basic prompt:

The app opens a port (typically 8005) and allows you to test its web endpoints directly. You can use it to capture images using the rear and front camera and do some object detection, using the EfficientDet Lite 2 model (not the best, but it’s small):

As you saw in the TL;DR section, you can also prompt the installed LLMs and use them, for example, for better on-device object detection:

Which leads to “slightly” better results than the EfficientDet Lite 2 model:

[
  {
    "object": "chair",
    "details": "woven wicker chair with a curved back and a metal frame. Covered in fallen leaves."
  },
  {
    "object": "table",
    "details": "wooden table, partially visible."
  },
  {
    "object": "leaves",
    "details": "Numerous fallen leaves, primarily yellow and brown, scattered on the ground."
  },
  {
    "object": "plants/vines",
    "details": "Green plants and vines growing on a wall or fence behind the chair and table."
  },
  {
    "object": "ground",
    "details": "Paved ground with a brick or stone pattern."
  }
]

However, in defense of the smaller model, the LLM took almost 40 times longer (46 seconds vs. 1.2 seconds).

Please note that, for privacy reasons, the app must be open and visible to capture images.

There is also the possibility of capturing the current orientation of the phone, but that’s similar to the other APIs.

Server Functionality

As I mentioned earlier, this app starts a server at port 8005, allowing you to easily access its AI capabilities from other apps and from terminal apps such as Termux or the Linux Terminal app.

You can find all the available APIs and their request and response formats in the project’s README, but curl localhost:8005 also gives you an overview:

Please be aware that the /location API is currently not working. But all the other APIs are, as you’ve seen before. Querying the local LLM is simple via curl:

The same for the orientation API:

Background

The Google AI Edge Gallery allows you to run the Gemma models directly on your phone with an interactive chat:

It’s a great app to explore three different Gemma models and one Qwen model on the CPU and GPU of your smartphone, with linked API samples for the MediaPipe library.

The only problem: I wanted to use these models and more in an emulated Linux running on my Android phone. However, these emulated OS instances can’t access the camera or other sensors, and they also run applications significantly slower than Android does directly. So I created the app showcased in this blog post to expose all this functionality via a server.

In the following, we use the app to create a few command-line apps for Termux.

A Tiny Fortune Clone

A tiny sample use case would be a fortune clone. Fortune is a small UNIX utility that “prints a random, hopefully interesting, adage” (from its man-page):

fortune is a program that displays a pseudorandom message from a database of quotations. Early versions of the program appeared in Version 7 Unix in 1979. The most common version on modern systems is the BSD fortune, originally written by Ken Arnold. Distributions of fortune are usually bundled with a collection of themed files, containing sayings like those found on fortune cookies (hence the name), quotations from famous people, jokes, or poetry.
Wikipedia on the fortune utility

> fortune
You could get a new lease on life -- if only you didn't need the first
and last month in advance.
> fortune
Help me, I'm a prisoner in a Fortune cookie file!

Let’s create our own in a tiny shell script that uses the AI server:

#!/bin/sh
# minimal fortune clone using local AI
# usage: ./fortune.sh

curl -s -d '{ "text": "You are a clone of the unix fortune tool, print a random, hopefully interesting, adage. Only print the single line adage directly.", "model": "GEMMA_3_1B_IT" }' localhost:8005/ai/text \
| sed -n 's/.*"response": *"\(.*\)".*/\1/p' \
| sed 's/\\n//g' \
| sed 's/^```//; s/```$//' \
| sed 's/^\.\.\.//' \
| sed 's/\\u0027/'"'"'/g'

It usually answers within two to three seconds, so it’s not the fastest fortune clone, but it gives interesting results (the right picture is a version instructed to be a funny clone):

Sometimes the AI server experiences issues; it’s still a prototype…

Conclusion

It’s all a big experiment, demonstrating the power of tiny AI models on modern smartphones. I hope you can use it to develop your own fun little apps or shell scripts in Termux, just as I did for this blog post.

Thank you for coming this far. I look forward to seeing you in the next few weeks for a blog post on instrumenting native agents.


Making JFR Quack: Importing JFR files into DuckDB

Johannes Bechberger — Fri, 24 Oct 2025 07:45:55 +0000

In my previous post, I showed you how tricky it is to compare objects from the JFR Java API. You probably wondered why I wrote about this topic. Here is the reason: In this blog post, I’ll cover how to load JFR files into a DuckDB database to allow querying profiling data with simple SQL queries, all JFR views included.

This blog post will start a small series on making JFR quack.

TL;DR

You can now use a query tool (via GitHub) to transform JFR files into similarly sized DuckDB files:

> java -jar target/query.jar duckdb import jfr_files/recording.jfr duckdb.db
> duckdb duckdb.db "SELECT * FROM Events";
┌───────────────────────────────┬───────┐
│             name              │ count │
│            varchar            │ int32 │
├───────────────────────────────┼───────┤
│ GCPhaseParallel               │ 69426 │
│ ObjectAllocationSample        │  6273 │

Or run the queries directly, with the database file being cached (if you don’t pass --no-cache), directly supporting all built-in JFR views:

> java -jar target/query.jar query jfr_files/metal.jfr "hot-methods" 
Method                                                                                                   Samples Percent
-------------------------------------------------------------------------------------------------------- ------- -------
java.util.concurrent.ForkJoinPool.deactivate(ForkJoinPool.WorkQueue, int)                                   1066   8.09%
scala.collection.immutable.RedBlackTree$.lookup(RedBlackTree.Tree, Object, Ordering)                         695   5.27%
akka.actor.dungeon.Children.initChild(ActorRef)                                                              678   5.14%

This view is implemented as:

CREATE VIEW "hot-methods" AS
SELECT
  (c.javaName || '.' || m.name || m.descriptor) AS "Method",
  COUNT(*) AS "Samples",
  format_percentage(COUNT(*) / (SELECT COUNT(*) FROM ExecutionSample)) AS "Percent"
FROM ExecutionSample es
JOIN Method m ON es.stackTrace$topMethod = m._id
JOIN Class c ON m.type = c._id
GROUP BY es.stackTrace$topApplicationMethod, c.javaName, m.name, m.descriptor
ORDER BY COUNT(*) DESC
LIMIT 25

Previously

My new tool is the next evolution of my JFR-query-based tool, which I showcased in my post, An Experimental Front-End for JFR Queries, essentially allowing you to execute queries directly on JFR files using the JFR query syntax. It allowed running queries like the following, which is the view defined directly above in SQL:

COLUMN 'Method', 'Samples', 'Percent'
FORMAT none, none, normalized
SELECT stackTrace.topFrame AS T, COUNT(*), COUNT(*)
FROM ExecutionSample GROUP BY T LIMIT 25

But the JFR query system has severe limitations:

It was only ever built to implement a narrow set of views for the jfr view tool.
Its SQL variant is truly custom with syntax like DIFF([B|E].startTime) to fit the jfr view use case. But this makes the syntax hard to understand and hard to use.
The SQL variant lacks even basic things like non-equality comparisons.
It has a custom-built query engine, which is why custom syntax can be implemented, but also why it doesn’t have any query optimisations.
Extending the simple code is possible, but developing and maintaining a version with the missing SQL features would be a considerable effort.

The JFR query system is suitable for its intended use case, but not for general-purpose querying. This is why I went on to search for a viable alternative. I found it in the form of DuckDB. It’s more verbose than JFR files, but executing the queries harnesses the power of a proper database engine and is thereby significantly faster than running the JFR queries.

Alternatives

There are several alternatives to developing a custom project: First and foremost, of course, the jfr-analytics project by Gunnar Morling, which uses Apache Calcite to directly support SQL queries on top of the JFR Java API. This tool allows you to write queries like (source)

SELECT
  ts."startTime",
  ts."parentThread"."javaName" as "parentThread",
  ts."eventThread"."javaName" AS "newThread",
  TRUNCATE_STACKTRACE(ts."stackTrace", 20) AS "stackTrace"
FROM "jdk.ThreadStart" ts LEFT JOIN "jdk.ThreadEnd" te
  ON ts."eventThread"."javaThreadId" = te."eventThread"."javaThreadId"
WHERE te."startTime" IS NULL;

to find ThreadStart events without a corresponding ThreadEnd event.

The advantage is that it can easily support the nested structs in JFR data and doesn’t require the data to be transformed. The main disadvantage is that the tool uses non-standard SQL and is sadly not maintained anymore.

Other database importers exist for InfluxDB or Grafana, but all implementations I could find only support a limited number of JFR events.

Another notable project is jfrv, a web-based JFR viewer with a custom DuckDB reader to import JFR files directly. The JFR reader is written in Rust, which allows it to be compiled to WASM together with DuckDB and run directly in the browser. The main disadvantage of this project is that developing and supporting your own JFR reader is not advisable, as the JFR file format is neither well-documented nor well-specified.

Update: Frederic Thevenet wrote an interesting adapter for Lucene that I somehow must have missed.

I also toyed with a similar idea a while back: https://binjr.eu/blog/2023/08/new-data-adapter-jdk-flight-recorder/
With that said, there are some differences in the approach I took over the one you discussed in your post.
For one, I opted to use an inverted index (#Lucene) instead of a relational DB as my backend, which comes with it’s own trade-offs, like offering a query language that is somewhat easier to use, but not as nearly as powerful.
The other main difference, is that the route I used to get there is kinda like the opposite from the one you took: while you went from the backend working your way up to the UI, I very much started there (as I already had it) and worked my way down.
Doing things this way around meant that I could benefit immediately from the UI features that were there already (which was the whole point, of course) but it makes integrating new ones that don’t fit so naturally with the rest of the tool, much more time consuming…

At any rate, I would love to hear your thoughts if you find the time to give it a try!
(you can get it here: https://github.com/binjr/binjr/releases)
Frederic Thevenet on Mastodon

This is an interesting approach, but as Frederic wrote, it has some significant shortcomings. But in the end, it’s good to have a whole set of tools.

Why DuckDB

DuckDB is an easy-to-use in-process database that can keep its data in memory or in a single file. This database is optimized for analytical workflows:

DuckDB is designed to support analytical query workloads, also known as online analytical processing (OLAP). These workloads are characterized by complex, relatively long-running queries that process significant portions of the stored dataset, for example aggregations over entire tables or joins between several large tables.
Why DuckDB on duckdb.org

DuckDB is open-source, has a proper query optimizer, and supports many SQL features. The Java library already includes the DuckDB binaries for multiple platforms, so installing any software is unnecessary.

SQLite would be another good option, but DuckDB seems more optimized for my use case. Proper database systems like PostgreSQL or Prometheus would force the user into a more complex setup process, countering my project goals.

Goals and Non-Goals

The main goal of this project is to be able to query and analyze JFR data, with a focus on garbage collection related data, using the already widely known SQL. The aim is to provide a simple tool that mimics the jfr tool, which doesn’t require any setup or installation of additional software. The produced database files should be self-contained and usable with other tools.

But of course, with all that, the tool should also be maintainable. The eventual goal is to include a basic UI.

On the other hand, it is not a goal to replace the JFR tool or to be too configurable. So, it will probably never support databases other than DuckDB. It’s also not a goal to support every custom JFR event in existence, nor is it a goal to support deep stack traces in the events. But the goal is to support all JVM-internal JFR events and most user-defined ones, too, so there is no special handling of specific events.

Simplicity over complexity, and maintainability over feature richness.

Modelling JFR Events as Tables

We need to model JFR events as simple tables because we can’t have the luxury of Apache Calcite to query the object graph. So we start by translating the most common elements.

Events become tables, so, for example, an event like jdk.CPULoad

is modelled as a table with a timestamp column and three float columns:

D SHOW CPULoad;
┌──────────────┬─────────────┬─────────┬─────────┬─────────┬─────────┐
│ column_name  │ column_type │  null   │   key   │ default │  extra  │
│   varchar    │   varchar   │ varchar │ varchar │ varchar │ varchar │
├──────────────┼─────────────┼─────────┼─────────┼─────────┼─────────┤
│ startTime    │ TIMESTAMP   │ YES     │ NULL    │ NULL    │ NULL    │
│ jvmUser      │ FLOAT       │ YES     │ NULL    │ NULL    │ NULL    │
│ jvmSystem    │ FLOAT       │ YES     │ NULL    │ NULL    │ NULL    │
│ machineTotal │ FLOAT       │ YES     │ NULL    │ NULL    │ NULL    │
└──────────────┴─────────────┴─────────┴─────────┴─────────┴─────────┘

The start time is modelled as a timestamp. The additional data type information is sadly lost in translation, as DuckDB (in contrast to PostgreSQL) doesn’t have proper custom data types. The specific data type is only noted in the table comment:

D SELECT comment FROM duckdb_tables() where table_name = 'CPULoad';
comment = DESCRIPTION(Information about the recent CPU usage of the JVM process); Column "jvmUser": Percentage; Column "jvmSystem": Percentage; Column "machineTotal": Percentage

This isn’t pretty, but it should at least help for documentation purposes. You also see the table description.

Inlined Structs

Structs are modelled in two ways: Either they are inlined or a new table is created with an _id column. Structs are inlined if they either consist of a single property or, like VirtualSpace, consist only of numeric properties:

This struct is inlined into jdk.GCHeapSummary:

Inlining reduces the number of joins, which makes SQL queries easier to formulate and improves both query speed and file size. The event table looks like the following:

D SHOW GCHeapSummary;
┌─────────────────────────┬─────────────┬─────────┬─────────┬─────────┬─────────┐
│       column_name       │ column_type │  null   │   key   │ default │  extra  │
│         varchar         │   varchar   │ varchar │ varchar │ varchar │ varchar │
├─────────────────────────┼─────────────┼─────────┼─────────┼─────────┼─────────┤
│ startTime               │ TIMESTAMP   │ YES     │ NULL    │ NULL    │ NULL    │
│ gcId                    │ INTEGER     │ YES     │ NULL    │ NULL    │ NULL    │
│ when                    │ VARCHAR     │ YES     │ NULL    │ NULL    │ NULL    │
│ heapSpace$start         │ BIGINT      │ YES     │ NULL    │ NULL    │ NULL    │
│ heapSpace$committedEnd  │ BIGINT      │ YES     │ NULL    │ NULL    │ NULL    │
│ heapSpace$committedSize │ BIGINT      │ YES     │ NULL    │ NULL    │ NULL    │
│ heapSpace$reservedEnd   │ BIGINT      │ YES     │ NULL    │ NULL    │ NULL    │
│ heapSpace$reservedSize  │ BIGINT      │ YES     │ NULL    │ NULL    │ NULL    │
│ heapUsed                │ BIGINT      │ YES     │ NULL    │ NULL    │ NULL    │
└─────────────────────────┴─────────────┴─────────┴─────────┴─────────┴─────────┘

The inlined struct’s properties are prefixed by the event property name. These inlined structs are also included in the comments:

D .mode line
D SELECT comment FROM duckdb_tables() where table_name = 'GCHeapSummary';
comment = Column "heapSpace$start": MemoryAddress with DESCRIPTION(Start address of the virtual space); Column "heapSpace$committedEnd": MemoryAddress with DESCRIPTION(End address of the committed memory for the virtual space); Column "heapSpace$committedSize": DataAmount(BYTES) with DESCRIPTION(Size of the committed memory for the virtual space); Column "heapSpace$reservedEnd": MemoryAddress with DESCRIPTION(End address of the reserved memory for the virtual space); Column "heapSpace$reservedSize": DataAmount(BYTES) with DESCRIPTION(Size of the reserved memory for the virtual space); Column "heapUsed": DataAmount(BYTES) with DESCRIPTION(Bytes allocated by objects in the heap)

You might notice that the comments also include descriptions of properties where available. A front-end could parse this.
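The inlining rule and the property$field naming scheme described above can be sketched as two small functions. This is only an illustration of the rule as stated in the text, not the converter's code: the FieldKind enum and the function names should_inline and inlined_column_name are hypothetical.

```c
#include <assert.h>
#include <stdbool.h>
#include <stddef.h>
#include <stdio.h>

/* Hypothetical classification of a struct property. */
typedef enum { FIELD_NUMERIC, FIELD_STRING, FIELD_REFERENCE } FieldKind;

/* Rule from the text: inline a struct if it has a single property
   or if all of its properties are numeric. */
bool should_inline(const FieldKind *fields, size_t count) {
    assert(fields != NULL && count > 0);
    if (count == 1) return true;
    for (size_t i = 0; i < count; i++) {
        if (fields[i] != FIELD_NUMERIC) return false;
    }
    return true;
}

/* Builds "<property>$<field>", e.g. "heapSpace$start".
   Returns false if the name does not fit into `out`. */
bool inlined_column_name(char *out, size_t cap, const char *property, const char *field) {
    assert(out != NULL && property != NULL && field != NULL);
    int n = snprintf(out, cap, "%s$%s", property, field);
    return n >= 0 && (size_t)n < cap;
}
```

A struct with five numeric properties, like VirtualSpace, is inlined under this rule, while a struct that mixes references, strings, and numbers, like Class, gets its own table.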

Referenced Structs

But not all structs are inlined, so what do they look like? As hinted before, we create a separate table for them with an _id column, which is used to reference the structs. An example of this is the Class struct:

This is modelled as:

D SHOW Class;
┌─────────────┬─────────────┬─────────┬─────────┬─────────┬─────────┐
│ column_name │ column_type │  null   │   key   │ default │  extra  │
│   varchar   │   varchar   │ varchar │ varchar │ varchar │ varchar │
├─────────────┼─────────────┼─────────┼─────────┼─────────┼─────────┤
│ _id         │ UINTEGER    │ NO      │ PRI     │ NULL    │ NULL    │
│ classLoader │ UINTEGER    │ YES     │ NULL    │ NULL    │ NULL    │
│ name        │ VARCHAR     │ YES     │ NULL    │ NULL    │ NULL    │
│ package     │ UINTEGER    │ YES     │ NULL    │ NULL    │ NULL    │
│ modifiers   │ INTEGER     │ YES     │ NULL    │ NULL    │ NULL    │
│ hidden      │ BOOLEAN     │ YES     │ NULL    │ NULL    │ NULL    │
│ javaName    │ VARCHAR     │ YES     │ NULL    │ NULL    │ NULL    │
└─────────────┴─────────────┴─────────┴─────────┴─────────┴─────────┘

This table itself references the ClassLoader struct. But be aware that this ClassLoader struct cannot, in turn, reference classes, because recursion makes the insertion into tables far more complicated.

The Class table comment includes the relationship between the two tables:

D SELECT comment FROM duckdb_tables() where table_name = 'Class';
comment = Column "classLoader": references "ClassLoader"(_id); Column "package": references "Package"(_id)

Sadly, foreign keys didn’t work for me for this purpose. Some references in JFR can be null, so having zero as a reference number models this. The first row of every struct table is a default row for this purpose:

D SELECT * FROM Class LIMIT 1;
┌─────┬─────────────┬──────┬─────────┬───────────┬────────┬──────────┐
│ _id │ classLoader │ name │ package │ modifiers │ hidden │ javaName │
├─────┼─────────────┼──────┼─────────┼───────────┼────────┼──────────┤
│ 0   │ NULL        │ NULL │ NULL    │ 0         │ false  │ NULL     │
└─────┴─────────────┴──────┴─────────┴───────────┴────────┴──────────┘

Now on to stack traces, which are specially handled.

Stack Traces

Stack traces are modelled in JFR as structs with arrays:

The problem is that supporting arrays in general in the database is hard, as the JDBC driver for DuckDB lacks full support. So the converter only supports arrays for stack traces and handles both stack traces and stack frames with custom logic.

All stack trace structs are inlined, as seen, for example, with the ExecutionSample event:

D SHOW ExecutionSample;
┌─────────────────────────────────┬──────────────┬─────────┬─────────┬─────────┬─────────┐
│           column_name           │ column_type  │  null   │   key   │ default │  extra  │
│             varchar             │   varchar    │ varchar │ varchar │ varchar │ varchar │
├─────────────────────────────────┼──────────────┼─────────┼─────────┼─────────┼─────────┤
│ startTime                       │ TIMESTAMP    │ YES     │ NULL    │ NULL    │ NULL    │
│ sampledThread                   │ UINTEGER     │ YES     │ NULL    │ NULL    │ NULL    │
│ stackTrace$topMethod            │ UINTEGER     │ YES     │ NULL    │ NULL    │ NULL    │
│ stackTrace$topApplicationMethod │ UINTEGER     │ YES     │ NULL    │ NULL    │ NULL    │
│ stackTrace$topNonInitMethod     │ UINTEGER     │ YES     │ NULL    │ NULL    │ NULL    │
│ stackTrace$length               │ SMALLINT     │ YES     │ NULL    │ NULL    │ NULL    │
│ stackTrace$truncated            │ BOOLEAN      │ YES     │ NULL    │ NULL    │ NULL    │
│ stackTrace$methods              │ UINTEGER[10] │ YES     │ NULL    │ NULL    │ NULL    │
│ state                           │ VARCHAR      │ YES     │ NULL    │ NULL    │ NULL    │
└─────────────────────────────────┴──────────────┴─────────┴─────────┴─────────┴─────────┘

I made the conscious decision, for now, to only include the method property per frame, which saves a significant amount of memory. But maybe I’ll add the option to include the line number and type if the need arises.

You might notice I capped the stack trace depth at 10 by default to save memory. This is adjustable. The additional $topMethod, $topApplicationMethod, and $topNonInitMethod are just convenience columns that are often used in JFR views.

Of course, I could have normalized the SQL table by replacing the array with ten columns, but this clutters the table view and doesn’t offer space benefits.

Now on to possible front-ends:

Front-End

Currently, there is no custom front-end for the DuckDBified JFR files, but there are several existing options. One is the Grafana DuckDB source, which should allow you to view the profiling and monitoring data using Grafana dashboards.

Furthermore, there is a built-in DuckDB UI (via duckdb -ui):

This is a simple web-based UI that allows you to run queries. I used it a lot while developing the current tool. Unfortunately, it doesn’t allow you to create plots.

Name

This project doesn’t have a proper name yet, so I’m open to suggestions. It could be “JFRDuck” or “JFRQuery,” but I’m unsure.

Tool

You can either build the tool via Maven, download the latest release from GitHub, or directly call it using jbang (jbang jfr-query@parttimenerd/jfr-query). Don’t be afraid of its size of almost 80MB, which is primarily due to the embedded DuckDB database driver.

The tool currently has essentially five commands:

> java -jar target/query.jar 
Usage: query.jar [-hV] [COMMAND]
Querying JFR recordings with DuckDB
  -h, --help      Show this help message and exit.
  -V, --version   Print version information and exit.
Commands:
  import   Import a JFR recording into a DuckDB database
  query    Execute a SQL query or view on the JFR DuckDB database and print the
             results.
  macros   List available SQL macros (views) for JFR analysis.
  views    List available SQL views for JFR analysis.
  context  Create a description of the tables, macros and views for generating
             SQL queries using AI
  help     Display help information about the specified command.

The two most common use cases are the transformation of a JFR file into a database:

> java -jar target/query.jar duckdb import jfr_files/default.jfr duckdb.db

And querying via the query command, as shown at the beginning of this blog post. The idea is to have a self-contained tool that contains every dependency.

Conclusion

In this blog post, I showed my newest work on querying JFR files using DuckDB. In my opinion, this is the way forward and a building block for future JFR-related tooling. The DuckDB database allows you to easily analyze JFR data using SQL, without me having to implement a custom query engine or extend the existing jfr tool.

Thanks for reading this far. I’ll see you in a week or two for the next part of this series, most probably a short blog post on how AI can help generate SQL queries.

This blog post is part of my work in the SapMachine team at SAP, making profiling easier for everyone.

P.S.: Enjoy the autumn…

The post Making JFR Quack: Importing JFR files into DuckDB appeared first on Mostly nerdless.

Mostly nerdless

Calling jcmd Commands Programmatically

On HotSpot Error Files and Useful Tools

Header and Summary

Thread Section

Process Section

Events SubSection

Dynamic Libraries Subsection

JVM SubSection

Syntax Highlighting

Redaction

Conclusion

Java 26 is boring, and that’s a good thing

TL;DR: Java 26 is usefully boring

Java is Boring by Design

Even old Java looks Modern

How to upgrade

What Java 26 changes

G1 GC: Improve Throughput by Reducing Synchronization (JEP 522)

Ahead-of-Time Object Caching with Any GC (JEP 516)

Prepare to Make Final Mean Final (JEP 500)

HTTP/3 in the standard HTTP client (JEP 517)

Why HTTP/3 matters today?

What does it mean that HTTP/3 lives in JDK 26?

Do you need HTTP/3?

The end of Java Applets (JEP 504)

Preview and incubating features

The short version

Who these features are really for

You don’t need these yet

When boring is a liability

Conclusion

Writing a tiny JSON Parser

Usage

JSON Grammar

Transforming the Grammar

Implementing the Rules

Conclusion

Redacting Data from Heap Dumps via hprof-redact

Heap Dumps

Why do we need to redact?

Using hprof-redact

Implementing your own redaction

Conclusion

Femtocli: A small but mighty CLI library for small CLI tools in < 45KB

Should you use it?

Agent Mode

Femtocli’s Features

Usage

Subcommands as Methods

Positional Parameters

Mixins

Spec Injection

Custom Type Converters

Enum Support

Custom Header, Footer, and Synopsis

Global Configuration

Conclusion

Redacting Sensitive Data from Java Flight Recorder Files

Foundations

jfr scrub subcommand

Using Basic-JFR-Processor to Build a Simple Redactor

JFR-Redact

Text Redaction

Words Mode

Concat Mode

Usage as a Library

Conclusion

Implement a new JStall Feature with Me

Or: How I use GitHub Copilot to go from feature to idea

The Java Version Quiz

Reproducing a Tricky Bug in Minutes With a Custom Linux Scheduler Written in Java

Running the Test Case Normally

Running the Test Case with the Chaotic Scheduler

Running the Fixed Test Case

The Bug

Why does it not always fail?

Conclusion

Reading and Writing JFR Files Programmatically

Reading JFR Files using Java’s API

`jfr scrub` subcommand