Playbook

This document helps you get started with profiling test suites. It answers questions such as: which profiles should you run first, and how do you interpret the results to choose the next steps?

NOTE: This document assumes you're working with a Ruby on Rails application and the RSpec testing framework. The ideas can easily be translated to other frameworks.

📼 Also check out the "From slow to go" RailsConf 2024 workshop recording to see this playbook in action.

Step 0. Configuration basics

Low-hanging configuration fruit:

Disable logging* in tests—it's useless. If you really need it, use our logging utils.

config.logger = ActiveSupport::TaggedLogging.new(Logger.new(nil))
config.log_level = :fatal

Disable coverage and built-in profiling by default. Use an environment variable to enable them (e.g., COVERAGE=true).

* Modern SSDs make the overhead of file-based logging almost negligible. Still, we recommend disabling logging to make sure tests are not affected in any environment (e.g., Docker on macOS).
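
The coverage switch described above could look roughly like this. It is a minimal sketch that assumes SimpleCov is your coverage tool (the playbook does not name one); `coverage_enabled?` is a hypothetical helper, not part of any library.

```ruby
# spec/spec_helper.rb (sketch): load coverage only on demand.
# `coverage_enabled?` is a hypothetical helper, not a library API.
def coverage_enabled?(env = ENV)
  %w[1 true yes].include?(env.fetch("COVERAGE", "").downcase)
end

if coverage_enabled?
  # Assumption: the simplecov gem is in your Gemfile.
  require "simplecov"
  SimpleCov.start
end
```

With this in place, a plain `bin/rspec` run skips coverage entirely, and `COVERAGE=true bin/rspec` turns it on when you need a report.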

Step 1. General profiling

General profiling helps to identify the not-so-low-hanging fruit. We recommend using StackProf or Vernier, so install one of them first (if you haven't already):

bundle add stackprof
# or
bundle add vernier

Configure TestProf to generate JSON profiles by default:

TestProf::StackProf.configure do |config|
  config.format = "json"
end

We recommend using speedscope to analyze these profiles.

Step 1.1. Application boot profiling

TEST_STACK_PROF=boot rspec ./spec/some_spec.rb

NOTE: running a single spec/test is enough for this profiling.

What to look for? Some examples:

Bootsnap is not used or is not configured to cache everything (e.g., YAML files)
Slow Rails initializers that are not needed in tests. Vernier's Rails hooks feature is especially useful in analyzing Rails initializers.

Step 1.2. Sampling tests profiling

The idea is to run a random subset of tests multiple times to reveal some application-wide problems. You must enable the sampling feature first:

# For RSpec in your spec_helper.rb
require "test_prof/recipes/rspec/sample"

# For Minitest in your test_helper.rb
require "test_prof/recipes/minitest/sample"

Then run the suite multiple times and analyze the resulting flamegraphs:

SAMPLE=100 bin/rails test
# or
SAMPLE=100 bin/rspec

Common findings:

Encryption calls (*crypt*-whatever): relax the settings in the test env
Log calls: are you sure you disabled logs?
Databases: maybe there are some low-hanging fruits (like using DatabaseCleaner truncation for every test instead of transactions)
Network: should not be there for unit tests, inevitable for browser tests; use WebMock to disable HTTP calls completely.
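
The fixes for the encryption and network findings above are often one-liners. Here is a minimal sketch, assuming your app uses the bcrypt and WebMock gems; other stacks will need their own equivalents. The `defined?` guards turn each setting into a no-op when the gem is not loaded, so place the snippet after your gems have been required.

```ruby
# spec/rails_helper.rb (sketch): cheap test-only settings.
# Assumption: bcrypt and WebMock are the gems in use; adapt to your stack.

# bcrypt: use the lowest allowed cost factor so password hashing is fast.
if defined?(BCrypt::Engine)
  BCrypt::Engine.cost = BCrypt::Engine::MIN_COST
end

# WebMock: forbid real HTTP requests, but keep localhost reachable,
# since browser tests talk to a locally running app server.
if defined?(WebMock)
  WebMock.disable_net_connect!(allow_localhost: true)
end
```

After applying such changes, re-run the sampled profile to confirm that the corresponding frames have shrunk or disappeared from the flamegraph.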

Step 2. Narrow down the scope

This is an important step for large codebases. We must prioritize quick fixes that bring the most value (time reduction) over dealing with complex, slow tests individually (even if they're the slowest ones). For that, we first identify the types of tests contributing the most to the overall run time.

We use TagProf for that:

TAG_PROF=type TAG_PROF_FORMAT=html TAG_PROF_EVENT=sql.active_record,factory.create bin/rspec

Looking at the generated diagram, you can identify the two most time-consuming test types (models and/or controllers are usually among them).

We assume that it's easier to find a common cause of slowness for the whole group and fix it than to deal with individual tests. Given that assumption, we continue the process only within the selected group (let's say, models).

Step 3. Specialized profiling

Within the selected group, we can first perform quick event-based profiling via EventProf (possibly with sampling enabled as well).

Step 3.1. Dependencies configuration

At this point, we may identify some misconfigured or misused dependencies/gems. Common examples:

Inlined Sidekiq jobs:

EVENT_PROF=sidekiq.inline bin/rspec spec/models

Wisper broadcasts (patch required):

EVENT_PROF=wisper.publisher.broadcast bin/rspec spec/models

PaperTrail logs creation:

Enable custom profiling:

TestProf::EventProf.monitor(PaperTrail::RecordTrail, "paper_trail.record", :record_create)
TestProf::EventProf.monitor(PaperTrail::RecordTrail, "paper_trail.record", :record_destroy)
TestProf::EventProf.monitor(PaperTrail::RecordTrail, "paper_trail.record", :record_update)

Run tests:

EVENT_PROF=paper_trail.record bin/rspec spec/models

See the Sidekiq example on how to quickly fix such problems using RSpecStamp.
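
To make it clearer what "monitoring" a method means, here is a rough, stdlib-only illustration of the idea: wrap a method with `Module#prepend` and accumulate the time spent in it under an event name. This is not TestProf's implementation; `ToyMonitor`, `TOTALS`, and `FakeRecordTrail` are hypothetical names used only for this sketch.

```ruby
# Toy illustration only: NOT how TestProf implements EventProf.monitor.
module ToyMonitor
  TOTALS = Hash.new(0.0) # event name => total seconds spent

  # Wraps klass#method_name and adds its run time to TOTALS[event].
  def self.monitor(klass, event, method_name)
    wrapper = Module.new do
      define_method(method_name) do |*args, **kwargs, &block|
        started = Process.clock_gettime(Process::CLOCK_MONOTONIC)
        begin
          super(*args, **kwargs, &block)
        ensure
          TOTALS[event] += Process.clock_gettime(Process::CLOCK_MONOTONIC) - started
        end
      end
    end
    klass.prepend(wrapper)
  end
end

# Hypothetical stand-in for a class such as PaperTrail::RecordTrail.
class FakeRecordTrail
  def record_create
    sleep 0.01 # pretend to write a version row
    :created
  end
end

ToyMonitor.monitor(FakeRecordTrail, "fake_trail.record", :record_create)
FakeRecordTrail.new.record_create
puts format("fake_trail.record: %.3fs", ToyMonitor::TOTALS["fake_trail.record"])
```

The wrapped method keeps its return value and arguments; only the timing is added. A profiler built on this idea can then rank test files by the time accumulated per event.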

Step 3.2. Data generation

Identify the slowest tests based on the amount of time spent in the database or factories (if any):

# Database interactions
EVENT_PROF=sql.active_record bin/rspec spec/models

# Factories
EVENT_PROF=factory.create bin/rspec spec/models

Now, we can narrow our scope further to the top 10 files from the generated reports. If you use factories, use the factory.create report.

TIP: In RSpec, you can mark the slowest examples with a custom tag automatically using the following command:

EVENT_PROF=factory.create EVENT_PROF_STAMP=slow:factory bin/rspec spec/models

Step 4. Factories usage

Identify the most used factories among the slow:factory tests:

FPROF=1 bin/rspec --tag slow:factory

If you see some factories used many more times than the total number of examples, you are dealing with factory cascades.
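
To make the notion of a cascade concrete, here is a toy model in plain Ruby (no FactoryBot involved). The schema is hypothetical: a comment belongs to a post and a user, a post belongs to a user, and a user belongs to an account. Each `create` call also creates every association, the way factories with nested associations do by default.

```ruby
# Toy model of a factory cascade; the schema below is made up for illustration.
ASSOCIATIONS = {
  account: [],
  user: [:account],
  post: [:user],
  comment: [:post, :user]
}.freeze

# Creates a record of the given kind plus everything it depends on,
# counting every record created along the way.
def create(kind, counts)
  ASSOCIATIONS.fetch(kind).each { |dep| create(dep, counts) }
  counts[kind] += 1
  counts
end

counts = create(:comment, Hash.new(0))
puts "records created: #{counts.values.sum}"
puts "users created:   #{counts[:user]}"
```

In this model, a single `create(:comment)` produces six records, including two users and two accounts. Across a whole suite, this is how a factory ends up with far more invocations than there are examples.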

Visualize the cascades:

FPROF=flamegraph bin/rspec --tag slow:factory

The visualization should help to identify the factories to be fixed. You can find possible solutions in this post.

Step 4.1. Factory defaults

One option to fix cascades produced by model associations is to use factory defaults. To estimate the potential impact and identify factories to apply this pattern to, run the following profiler:

FACTORY_DEFAULT_PROF=1 bin/rspec --tag slow:factory

Try adding create_default and measure the impact:

FACTORY_DEFAULT_SUMMARY=1 bin/rspec --tag slow:factory

# More hits — better
FactoryDefault summary: hit=11 miss=3

Step 4.2. Factory fixtures

Going back to the FPROF=1 report, check whether some records are created for every example (typically, user, account, team). Consider replacing them with fixtures using AnyFixture.

Step 5. Reusable setup

It's common to have the same setup shared across multiple examples. You can measure the time spent in let / before compared to the actual example time using RSpecDissect:

RD_PROF=1 bin/rspec

Take a look at the slowest groups and try to replace let/let! with let_it_be and before with before_all.
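
The potential saving is easy to estimate with a toy model that counts how often an expensive setup block runs. This is a sketch of the arithmetic only; `run_group` is a hypothetical helper, not an RSpec or TestProf API.

```ruby
# Toy model: count how often an expensive setup block runs for one group.
# `run_group` is a hypothetical helper used only for this illustration.
def run_group(examples:, shared_setup:)
  setup_calls = 0
  setup = -> { setup_calls += 1; :record }

  record = setup.call if shared_setup # before_all / let_it_be style
  examples.times do
    current = shared_setup ? record : setup.call # before / let style
    raise "setup missing" unless current == :record
  end
  setup_calls
end

puts run_group(examples: 50, shared_setup: false) # prints 50
puts run_group(examples: 50, shared_setup: true)  # prints 1
```

The larger the group and the more expensive the setup, the bigger the gain, which is why the slowest groups in the RSpecDissect report are the best candidates.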

IMPORTANT: Knapsack Pro users must be aware that per-example balancing eliminates the positive effect of using let_it_be / before_all. You must switch to per-file balancing while at the same time keeping your files small—that's how you can maximize the effect of using Test Prof optimizations.

Conclusion

After applying the steps above to a given group of tests, you should develop the patterns and techniques optimized for your codebase. Then, all you need is to extrapolate them to other groups. Good luck!
