Any time something has to be assessed or compared, metrics are needed. They are quantifiable measures that are used to judge progress in every industry, including software development, where dev leaders rely on software metrics to track performance and production.
In our blog post on How to Measure and Improve Developer Productivity, we discussed the role of code metrics in measuring and improving developer productivity. One of the challenges is selecting the right metrics for the right job at the right time, and this article will help you do exactly that.
What Are Software Metrics and Why Are They Important?
The job of any manager is to create value for their company, and in software, dev leaders are in charge of building and improving products, managing time, and meeting their budget. These responsibilities all rely on the ability to assess the current state of a team or project, measure progress, and estimate the time and resources required to reach the next milestone.
Dev leaders rely on software metrics for quality assurance and to help strengthen collaboration between team members. Effective software metrics go hand-in-hand with proactive management, helping dev leaders to meet release dates and keep expenses under control. Software metrics assist in the identification of problems in current and pre-release builds, and they can be used to track and prioritize issues accordingly. Importantly, the right software metrics pave the way for more accurate forecasting and help to consider the impact of decisions made during the development lifecycle.
How Are Software Metrics Lacking?
Software metrics are tricky because measuring software quality is multifaceted and subjective, and what is important can change from project to project. The priorities of one team or organization will differ from those of another, which means there is no one-size-fits-all software metric.
How To Choose the Right Metrics at the Right Time
Myriad software metrics exist and choosing the right ones depends on exactly what it is that you need to track. A common problem is that metrics are not always linked to the goals of the project. If memory optimization is important, for example, then reducing the number of lines of code (LOC) might be pertinent. However, if the goal is to minimize errors then a heavier weight should be applied to testing metrics such as the number of bugs reported.
The stage of a project will also dictate which software metrics should be given more consideration. Early in a project, the number of Git commits will yield valuable information about how quickly modules are being added. Later, dev leaders will be more interested in the mean time between failures (MTBF), or the application crash rate (ACR).
Evaluating and Tracking Metrics
One should remember that a valuable metric is more than just a number. A single-point measurement at any given time is of little use without the trend behind it; metrics need to present the bigger picture. From day to day, there may be little variation in a specific metric, yet over time, the trend will highlight gradual success or failure. If your metrics aren't highlighting trends, they need to be rethought.
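To illustrate, a simple moving average can surface the trend hiding in day-to-day noise. This is a minimal sketch using hypothetical daily bug counts:

```python
from statistics import mean

def moving_average(values, window=7):
    """Smooth a noisy daily metric so the underlying trend is visible."""
    return [mean(values[i:i + window]) for i in range(len(values) - window + 1)]

# Hypothetical daily open-bug counts: noisy day to day, trending downward.
daily_bugs = [12, 14, 11, 13, 12, 10, 11, 9, 10, 8, 9, 7, 8, 6]
trend = moving_average(daily_bugs, window=7)
print(trend[0] > trend[-1])  # → True: the smoothed series reveals the decline
```

Any single day's figure is ambiguous, but the smoothed series makes the direction of travel obvious.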
Another important point is that metrics have to suggest and promote changes. Metrics that do not lead to significant changes are a waste of time because, without change, developers will continue to make the same mistakes. Again, metrics need to be more than just a number; they need to further your goals. If a metric has not led to changes in the codebase or the process, it is best replaced with something more meaningful.
Types of Software Metrics
Software metrics fall into several different categories, each of which may be more or less useful to a project depending on the team, environment, and development phase. Different metrics are applicable with varying levels of granularity, where one is relevant to individual developers but others consider the performance of the team. Naturally, some apply to both, although individuals and teams should be rated separately.
Code metrics

Code metrics are measures that indicate specifics about a codebase. These might be size-oriented metrics such as the total number of lines of code or the ratio of logic to comments, or content-related, like an indicator of code complexity. In general, code metrics are only somewhat helpful, especially when used in isolation, as they do not fully consider the context and thus cannot tell the full story.
Size-oriented metrics

A size-oriented metric is a type of code metric, generally used to compare same-language projects in terms of thousands of lines of code (KLOC). KLOC metrics are not intended to measure the size of a project. Rather, they are statistical measures that indicate the relative number of errors, the relative number of defects, and the relative cost per 1,000 lines of code, using this common baseline regardless of project size. Due to the inherent differences between programming languages, size-oriented metrics are not useful for comparing projects that are developed using different languages.
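As a sketch of how this baseline works, per-KLOC normalization lets projects of different sizes be compared on equal footing. The project sizes and defect counts below are hypothetical:

```python
def per_kloc(count, total_loc):
    """Normalize a raw count (errors, defects, cost) per 1,000 lines of code."""
    return count / (total_loc / 1000)

# Two hypothetical same-language projects of different sizes:
small = per_kloc(30, 12_000)    # 30 defects in 12 KLOC -> 2.5 defects/KLOC
large = per_kloc(150, 90_000)   # 150 defects in 90 KLOC -> ~1.67 defects/KLOC
# Despite reporting more defects overall, the larger project is cleaner per KLOC.
```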
Coding methodologies vary between companies and projects. Although the goals are often common, the approach, and more specifically the journey, differs for each. When it comes to KPIs, different methodologies have different priorities. Two popular methodologies are Agile and Waterfall.
Agile process metrics
These metrics are specific to a development team following the Agile process methodology. They measure how effective teams are at releasing shippable software. For example, Sprint Burndown is used to track the completion of work during a sprint and shows how much work remains. Velocity is another Agile metric and it describes how much work a team can complete during a sprint, measured in either hours or story points.
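Both metrics reduce to simple arithmetic over sprint data. This is a rough sketch; the story-point figures are hypothetical:

```python
from statistics import mean

def velocity(completed_points_per_sprint):
    """Average story points a team completes per sprint."""
    return mean(completed_points_per_sprint)

def sprint_burndown(total_points, completed_per_day):
    """Remaining work at the end of each day of a sprint."""
    remaining, burndown = total_points, []
    for done in completed_per_day:
        remaining -= done
        burndown.append(remaining)
    return burndown

print(velocity([21, 25, 23]))                  # → 23 points per sprint
print(sprint_burndown(40, [8, 5, 10, 7, 10]))  # → [32, 27, 17, 10, 0]
```

A burndown that fails to reach zero, or a velocity that swings wildly between sprints, is a prompt to revisit how work is being estimated.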
Waterfall methodology metrics
The Waterfall methodology, which is more fixed than Agile, is often used when a project needs a high degree of reliability or when its requirements are unambiguous. With a fixed timeline that does not include frequent feedback, the metrics are somewhat different. For example, the number of bugs discovered in the implementation phase is important because it sometimes necessitates returning to the design phase. If this happens too often, it might be a sign that the pre-implementation review is being done too hastily.
Productivity metrics

Measures of productivity can be used to determine how much work has been done on a project, or by a team. This correlates highly with efficiency and the speed at which work can be completed, essentially highlighting where a team excels and where they need to improve. When selecting productivity measures, be sure to choose ones that are not only relevant but also represent the actual goals of the project.
Productivity is indeed something that should be maximized in any project. As we discuss in our blog post on Productivity Tools for C++ Developers, there are a variety of tools to assist with boosting productivity for both individuals and teams.
Security metrics

A security metric is a measure that shows how susceptible or resistant software is to security incidents. Software vulnerabilities can be very costly and lead to problems with governmental compliance, so this type of metric is indispensable for certain products. Examples of security metrics are the average time required to resolve a vulnerability and the number of vulnerabilities identified by automatic static code scanning. Remember that certain security metrics, such as those that are compliance-related, will be more relevant to executive management than the number of vulnerabilities detected by a code analyzer; it is important to cater to all of the relevant stakeholders.
Operational metrics

Operational metrics help to assess how well the software is running in a production environment, including a product's annual uptime or ratio of uptime versus downtime. These are important because they speak to quality assurance, product reliability, and whether enough resources are dedicated to maintenance and support. These metrics are not as useful for development teams working on new products.
Product metrics

A product metric indicates how well a product is doing in the market. These measures are not solely in the domain of software but still apply. They allow you to track things such as how well your product meets the company's objectives. Examples of these are user adoption and customer retention. These metrics aim to answer questions for management and planners, as opposed to developers.
Quality assurance metrics

Quality assurance is a catch-all term that includes failure metrics, as well as other maintenance-related measures. This includes details like the average time between failures and how long it normally takes to fix them. This can provide insight into the amount of uptime versus downtime, and how much of the downtime can be attributed to maintenance.
Test metrics

There are a variety of test metrics that developers and product testers use before any release moves to a production environment. These measures help to provide information about how well tested a system is, which is related to QA. These metrics, however, are not intended for management, which is how they differ from more general QA metrics. A QA metric is used by management to judge the quality of a release, and a test metric is intended solely to assist developers at the pre-release stage.
Specific Metrics – A Closer Look
Now that you have an overview, let’s have a look at some specific software metrics that you might choose, depending on your project and processes.
Lead time

This reflects the length of time it takes to develop a new feature or module, from definition to delivery. It tends to show how responsive the development group is to requests from stakeholders. Even if a team is unwilling to provide an approximate time to deliver, one can be estimated by looking at previous products with a similar feature set.
Cycle time

This metric refers to the time between a change request and its shippable or production release. This includes the time to open an issue, time to find and review the problem, time to approve the work, the time required to complete the changes, and finally the time to deploy. This is an important metric because it weighs time to value against efficiency.
Deployment frequency

The deployment frequency refers to the number of releases per day. This metric indicates the level of value that is being delivered to the customers. It is important to consider because a development pipeline can be efficient, with a low cycle time, while a low deployment frequency still means that not enough value is being delivered.
Velocity

A team's velocity provides insight into how much work a team completes during an Agile sprint or a release iteration. While it is useful for gauging progress within a team over time, it should not be used to compare teams because the nature and complexity of each team's deliverables may not be fairly comparable.
Open/close rates

The open/close rate compares the number of production issues reported with the number resolved within a set period. As this rate improves over time, it shows that the team is becoming more efficient at fixing problems.
Efficiency / Productivity
Efficiency generally refers to how much of a developer’s code is in production, measured in terms of percentage rather than lines of code. A high efficiency correlates with providing value for a longer time, whereas low efficiency might indicate many false starts on an innovative feature that is difficult to implement. The opposite of efficiency is code churn, which indicates the level of non-productive coding.
Number of active days

The number of active days is related to a programmer's productivity. An active day represents a coding day worked by a single developer on a single project. This tracks only programming and does not include administrative work. In fact, administrative tasks such as meetings take away from coding time, which is what this metric actually measures. Essentially, tracking the number of active days puts a spotlight on the cost of interruptions.
Impact

This metric is a subjective measure that indicates the degree of change to a project after code has been added, deleted, or modified. The idea is that changes with a heftier impact are more difficult to implement, suggesting a larger undertaking or perhaps a greater cognitive effort. For example, the addition of a novel and complex feature will have a greater impact than changing the text in a set of output statements, even if there are many more lines of modified code.
Code churn

Code churn is a Git-based metric that provides insights into individuals and teams alike. It represents how much of a developer's work is modified or deleted over a short period and is normally presented as the number of lines of code that have changed over the specified time. High code churn can mean that a developer is unsure of what to do, has trouble with the implementation, or even that they do not have anything else to work on. From a management or team perspective, it could indicate that the module or feature in question was not properly defined or was prematurely added.
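One way to derive churn from Git is to total the added and deleted lines reported by `git log --numstat`. The sketch below parses a hard-coded sample of that output rather than invoking Git, so the file names and line counts are purely illustrative:

```python
def churn_from_numstat(numstat_output):
    """Sum lines added and deleted from `git log --numstat`-style output."""
    added = deleted = 0
    for line in numstat_output.strip().splitlines():
        parts = line.split("\t")
        # Each numstat line is "added<TAB>deleted<TAB>path"; binary files show "-".
        if len(parts) == 3 and parts[0].isdigit() and parts[1].isdigit():
            added += int(parts[0])
            deleted += int(parts[1])
    return added + deleted  # total changed lines over the period

# Sample of what `git log --since="2 weeks ago" --numstat --format=` might emit:
sample = "120\t45\tsrc/parser.cpp\n10\t200\tsrc/old_api.cpp\n"
print(churn_from_numstat(sample))  # → 375 changed lines
```

In practice you would feed this the real command output, scoped to an author or a path, and track the total per week to spot unusual spikes.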
Mean time between failures (MTBF)
This QA metric represents the average time between failures, defining the reliability of the system. Failures are bound to occur but it is best if they are few and far between. Ideally, when a failure does occur, the time it takes to recover is relatively short, but regardless, this metric can assist when it comes to scheduling preventative maintenance.
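MTBF itself is straightforward to compute: total operational time divided by the number of failures. The figures below are hypothetical:

```python
def mtbf(total_operational_hours, failure_count):
    """Mean time between failures: operating time divided by failure count."""
    return total_operational_hours / failure_count

# Hypothetical service: 4,000 hours of operation with 5 recorded failures.
print(mtbf(4000, 5))  # → 800.0 hours between failures, on average
```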
Mean time to recover/repair (MTTR)
Even highly reliable systems fail and when they do, customers want to minimize any downtime that occurs as a result. For this, the average time to recover from a failure must be kept as low as possible. Of course, the severity of failures will differ, as will the individuals making the necessary changes, all adding noise to the metric. However, over time, the MTTR will act as a reliable estimate when predicting how long the client will have to wait before operations return to normal.
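A minimal sketch of MTTR, averaging hypothetical per-incident downtimes:

```python
from statistics import mean

def mttr(repair_durations_hours):
    """Mean time to recover: average downtime per incident."""
    return mean(repair_durations_hours)

# Incident downtimes vary in severity, adding noise; the mean smooths it out.
print(mttr([0.5, 2.0, 1.0, 4.5]))  # → 2.0 hours
```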
Application crash rate (ACR)
An application's crash rate is similar to the MTBF but refers to the ratio of how often it fails to how often it is used. MTBF is different because it is a measure of time.
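As a sketch, the crash rate is a simple ratio of crashes to sessions; both counts below are hypothetical:

```python
def crash_rate(crash_count, session_count):
    """Application crash rate: failures relative to how often the app is used."""
    return crash_count / session_count

# 12 crashes across 10,000 sessions -> a 0.12% crash rate.
print(f"{crash_rate(12, 10_000):.2%}")  # → 0.12%
```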
Endpoint incidents

An endpoint incident is a security-related issue, indicating how many devices have been affected by malware over a specified period. This could be the result of a vulnerability in the software.
Errors per KLOC / Defects per KLOC

These size-oriented metrics express the relative number of errors or defects found per thousand lines of code, providing a common baseline for comparing same-language projects of different sizes.
Cost per KLOC
This measure describes the average cost for one thousand lines of code. It can be used to describe different phases of the project. For example, the cost per KLOC during development will be different from the cost per KLOC during post-release maintenance.
Effort per FP / Defects per FP / Cost per FP
These are function-oriented metrics that depend on one first calculating the Function Point (FP). An FP is a measure that represents business functionality available to the user in a software application and is defined according to the requirements.
As a basic unit of measure, FPs have correlative measures that include the effort per function point (EFP), the number of defects per function point (DFP), and the cost per function point (CFP). A lower EFP translates to better productivity, whereas a lower DFP is representative of a higher-quality product. CFP indicates cost efficiency, and a decreasing CFP means development and maintenance are becoming more cost-effective.
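These correlative measures are simple ratios over a release's function-point total. The release figures below are hypothetical:

```python
def effort_per_fp(person_hours, function_points):
    return person_hours / function_points   # lower -> better productivity

def defects_per_fp(defect_count, function_points):
    return defect_count / function_points   # lower -> higher-quality product

def cost_per_fp(total_cost, function_points):
    return total_cost / function_points     # lower -> more cost-effective

# Hypothetical release: 120 FPs, 960 person-hours, 36 defects, $60,000 cost.
fp = 120
print(effort_per_fp(960, fp))    # → 8.0 hours per FP
print(defects_per_fp(36, fp))    # → 0.3 defects per FP
print(cost_per_fp(60_000, fp))   # → $500.0 per FP
```

Tracking these ratios release over release shows whether productivity, quality, and cost efficiency are moving in the right direction.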
Defect removal efficiency
The defect removal efficiency (DRE) is used to express how many defects were found by end-users, as compared to how many were found during pre-release development and testing. It is calculated by dividing the number of errors found pre-delivery by the total number of errors found both before and after the software goes into production. The more errors that are found by end-users, the lower the number. A perfect score is 1.0, which would indicate that no problems were identified by end-users in production.
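The DRE calculation described above can be sketched as follows; the defect counts are hypothetical:

```python
def defect_removal_efficiency(pre_release_defects, post_release_defects):
    """DRE = defects found before delivery / total defects found overall."""
    total = pre_release_defects + post_release_defects
    return pre_release_defects / total

# 90 defects caught in testing, 10 reported by end-users in production:
print(defect_removal_efficiency(90, 10))  # → 0.9; a perfect score would be 1.0
```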
The Implications of Selecting an Improper Measure
Choosing a poor metric can have consequences that go beyond simply wasting time, especially when considered in isolation. It is important to understand what you are measuring, and the following examples illustrate some potential problems that can occur.
Lines of Code (LOC)
Consider a situation where LOC is the only or primary metric. On the face of it, one might assume that writing more lines of code is better. However, as an experienced manager, you know that this is not always the case. In reality, when LOC is a driving factor then developers tend to write long code that is less elegant, more cumbersome, and perhaps less efficient. Unless it is combined with an efficiency-rewarding metric, LOC is more of a burden than a bonus.
Code coverage

Another metric that is problematic in isolation is code coverage in testing. Code coverage alone, without other quality metrics such as the number of defects found per test, can produce a misleading result. This will happen if the tests are naïve and do not reliably identify bugs. The code coverage may be very high, yielding an impressive metric, but without knowing the results of the testing, it is difficult to evaluate the tests themselves.
Are There Measures That Are More Relevant to a Specific Project Than Others?
Without a doubt, some metrics are more relevant to specific projects than others. Measuring impact, for example, is not relevant for a project in its initial stages. On the other hand, code metrics such as LOC are more valuable during early development than when handling feature requests in a more mature product. Selecting the right measures for your project means matching metrics to the right goals at the right time, and ensuring that they complement each other so that the resulting statistics are valid and ultimately lead to positive changes to the codebase, team, or processes.
Software metrics are the quantitative measures applied to the software development lifecycle that can be used to assess the current state of a project. Over time, trends will appear and dev leaders can show progress, calculate the impact of project decisions, and make reliable estimates about timelines. Metrics are invaluable and indeed critical for advancing any project.
At the same time, having too many metrics, or ones that do not contribute to the goals of the project, is counterproductive. No metric is an exact science. Choose wisely! Then track and evaluate your metrics to ensure that each one still serves a purpose. If a metric doesn't lead to changes, it must be revised or replaced.