Loading...
Loading...
Borys Ulanenko
CEO of ArmsLength AI

Get the latest transfer pricing insights, AI benchmarking tips, and industry updates delivered straight to your inbox.
Multi-year data can improve comparability analysis by providing context on cycles, trends, and anomalies. When averaging is appropriate, the three main approaches are:
Key caution: Using multiple-year data does not automatically mean using multi-year averages. The OECD and several jurisdictions emphasize that each year should be arm's length, and multi-year data serves primarily for comparability context.
Common practice (especially in TNMM benchmarking): Many practitioners compute 3-year weighted averages for comparable companies (weighted by the PLI base) to smooth volatility, while still evaluating the tested party on a single year (the tested year). If you do this, document why it improves reliability for comparables and why a year-by-year tested party result remains the relevant compliance test in your facts and jurisdiction.
Decision rule (practical):
Single-year data can mislead. A company might have an unusually good or bad year due to factors unrelated to transfer pricing—product launches, one-time expenses, economic shocks, or simple volatility. Multi-year analysis addresses these issues by:
OECD Position (): Examining multiple-year data is often useful for understanding comparability (e.g., business cycles, trends), but is not systematically required. Importantly, OECD notes that using multiple-year data does not necessarily mean using multi-year averages—context and comparability analysis are the primary goals.
The US Treasury Regulations §1.482-1(f)(2)(iii) contemplate—and in some applications ordinarily require—consideration of multiple-year data. Circumstances where multi-year analysis may be appropriate include:
The regulations also indicate that if multiple-year data is used for comparables, data for the tested party for the same years "ordinarily must be considered" (subject to data availability).
A simple average treats all years equally—just add the values and divide by the number of years.
Formula: Simple Average = (Year1 PLI + Year2 PLI + Year3 PLI) ÷ 3
Example: Comparable's operating margins for 2022-2024 are 4%, 6%, 5%
Simple Average = (4% + 6% + 5%) / 3 = 5.0%
When to use:
Limitations:
A weighted average gives more influence to larger years by weighting each year's result by the relevant PLI base—revenue for operating margin, costs for cost-plus, assets for ROA, etc. This is a commonly used technique in transfer pricing practice.
Formula: Weighted Average Margin = Sum of Operating Profits (all years) ÷ Sum of Revenues (all years)
Equivalently: Weighted Avg = Σ(Margin × Revenue) ÷ Σ(Revenue)
Example: Same comparable with operating profits and revenues:
| Year | Revenue | OP | Margin |
|---|---|---|---|
| 2022 | €10M | €0.4M | 4.0% |
| 2023 | €15M | €0.9M | 6.0% |
| 2024 | €12M | €0.6M | 5.0% |
Weighted Average = (0.4 + 0.9 + 0.6) / (10 + 15 + 12) = €1.9M / €37M = 5.14%
The 2023 result (highest revenue year) has more influence than the simple average would give it.
India's Approach (Rule 10CA): Under the range concept, comparables may be reflected on a weighted multi-year basis where comparable transactions exist across years. The weights depend on the method/PLI base used—sales for resale-price-type PLIs, costs for cost-plus, etc. This is effectively a weighted average computation using the relevant base.
When to use:
A period-weighted average assigns explicit weights to emphasize certain years—typically giving more weight to recent years when conditions are changing.
Formula: Period-Weighted Average = w₁ × Year1 + w₂ × Year2 + w₃ × Year3
Where weights sum to 100% (e.g., 20%-30%-50% for oldest to newest).
Example: Margins trending upward: 3%, 5%, 7%
With weights 30%-30%-40% (emphasizing 2024): Period-Weighted = (0.30 × 3%) + (0.30 × 5%) + (0.40 × 7%) = 5.2%
Compare to simple average: 5.0%
When to use:
Document carefully. Period-weighting is more subjective than other methods. Tax authorities may challenge arbitrary weights. Common defensible schemes: 30%-30%-40% or 20%-30%-50% for 3-year periods.
| Factor | 3-Year Analysis | 5-Year Analysis |
|---|---|---|
| Recency | More current; reflects recent market conditions | Includes older, potentially outdated data |
| Cycle coverage | May miss full business cycle | Better captures peaks and troughs |
| Data availability | Usually complete for most comparables | Some companies may lack 5-year history |
| Regulatory acceptance | Standard in most jurisdictions | Used for cyclical industries |
| Relevance | Better if business recently changed | Better for stable, mature operations |
Use 3 years when:
Use 5 years when:
OECD Guidance (, ): "It would not be appropriate to set prescriptive guidance as to the number of years to be covered by multiple year analyses." Importantly, OECD also notes that "The use of multiple year data does not necessarily imply the use of multiple year averages"—the data serves comparability context, not automatic averaging.
| Jurisdiction | Multi-Year Approach | Key Rules |
|---|---|---|
| OECD | No fixed requirement; multi-year data ≠ multi-year averaging | Use multiple years where helpful for comparability; no prescriptive number of years () |
| United States | May be appropriate; 3 years common in CPM practice | Same years for tested party and comparables "ordinarily must be considered" (subject to availability) |
| India | Rule 10CA: weighted multi-year under specific conditions | Up to current + prior 2 years; weights depend on method/base; 35th-65th percentile range when ≥6 entries |
| Germany | 3 years common in practice | Follows OECD principles; multi-year data may improve comparability (not a firm rule) |
| United Kingdom | 3 years common in practice | HMRC focuses on year-by-year arm's length results; multi-year data for context |
| Australia | 3-5 years common in practice | Longer periods may be used for cyclical industries (not a firm rule) |
| Canada | Year-by-year in audits; no averaging to substantiate | CRA: multi-year averaging may appear in APA analysis, but still verified annually |
| Switzerland | Year-by-year principle emphasized | Court trend (periodicity): each year must be arm's length; multi-year averaging doesn't excuse individual years |
Canada Alert: CRA's TPM-16 explicitly states that taxpayers "should not average results over multiple years for the purpose of substantiating their transfer prices." In audits, prices/margins must be determined year-by-year. Multi-year averaging may appear in APA analysis, but results are still verified annually. Use multi-year data to select comparables and understand trends—not to justify your pricing.
Short answer: In most audit contexts, you should assume the tested party’s result must be supportable year by year. Multi-year data is typically used to understand comparability (cycles, anomalies, and trends), not to “average away” a non-arm’s-length year.
Practical approach:
Be explicit about the “mix.” If you average comparables over 3 years but test the tested party on a single year, explain why that is still coherent in your facts and jurisdiction (e.g., you’re using multi-year averaging to reduce comparable volatility, while the tested party must still be arm’s length in the tested year). If you cannot articulate that logic, either avoid averaging or apply a consistent treatment.
| Column | Description |
|---|---|
| A | Comparable Name |
| B | Year |
| C | Revenue |
| D | Operating Profit |
| E | Operating Margin (D/C) |
For margins in cells E2 through E4.
Sum of operating profits divided by sum of revenues.
Alternatively, using SUMPRODUCT:
Where E2:E4 contains margins and C2:C4 contains revenues.
For weights in cells G2:G4 (e.g., 0.30, 0.30, 0.40):
Or directly:
To average only non-blank cells:
Excel's AVERAGE function automatically ignores blank cells.
To exclude zeros (if zero isn't a valid result):
Options when a comparable lacks data for one year:
| Approach | When to Use | Risk |
|---|---|---|
| Exclude comparable | Plenty of alternatives; consistency is critical | Reduces sample size |
| Use available years | Comparable is highly relevant; 2 of 3 years available | May distort average |
| Interpolate | Rarely appropriate | Speculation; hard to defend |
Note on India: The "6 data points" threshold in Rule 10CA determines whether the 35th-65th percentile arm's length range applies—it's not a permission rule for using partial data. Whether a comparable can be included with fewer years depends on whether it has comparable transactions across those years.
Best practice: Document your approach. If you include a comparable with partial data, note: "CompX lacked 2022 data; 2-year average used (2021, 2023)." Consider whether excluding the comparable entirely would be more defensible.
Don't automatically exclude—analyze the cause and decide how to use the data.
Multi-year context helps understand whether a loss reflects market conditions or indicates non-comparability. A company with margins of [-2%, 5%, 6%] averages to +3%—but remember that using multi-year data doesn't always require multi-year averaging.
When to investigate:
When including loss years may be appropriate:
OECD (): Examining prior years—including loss years—helps determine whether a tested party's loss "was part of a history of losses" or due to prior economic conditions. The goal is understanding comparability, not necessarily averaging the numbers together.
If a comparable underwent a major change (merger, divestiture, restructuring):
Scenario: European distributor benchmarked against 4 comparables
| Comparable | 2022 OP | 2022 Rev | 2023 OP | 2023 Rev | 2024 OP | 2024 Rev | 3-Yr Weighted Avg |
|---|---|---|---|---|---|---|---|
| CompA | €2.5M | €50M | €3.6M | €60M | €4.3M | €65M | 5.94% |
| CompB | €3.2M | €80M | €3.8M | €85M | €4.1M | €90M | 4.35% |
| CompC | €1.6M | €40M | €2.0M | €45M | €2.5M | €50M | 4.52% |
| CompD | €3.5M | €70M | €1.8M | €72M | €4.2M | €75M | 4.38% |
Calculation (CompA):
IQR (sorted): 4.35%, 4.38%, 4.52%, 5.94%
Scenario: Semiconductor manufacturer with volatile margins
| Year | Margin |
|---|---|
| 2020 | -2% (pandemic slump) |
| 2021 | 5% |
| 2022 | 10% (chip shortage boom) |
| 2023 | 8% |
| 2024 | 6% |
3-Year Simple Average (2022-2024): (10 + 8 + 6) / 3 = 8.0%
5-Year Simple Average (2020-2024): (-2 + 5 + 10 + 8 + 6) / 5 = 5.4%
Analysis: The 3-year average captures the recent "boom" period but misses the cycle trough. The 5-year average includes the full cycle, providing a more realistic long-term expectation.
Recommendation: For cyclical industries, the 5-year average better represents sustainable profitability—unless you can document why current conditions warrant the shorter view.
Scenario: Company restructured in early 2022; margins are trending upward
| Year | Margin | Weight | Contribution |
|---|---|---|---|
| 2022 | 2% | 20% | 0.4% |
| 2023 | 4% | 30% | 1.2% |
| 2024 | 6% | 50% | 3.0% |
Period-Weighted Average: 0.4 + 1.2 + 3.0 = 4.6%
Simple Average: (2 + 4 + 6) / 3 = 4.0%
Justification: "Given the 2022 restructuring, earlier results are less representative of current operations. We applied period-weighting (20%-30%-50%) to reflect improved post-restructuring performance while maintaining multi-year perspective."
Guides:
Glossary:
Start with 3 years as your default—it's the most common period and usually sufficient. Extend to 5 years if the industry is highly cyclical, you need to capture a full economic cycle, or you're smoothing a significant anomaly (like COVID-19 impacts). The OECD explicitly avoids prescribing a number—use whatever period genuinely improves your analysis and document why.
A simple average treats all years equally (just average the percentages). A weighted average gives more influence to larger years by weighting each year's result by the relevant PLI base (revenue for margin, costs for cost-plus, etc.). For a comparable that grew significantly, weighted averaging better reflects its overall profitability. The weights should match your PLI denominator to maintain consistency.
No. Consistency is critical. Use the same analysis period for all comparables and the tested party. Mixing periods (e.g., 3-year for some comparables, 5-year for others) creates an incoherent analysis and will draw audit scrutiny. The IRS regulations explicitly require using the same years for the tested party and comparables.
Options: (1) Exclude the comparable if you have sufficient alternatives—this is often the cleanest approach; (2) Use available years if the comparable is otherwise highly relevant—calculate its average from available data and document carefully; (3) Never interpolate or estimate the missing year. Note that India's "6 data points" threshold relates to which range calculation method applies (35th-65th percentile), not a blanket permission for partial data.
Don't automatically exclude—analyze the cause. If multiple comparables had losses in the same year, that likely reflects market conditions and is relevant comparability data. However, the OECD emphasizes that using multiple-year data doesn't necessarily mean averaging the numbers together—the goal is understanding comparability. Exclude only if the loss indicates deeper non-comparability (e.g., company going bankrupt, unique one-time event). Some jurisdictions (like CRA) explicitly note that OECD doesn't promote averaging multiple years of numerical data to establish comparability.
In audits, no. CRA's TPM-16 explicitly states that taxpayers "should not average results over multiple years" for substantiating transfer prices—prices/margins must be determined year-by-year. However, CRA notes that in an APA context, averaging historical outcomes may form part of the analysis—though results are still verified annually. You can use multi-year data to select comparables and understand trends, but don't use averaging to justify your pricing in an audit context.
Include in your benchmarking study: (1) Period selected and rationale (e.g., "3 years to capture post-COVID recovery"); (2) Averaging method used (simple, weighted, or period-weighted) and why; (3) Edge cases handled (missing data, losses, discontinuities); (4) Jurisdictional compliance confirmation that your approach meets local requirements. Be prepared to explain why your method improves the analysis.
The OECD Transfer Pricing Guidelines provide guidance on multi-year data in comparability analysis: