WPS6679


Policy Research Working Paper                    6679




  Dynamic Climate Policy with Both Strategic
         and Non-Strategic Agents
                       Taxes Versus Quantities

                                Larry Karp
                              Sauleh Siddiqui
                                Jon Strand




The World Bank
Development Research Group
Environment and Energy Team
October 2013
Policy Research Working Paper 6679


  Abstract
  This paper studies a dynamic game where each of                                    examined under the four combinations of trade policies
  two large blocs, of fossil fuel importers and exporters                            and compared with the corresponding static games
  respectively, sets either taxes or quotas to exercise power                        where climate damages are given (not stock-related).
  in fossil-fuel markets. The main novel feature is the                              The main results are that taxes always dominate quota
  inclusion of a “fringe” of non- strategic (emerging and                            policies for both the strategic importer and exporter
  developing) countries which both consume and produce                               and that “fringe”countries benet from a tax policy as
  fossil fuels. Cumulated emissions over time from global                            compared with a quota policy for the strategic importer,
  fossil fuel consumption create climate damages which                               as the import fuel price then is lower, and the strategic
  are considered by both the strategic importer and the                              importer’s fuel consumption is also lower, thus causing
  non-strategic countries. Markov perfect equilibria are                             fewer climate damages.




  This paper is a product of the Environment and Energy Team, Development Research Group. It is part of a larger effort by
  the World Bank to provide open access to its research and make a contribution to development policy discussions around
  the world. Policy Research Working Papers are also posted on the Web at http://econ.worldbank.org. The authors may be
  contacted at jstrand1@worldbank.org.




          The Policy Research Working Paper Series disseminates the findings of work in progress to encourage the exchange of ideas about development
          issues. An objective of the series is to get the findings out quickly, even if the presentations are less than fully polished. The papers carry the
          names of the authors and should be cited accordingly. The findings, interpretations, and conclusions expressed in this paper are entirely those
          of the authors. They do not necessarily represent the views of the International Bank for Reconstruction and Development/World Bank and
          its affiliated organizations, or those of the Executive Directors of the World Bank or the governments they represent.


                                                        Produced by the Research Support Team
      Dynamic climate policy with both strategic and
       non-strategic agents: Taxes versus quantities
                 Larry Karp              Sauleh Siddiqui               Jon Strand

                                        October 28, 2013


                                              Abstract
          This paper studies a dynamic game where each of two large blocs, of fossil fuel
      importers and exporters respectively, sets either taxes or quotas to exercise power
      in fossil-fuel markets. The main novel feature is the inclusion of a “fringe” of non-
      strategic (emerging and developing) countries which both consume and produce fossil
      fuels. Cumulated emissions over time from global fossil fuel consumption create climate
      damages which are considered by both the strategic importer and the non-strategic
      countries. Markov perfect equilibria are examined under the four combinations of trade
      policies and compared with the corresponding static games where climate damages
      are given (not stock-related). The main results are that taxes always dominate quota
      policies for both the strategic importer and exporter and that “fringe”countries bene…t
      from a tax policy as compared with a quota policy for the strategic importer, as the
                                                                 s fuel consumption is also
      import fuel price then is lower, and the strategic importer’
      lower, thus causing fewer climate damages.
          Keywords: bilateral market power, optimal tax and quota, carbon-based fuel trade,
      dynamic games, commodity markets
          JEL classi…cation numbers : C63; C73; Q41; Q54.
          Sector boards: Environment; Energy

    Karp: Department of Agricultural and Resource Economics, University of California, Berkeley, and the
Ragnar Frisch Center for Economic Research; email: karp@berkeley.edu. Siddiqui: Department of Civil
Engineering and the Johns Hopkins Systems Institute, Johns Hopkins University, Baltimore, MD 21218;
email: siddiqui@jhu.edu. Strand: Development Research Group, Environment and Energy Team, The
World Bank, Washington DC 20433; email:Jstrand1@worldbank.org. We bene…tted from comments by Mike
Toman, Franz Wirl, and seminar participants at the World Bank and the International Energy Workshop in
Paris, June 2013. This research was supported by a grant from the World Bank’s Research Support Budget.
The conclusions and viewpoints presented here are those of the authors and not necessarily of the World
Bank, its management, directors or sta¤.
1    Introduction
A small group of countries account for most exports of fossil fuels, in particular oil on which
this paper focuses. Another main group of countries, which includes most of the OECD, are
fuel importers with only limited own oil production. The latter group of countries has already
established, or may soon be expected to establish, policies for limiting the carbon emissions
resulting from their fossil fuel consumption. The global consumption of fossil fuels increases
stocks of greenhouse gasses (GHGs), likely altering the climate. Climate policy might serve
as a coordination device, enabling a stragetic bloc of importers to a¤ect the price of fossil
fuels, while also controlling carbon emissions. In a game involving strategic importers and
exporters, the equilibrium policy levels depend on the choice of instrument, e.g. a trade tax
or quota. The equilibrium may also be sensitive to the presence of nonstrategic (passive)
countries with …xed trade policies. These nonstrategic countries are “innocent bystanders”in
the game among the strategic countries, but their presence alters the equilibrium to the game.
The strategic exporter in our setting represents OPEC, and the strategic importer represents
a subset of developed countries that might at some point in the future agree on a uni…ed
climate policy. That policy would likely involve trade measures, and would therefore a¤ect
both terms of trade and climate-related outcomes. The nonstrategic “innocent bystander”
represents the poorer countries. We have two research questions: How does the presence of
these countries a¤ect the equilibrium outcomes in the games involving the strategic importer
and exporter, under various combinations of policy instruments? How do the equilibria in
the di¤erent games a¤ect the welfare of the poorer countries?
    In order to address these questions, we study a model in which a monopsonistic importer
and a monopolistic exporter exercise market power, using either a trade tax or quota. There
are four policy combinations, leading to four di¤erent games. We call the nonstrategic agent
“R”  , for Rest-of-World. R’  s trade policy is …xed and exogenous: free trade in our setting.
GHGs related to fossil fuel consumption accumulate in a stock variable that causes di¤ering
levels of damages to both the strategic importer and R; the strategic exporter incurs no
damages. This assumption captures the idea that climate-related damages di¤er across
countries, both in their actual e¤ects, and in the way in which countries take these e¤ects
into consideration when determining their climate-related policies. In particular, we assume
that, while OPEC countries may su¤er damages from adverse climate developments, such
damages are not re‡  ected in the fuel-related policies of these countries.
    In order to understand the forces at work, it helps to begin with a static framework in
which there are no climate damages. In general, a country’      s optimal trade tax equals the


                                              1
inverse of their trading partner’   s (tax or quota inclusive) elasticity of import demand or
export supply. If a large importer and exporter are the sole agents in this market, and both
use a trade tax, the Nash equilibrium taxes are positive. These taxes lower aggregate world
welfare, but do not eliminate trade (Johnson 1953). In contrast, if both of these agents use
quotas, there is zero trade in the Nash equilibrium (Tower 1975). For example, suppose that
the importer takes the exporter’   s quota as given, as in a Nash equilibrium. For any export
quota, the importer has an incentive to set a still lower import quota, in order to render
the export quota non-binding and thereby capture all of the quota rents from the exporter.
The exporter who takes the import quota as given has the same incentive. Because this
incentive holds for any positive quota, the only Nash equilibrium in the quota-setting game
involves zero quotas for both agents, and thus zero trade.
    In a two stage game where the countries choose their policy instrument (either a tax
or a quota) in the …rst stage and the level of that policy in the second stage, countries’
…rst-stage dominant strategy is a tax. A country does not want to select a quota in the
…rst stage, because it understands that if it does so, the second stage equilibrium involves
zero trade if the rival also chooses a quota; if the rival chooses a tax, it extracts all of the
quota rent by setting its tax at a level that makes the quota non-binding. This qualitative
comparison survives in a dynamic setting where the level of a policy instrument (tax or quota)
changes over time, as the stock of GHGs or some other state variable changes endogenously
(Wirl 2012), (Rubio 2005).
    The introduction of a nonstrategic third agent, R, into the static game qualitatively
alters incentives, and therefore potentially alters the equilibrium (Karp 1988). Suppose for
example that the exporter sets a quota and R has a downwardly sloping excess demand for
the commodity. The combination of the export quota and R’      s excess demand function causes
the strategic importer to face a kinked, but not perfectly inelastic excess supply function.
In this situation, the strategic importer does not, in general, want to set its trade policy to
render the exporter’  s quota non-binding. Such a policy still eliminates the exporter’  s quota
rents, but it (typically) is too costly for the importer, because most or all of the rents might
be transferred to R. The exporter faces an analogous situation. It does not matter whether,
in equilibrium, R is an importer or an exporter; its mere presence causes a country, whose
rival uses a trade quota, to face a downwardly sloping excess demand function for some
range of prices. Thus, the introduction of R eliminates both of the earlier results: the quota
equilibrium does not drive trade to zero, and it may not be the case that choosing a tax is
a dominant strategy in the …rst-stage game where countries choose their policy instrument.
Even with R, the use of a quota causes the partner to face a less elastic excess supply or

                                               2
demand curve (relative to the curves under a tax). Thus, using a quota encourages the
trading partner to use an aggressive trade policy. For that reason, the forces that promote
the adoption of taxes rather than quotas operate even with the presence of R, but they might
no longer be determinative.
    Our chief policy question concerns the equilibrium welfare e¤ect, of di¤erent policy
choices, on the poorer countries, R. In our setting, R is a net importer of fossil fuels.
The strategic importer has two targets, its terms of trade and climate-related damages, and
a single (state-dependent) instrument, the tax or quota. Both of its objectives encourage
the importer to restrict trade. Its trade restriction lowers the equilibrium world fossil fuel
price and slows the growth of the GHG stocks. R is a free rider; it bene…ts from both of
these changes, because it is an importer and it su¤ers climate-related damages. R therefore
prefers the strategic importer to use an aggressive trade restriction. The strategic exporter
has a single target, improving its terms of trade, and a single instrument. A more aggressive
export restriction raises the world price, harming R, and slows the accumulation of GHGs,
bene…ting R. Therefore, the welfare e¤ect, on R, of the export policy is ambiguous. The
presence of R causes carbon leakage. As the importer lowers the market price by restrict-
ing its own demand for fossil fuel imports, R increases its demand for those imports. As
the exporter increases the market price by restricting its exports, R shifts from imports to
domestic production.
    Our calibration assumes that climate-related damages are small to moderate relative
to the bene…t of consuming fossil fuel. As a consequence, terms of trade considerations
are more important than climate-related damages for both the strategic importer and R.
Higher pollution stocks cause the strategic importer to use more aggressive equilibrium
trade restrictions, in order to reduce future damages. As this importer’      s demand for fossil
fuels diminishes, with higher pollution stocks, the strategic exporter also lowers its export
quota. The higher stocks directly harm the strategic importer, but a¤ect the strategic
exporter only indirectly, via reduced importer demand. Consequently, the importer’        s policy
is much more sensitive to the pollution stock, compared to the exporter’         s policy: higher
stocks reduce both equilibrium (strategic) imports and exports, but the e¤ect on the former
is greater. Therefore, higher pollution stocks increase the supply of imports available for R.
At least for low stock levels (and in some policy scenarios for all stock levels), climate-related
damages actually increase R’    s welfare, simply because these damages cause I to reduce its
demand for imports.
    We also …nd that R’   s payo¤ is highest when the strategic importer uses a tax, and the
strategic exporter uses a quota. The exporter’      s use of a quota encourages the strategic

                                                3
importer to use a high tari¤, in order to capture quota rents. The high tari¤ reduces
the world fossil fuel price, bene…tting R. If the strategic countries can choose the policy
instrument (in addition to choosing the level of the policy), the unique Nash equilibrium
in the policy selection game is for each to use a tax, just as in the simple static model
without R. The tax is a dominant strategy for both players at every level of stock, so
this equilibrium is subgame perfect, as is the level of every policy, conditional on the stock.
Given our calibrations, the …rst-best stock trajectory under the social planner who uses a
Pigouvian tax lies below the equilibrium trajectories in the four games corresponding to the
four combinations of trade policy. The emissions reductions arising from strategic countries’
desire to improve their terms of trade, exceed the reductions due to the Pigouvian tax. Under
the Pigouvian tax, there are no terms of trade incentives.
    To check robustness, we consider other calibration assumptions. Plausibly, the climate
externality could dominate the consumption bene…ts for some countries, notably countries in
the fringe R; many of these could su¤er substantial climate damages. In such cases, welfare
to R would be higher in the long run, when bloc I uses quotas. I ’    s use of the quota lowers
accumulation of atmospheric carbon, bene…tting R in the long run.
    Recent papers compare taxes and quotas in static models of the fossil fuel markets, with
two strategic blocs and two fuels (Strand 2011), and with one fuel and a non-strategic bloc
as in our paper (Strand 2013). Both papers …nd (as do we) that the strategic fuel importer
prefers a tax policy over a quota policy. Earlier papers focus on using an import tax to
capture a seller’s resource rent, (Bergstrom 1982), (Brander and Djajic 1983), (Karp 1984),
(Karp and Newbery 1991). Climate policy may also be a means of capturing resource rents
(Wirl and Dockner 1995), (Wirl 1995), (Amundsen and Schöb 1999), (Liski and Tahvonen
2004), (Rubio 2005), (Kalkuhl and Edenhofer 2010), (Njopmouo 2010). Wirl (2012), the
paper closest to ours, studies a dynamic model with only two (strategic) blocs and no third
(passive) bloc. This simpler model can be solved analytically. In this setting also, tax policies
are dominant for both importer and exporter. Dong and Whalley (2009)’             s computable
general equilibrium model suggests that a 20% ad valorem carbon tax could increase real
income in the U.S., E.U. and China by 0.4 –0.8%, while reducing OPEC real income by 5%.
Jørgensen, Martín-Herrán, and Zaccour (2010) and Long (2010) survey applications of the
type of dynamic game that we, and many of the other cited papers, use.




                                               4
2     The Dynamic Game
There are three agents in the game, representing three regions: the strategic importer bloc
(I ), the strategic exporter bloc (E ), and the nonstrategic rest of the world (R). The importer
and exporter blocs, I and E , exercise market power, using either a trade tax or a quota.
We take as given the combination of policy instruments and calculate the equilibria under
the four policy combinations. By comparing payo¤s, we determine the equilibrium policy
choice. Region R is a price taker and can be either a net importer or exporter of fossil fuels,
depending on the world price.
     In period t, the strategic importer (I ) incurs damages resulting from the stock of GHGs,
xt . In order to emphasize the situation where agent I is more concerned than agent E
about GHG accumulations, we suppose that only I and R su¤er stock-related damages.
These stocks are the only source of dynamics. In particular, we assume that extraction costs
are independent of cumulative extraction, and we ignore the fact that resource stocks are
…nite. These assumptions produce a model with a single state variable, xt . In view of our
functional assumptions and reliance on numerical methods, we could extend the model to
include a second state variable, cumulative extraction, and thereby take into account the
non-renewable resource aspect of the problem. However, in the one-state variable model we
can present all important results graphically; those graphs would be less useful in a two-state
model, and the results would be harder to interpret. Given the complexity of results in even
the one state variable model, it is worth beginning there, despite the fact that such a model
does not capture the real-world property that fossil resources are exhaustible.
     The trajectory of the stock of GHGs is endogenous to the model. Our solution concept is
a Markov Perfect equilibrium. In any period t, the current stock is predetermined, a function
of past stocks and emissions. Both strategic players condition their period-t policy (level)
on the period t stock level, the only “directly payo¤ relevant” state variable in this model.
The equilibrium level of I ’ s policy in period t is a function of xt , which makes the importer’  s
problem dynamic. The GHG stock does not directly a¤ect the exporter’            s payo¤, because by
assumption E does not incur climate-related costs. However, I ’         s equilibrium policy (level)
is conditioned on the stock, and I ’  s policies directly enter E ’s payo¤. Therefore, E also has
a dynamic problem, and its equilibrium policy level also depends on the stock of GHGs.
Because E and I solve mutually related dynamic problems, they play a dynamic game.
     The rest of the world, R, responds passively, taking the world price as given. R’    s presence
in the model is essential for two reasons. First, we want to know how the strategic interaction
of large buyers and sellers a¤ects nonstrategic agents, in particular, developing countries.


                                                 5
Second, the presence of R’  s net demand means that when either I or E use a quota, the
other strategic agent does not face a perfectly inelastic demand or supply function. In the
absence of R, there is 0 trade in the equilibrium when both strategic agents use a quota; if
only one strategic agent uses a quota, the other strategic agent can capture all of the gains
from trade by using a price policy, absent R. Matters are more complex in the more realistic
situation where R is present in the market.


2.1    Flow payo¤s
We assume that supply and demand curves are linear, the stock-related damage function
quadratic, and that E and R’   s average production costs increase in the rate of output (but
are independent of the stock). The world fuel price, de…ned as the price that E receives and
R pays is p. Consumers in I pay the price P . P p equals the (possibly implicit) unit tax
or quota rent in I . We …rst state the single period payo¤s of the three agents, and then use
these to de…ne the dynamic game.
    Country I has no domestic production; its demand for imports equals A BP . The
tari¤ revenue or the quota rents equal (P p) (A BP ). The climate-related damages,
conditional on x, is d2
                        x2 where d is a constant. The stock x is an amalgram of all climate-
related variables, e.g. carbon stocks and temperature changes, and thus does not have
a simple physical interpretation. Merely for purpose of exposition, we refer to it as the
pollution stock. I ’ s single period payo¤ equals consumer surplus plus tari¤ revenue (or
quota rents) minus environmental damages:
                                   R   A
                                                                                          d 2
                 s‡
                I’ ow payo¤:         P
                                       B
                                           (A        Bz ) dz + (P      p) (A   BP )       2
                                                                                            x
                                                                                                (1)
                                        2
                               1 (A BP )                                    d 2
                           =   2    B
                                               + (P      p) (A       BP )   2
                                                                              x:

   At price p, R’s domestic demand is a b0 p and its domestic supply is b1 p, so its net
imports equal a bp, with b0 + b1    b. R’s gains from trade minus its climate related
damages 2 x2 equal its ‡ow payo¤:
                                           Z    a
                                                b                    1 (a bp)2
                   R’
                    s‡ow payo¤:                     (a   bz ) dz =                     x2 :     (2)
                                            p                        2    b        2

This payo¤ is not relevant to the solution to the game, because R is passive. However, the
solution to the game determines the equilibrium trajectories of p and x. That information,
together with R’ s single period payo¤, enables us to calculate the present value of the stream

                                                         6
of R’ s payo¤, and thereby enables us to see how di¤erent policies a¤ect R’     s welfare.
    The exporter, E , has no domestic consumption and faces no stock-dependent costs. If
the fuel export price is p and the (possibly implicit) export tax in region E is , E ’s producers
receive the price p . These producers’marginal cost function, equal to E ’     s supply function,
is g + f (p    ), where g and f are constants. The exporter’    s single period payo¤ equals its
domestic pro…ts plus the tax revenue or quota rents
                      Z   p
                                                                                1 2gpf + g 2 + f 2 p2    f2   2
   s‡
  E’ ow payo¤:                 (f s + g ) ds + (g + f (p               )) =                                       :   (3)
                           g
                           f
                                                                                2             f

    Each agent has the same constant discount factor, . Welfare for each agent equals the
discounted stream of their single period payo¤.


2.2    Single period equilibrium
We can express single period payo¤s as functions of the state variable, x, and the control
variables. The identity of the control variables depends on the policy scenario. A strategic
player can either use a quota, Q for I and q for E , or they can use a unit tax, T for I or
for E . It is also possible that one agent uses a quota and the other a tax, resulting in four
scenarios.
    If the agents both uses quotas (Q and q ), the equilibrium conditions in E and in the
world at large are

                          g + f (p         ) = q and q        Q       (a       bp) = 0 =)
                                                                                                                      (4)
                          q +Q+a           A Q                 g +f p q        ( f b)q +f Q+gb+f a
                 p=          b
                                 ,   P =    B
                                               ,   and    =       f
                                                                           =            bf
                                                                                                   :

If they both use taxes (T and ) equilibrium requires

                      g + f (p         )     (A      B (p + T ) + a            bp) = 0 =)
                                                                                                                      (5)
                   f +a+A BT g                      f +a+A BT g                    (f +b)T +f +a+A g
              p=      B +f +b
                                      and P =          B +f +b
                                                                       +T =               B +f +b
                                                                                                     :

If I uses the tax T and E uses the quota q , equilibrium requires

                g + f (p         ) = q and q        (A       B (p + T ) + a           bp) = 0 =)
                                                                                                                      (6)
                a+A q BT                    bT +a+A q                 g (B +b)+f (A+a) (f +B +b)q Bf T
           p=      B +b
                               and P =          B +b
                                                         and      =                 (B +b)f
                                                                                                       :




                                                         7
If I uses the quota Q and E uses the tax , equilibrium requires

                            g + f (p      )   (Q + a   bp) = 0 =)
                                                                                           (7)
                                       Q+a g +f             A Q
                                 p=      f +b
                                                  and P =    B


     The …rst two equations on the second lines of each of (4) –(7) give the equilibrium values
of p and P as linear functions of the control variables (a combination of Q, q , T and ) for
the four scenarios. If E chooses the tax, , as in the scenarios that correspond to equations
(5) and (7), the optimality condition for E ’ s problem determines . If E chooses a quota,
q , the equilibrium condition q = g + f (p    ), together with the requirement that aggregate
demand equal aggregate supply, leads to an implicit tax, . This implicit tax is a linear
function of the control variables, as shown by the third equation in the second lines of (4)
and (6). The payo¤s, presented in Section 2.1, are quadratic functions of the prices, controls
and stock, T , , and x, and thus are quadratic functions of the control variables and x in
each of the four scenarios. Given its rival’   s level of trade policy, a country is indi¤erent
whether it supports its own trade restriction using a quota or a tax. However, the level of
its rival’s equilibrium trade restriction depends on both the level and form (a tax or quota)
of its own policy instrument.


2.3    Dynamics
In a period, the current level of GHG stocks is predetermined, at level x. Region R’          s
domestic supply is b1 p and E supplies q , so total emissions are b1 p + q . The constant decay
rate is so next period stock, denoted x0 , is

                                       x0 = x + q + b1 p.                                  (8)

   We use the same procedure as above to write the right side of this equation in terms of
the control variables. For example, if both countries use quotas, we replace p with q+bQ+a .


2.4    Calibration
This game is too complicated to easily produce analytic results, but too simple to provide an
accurate empirical description of fossil fuel markets. We solve the model numerically and
select parameter values to provide an economically meaningful context, so that the results
are informative about world markets. We assume that, if I uses no trade restrictions, I


                                                  8
imports the fraction < 1 of E ’                                                               s
                                     s exports, and R imports the remainder: for any price, I ’
elasticity of demand equals R’     s elasticity of net demand. We de…ne = bb0 , the slope of
R’ s demand relative to the slope of its import demand. By varying , we can change R’         s
fraction of world production in a competitive equilibrium. By choice of units, we set I ’     s
demand intercept A = 8, and E ’        s supply slope f = 1. We assume that E ’   s production
equals 0 at p = 0, implying g = 0. This assumption and the linearity of E ’   s supply implies
that E ’ s elasticity of supply everywhere equals 1. Our second calibration assumption is that
I’s elasticity of demand, evaluated at free trade, also equals 1. With these two calibration
assumptions (and g = 0) and the normalizations A = 8 and f = 1, the choice of and
determine the remaining supply and demand parameters.

       Variable       A=8      B=        a = 8 (1          )   b0 = (1        )   b1 = (1     ) (1         )
        Value          5:6      0:7              2:4                0:05                    0:25
 b = b1 +b2 = 1         g             = 0:1667         f            d                              = 0:0000991
                                                                                                     d(1       )
        0:3             0    0:7 0:25 3:10+4:0
                                          :0+
                                               1 2:31331 10 4 :99 :95
                                Table 1: Benchmark parameter values

    For our baseline, we set = 0:7, so (absent import restrictions) I accounts for 70% of E ’s
exports, and = 0:1667, so that in a competitive equilibrium E accounts for 80% of world
supply Table 1 summarizes our parameter choices and the relation between and and
the supply and demand parameters. Table 2 shows the formulae relating model parameters
to the elasticities and , .
    To assess the sensitivity of our results to parameters, we also considered the alternative
   = 0:3. The choice        = 0:3 corresponds, roughly, to the situation where I represents
Annex B countries under the Kyoto Protocol;          = 0:7 corresponds to a more aggressive
policy scenario, where I includes the US and China; including all BRIC countries in I
increases above 0.8. The cumulative supply is about 10% higher with = 0:3 compared
to = 0:7, because with the lower , I has less market power and restricts its fuel demand
less. But results are qualitatively unchanged. The results for this alternative calibration
can be made available upon request.


   I’
    s demand elasticity =           E’
                                     s supply                  E’
                                                                s production                  I’
                                                                                               s consumption
  R’s net demand elasticity           elasticity                    share                            share
               8 g
              8f +g
                                       f 88  g
                                          f +g             8f 8+8
                                                                      8f g
                                                                    g +8   g 8     +   g
                1                         1                             0:8                           0:7

                                                       9
Table 2: Economic interpretation of demand and supply parameters; all formulae evaluated
                               at competitive equilibrium

    We choose the unit of time equal to a year and set the discount factor = 0:95, for an
annual discount rate of about 5:3%. The persistence parameter = 0:99 implies a half-life of
the pollution stock of approximately 90 years. Despite the lack of a physical interpretation of
the stock x (see above), it is important that there be an economic and physical interpretation
of the parameter d, in order to give context to the model results. We obtain the parameter
d as a function of previously chosen parameters and the level of a threshold stock above
which it is optimal for I to consume nothing. We can choose the value of this threshold,
and thereby choose the value of d, by answering the following question: How many years of
consumption at the competitive level would it take to reach the threshold stock? Our choice
of d is consistent with the answer “105 years”   , implying a threshold value of x = 900, with
an initial value x0 = 0. Appendix A explains this calibration procedure, which we intend
only as a means of providing context for a numerical value that would otherwise be hard to
interpret. Our results imply that for this value of d the environmental objectives are low
relative to the terms of trade objectives; in that respect, our calibration represents low to
moderate levels of damages.
    We set R’  s damage parameter = d(1 ) . With this choice, the ratio of I and R’           s
damage, for any stock, equals the ratio of their import demand absent trade restrictions: I
and R have the same relative bene…t of consumption to cost of stock-related damage; they
merely di¤er in size. As a second sensitivity experiment, we hold …xed other parameter
values and double the value of , to represent a situation where R has much higher damages
than I , taking into account their size di¤erence.


3    Results
We study four scenarios, in which I chooses a sequence of either taxes or quantities, repre-
sented by T or Q, and E chooses a sequence of either taxes or quantities, or q . In each
case, a player’s equilibrium control rule is a linear function of x, equal to + x for I and
   + x for E . For example, if I chooses T and E chooses q , we have T = + x and
q = + x. The values of the four endogenous parameters, ; ; ; are di¤erent in the
di¤erent scenarios. R does not use a policy, so it has no control rule. The equilibrium payo¤
of each of the agents – the present discounted value of that agent’    s future payo¤ stream –
is a quadratic function of the current stock. The payo¤ for I is V (x) = + x + !       2
                                                                                         x2 , for

                                              10
E is W (x) = + x + 2 x2 , and for R is Y (X ) = & + x + 2 x2 . Appendix B explains how
we obtain the value of these parameters in the four scenarios. Table 3 lists the parameter
names.

                Parameter                          Importer   Exporter   ROW
                Coe¢ cient of x2                   !
                Coe¢ cient of x
                Constant in Value Function
                Coe¢ cient of x in Constant Rule                         -
                Constant in Control Rule                                 -
                       Table 3: De…nition of endogenous parameters


    We use the model parameters from Section 2.4. We …rst discuss the parameter values
for the endogenous value functions and control rules for the case where both strategic agents
use quotas. We then compare the equilibrium stock trajectories, payo¤s and prices in the
four scenarios. We use information on the payo¤s to determine the equilibrium to the game
in which agents choose their policy instrument (a tax or quota).
    With one exception, we compare results across di¤erent policy scenarios by comparing
them to the corresponding result in the scenario where both strategic agents uses quotas,
which we call the “reference scenario”   . The graphs labeled ImpTExpT, ImpQExpT, and
ImpTExpQ refer, respectively, to the graph of an outcome when both agents use tax policies,
when I chooses a quota and E chooses a tax policy, and when I chooses a tax and E chooses
a quota. In all cases but one, the outcome (e.g. a payo¤, price, or quantity) is relative to
the corresponding outcome in the reference scenario.
    The exception is for I ’ s payo¤, V (x), where the reference scenario trajectory passes
through zero. For each of the four scenarios, these payo¤s are initially positive, because the
initial value of the stock is x = 0. However, as x increases, the payo¤s become negative.
The switch in sign occurs at a di¤erent time in each of the scenarios. Normalizing I ’ s payo¤
in scenario ImpTExpT, for example, by dividing by the payo¤ when both agents use quotas
(ImpQExpQ) would involve dividing by 0. To avoid this problem, we show the payo¤s for
I in the four scenarios as levels, rather than ratios.


3.1    Equilibrium parameters when both agents use quotas
Table 4 shows the equilibrium values of the endogenous parameters under our baseline cali-
bration, when both E and I use quotas. For all x, the importer’s payo¤ decreases with x,

                                             11
so ! < 0 and            s equilibrium imports decrease as the stock rises, so
                 < 0. I ’                                                                    < 0.


 Parameter                            Importer                      Exporter                     ROW
                                                           3                             6                             6
 Coe¢ cient of x2 in value function   !=    3:3       10             = 3:2156      10                = 2:3663     10
                                                                                         2                        3
 Coe¢ cient of x in value function      =   0:1435                   =   2:75      10                = 9:5   10
 Constant in value Function             = 14:4634                    = 120:9244                      = 19:6012
                                                                4                            4
 Coe¢ cient of x in control rule       =    3:9290         10        =    1:7066        10       -
 Constant in Control Rule            = 0:5049            = 1:2624           -
  Table 4: Equilibrium values of endogenous parameters when E and I both use quotas.

    Over relevant state space, E ’s value function also decreases in the stock: < 0 and j j
is large relative to . However, > 0, so E ’      s value function is convex in x. The stock
has no direct e¤ect on E , but as x increases, E faces decreasing demand from I (because
   < 0). I eventually becomes a negligible part of the market, so further decreases in its
demand have a negligible e¤ect on E ’   s payo¤; hence, the convexity of E ’ s payo¤ in x. As
I’s demand falls with the increase in x, E ’  s exports also fall:     < 0. The fact that E
su¤ers no direct loss in utility due to higher pollution stock means that its payo¤ is much
less sensitive to x, compared to I ’s payo¤. (Compare the magnitudes of ! and and of
   and .) I ’  s equilibrium quota is about twice as sensitive to the stock, compared to E ’  s
equilibrium quota:        2.
    R’ s payo¤ is a convex increasing function of the pollution stock (both and are pos-
itive); over the relevant range of stocks, the relation is approximately linear (       0). A
higher stock has o¤setting e¤ects on R’   s payo¤. The higher stock increases R’    s damages,
lowering its net payo¤. The higher stock also decreases E ’ s supply and I ’ s imports, but the
second e¤ect is approximately twice as large as the …rst (     2), so on balance a higher stock
increases the supply absorbed by R, increasing its gains from trade. With our calibration,
the higher gains from trade dominate the higher damages, so on balance higher pollution
stocks bene…t R.


3.2    Equilibrium stock trajectories
Figure 1 (a) shows the pollution stock trajectories as functions of time in the quota-setting
game, and in the …rst-best scenario where the social planner uses Pigouvian taxes. We defer
discussion of the outcome under the social planner until Section 3.3 and here discuss the
stock trajectories under the games corresponding to di¤erent combinations of trade policies.

                                                 12
Figure 1: (a) Stock Trajectory of the Quota Setting Game and under the …rst-best social
planner; (b) Stock Trajectories Relative to the Quota Setting Game for other scenarios
considered

After 150 years, the stock reaches only 22% of the threshold level (x = 900, at which it is
optimal for I to cease imports). Recall that our calibration assumes that under unrestricted
trade the stock reaches this threshold in 105 years. This comparison shows a very signi…cant
reduction (relative to free trade) in cumulative extraction, resulting from the quota-quota
policy combination. The magnitude of that reduction is consistent with either high damages
or a high incentive to exercise market power, or both. Our subsequent results show that
our calibration actually implies rather low damages, and that the stock reduction is due
primarily to agents’incentives to exercise market power.




    Figure 1 (b) shows stock trajectories relative to the reference trajectory, beginning with
the …rst period. The initial stock equals 0 and the graphs start at time t = 1. In the early
periods, the graphs re‡  ect primarily ratios of initial emissions, whereas later values of the
graphs re‡ ect ratios of cumulative emissions, adjusted for the stock decay. These graphs
are quite ‡at, implying that relative ‡ ows, across policy scenarios, change little over time.
    Cumulative stocks are 10-35% higher in the other policy scenarios, relative to the quota-
setting game. The stocks are highest where both strategic agents use taxes, and are at
intermediate levels where one agent uses a tax and the other uses a quota. For this compar-
ison, it matters little which of the two agents uses a tax. We noted that in a static setting,
equilibrium quotas tend to reduce trade to a much greater extent than equilibrium taxes.

                                              13
When an agent uses a quota rather than a tax, its trading partner faces a less elastic excess
supply or demand function, and therefore has an incentive to use a more aggressive trade
restriction. Figure 1 (b) shows that this comparison also holds in our dynamic setting.
    The steady state when both countries use taxes is x = 329, much lower than the assumed
threshold of x = 900; after 150 years the stock reaches 80% of its steady state level. The
steady state stocks in the other policy scenarios range from 254 to 280; by year 150 the
stocks in these scenarios also equal about 80% of their respective steady states.


3.3    Payo¤s and instrument selection
Figure 2 shows the importer and exporter continuation payo¤s (value functions V and W )
as functions of the stock. (Recall that the former is in levels, and the latter shows graphs
relative to the ImpQExpQ levels, accounting for the di¤erence in scale of the two …gures.)
The principal information from these …gures is that the tax is a dominant strategy for both
countries, at every stock level. If both countries believe that they can choose their policy
instrument in perpetuity in the initial period, the unique Nash equilibrium is for both to
choose a tax. If they have the opportunity to revisit this decision at any time in the future
(i.e. at any stock level), the equilibrium policy choice does not change. Consider the more
complex game in which, at each period, agents choose both their policy instrument and the
level of the instrument. In the MPE to this game, both countries choose the tax, and the
tax equals that of ImpTExpT.
    Both countries’ payo¤s decrease with the stock. The importer su¤ers stock-related
damages. As the stock increases, I tightens its trade restriction, reducing the aggregate
demand that E faces, and reducing E ’     s‡ow payo¤ and its continuation payo¤. Because
the importer su¤ers direct damages, its payo¤ is more sensitive to the stock, relative to the
exporter’ s payo¤.
    The dots on the vertical axis identify the payo¤s in the static game, obtained by setting
the damage parameter to 0. Comparison of the dots corresponding to the static games
and the intercepts corresponding to the dynamic games shows two facts. First, the payo¤
ranking is the same in the static and in the dynamic settings. Second, the payo¤s in
the dynamic setting lie only slightly below the corresponding values in the static setting.
The di¤erences re‡  ect the fact that I su¤ers from damages in the dynamic game (harming
I ) and therefore uses more aggressive trade restrictions (harming E ). Our calibration is
consistent with relatively small damages. The static terms of trade considerations are much
more important to I ’ s payo¤, compared to environmental damages. The largest di¤erence


                                             14
Figure 2: (a) Importer’                                                        s value function
                        s value function as function of the stock; (b) Exporter’
as a function of the state

between the static and dynamic counterparts corresponds to the game in which both agents
use taxes. As Figure 1 (b) shows, the equilibrium stock is signi…cantly higher in that policy
scenario, so the welfare impact of damages is greatest there.
    If the importer is constrained to use a quota, it does not (much) matter to it whether
the exporter uses a tax or a quota. (The graphs corresponding to the importer’         s payo¤
under ImpQExpQ and ImpQExpT are nearly coincident.) In contrast, if the exporter is
constrained to use a quota, it much prefers the importer to use a quota rather than a tax.
    Although the payo¤ ranking (across policy combinations) does not change with the stock
level (i.e. the graphs in Figure 2 do not cross), the payo¤ ranking for the importer does
change as a function of time. After about 100 years, the importer’        s continuation value
is lower under ImpTExpT than under the other policy scenarios. After a century, the
stock is su¢ ciently higher when both strategic blocs use taxes, compared to other policy
combinations. This higher stock reduces the importer’     s payo¤. However, as noted above,
if countries were able to reconsider their policy instrument after 100 years of the ImpTExpT
equilibrium, the unique Nash equilibrium remains for both to continue using taxes.



3.4    R’
        s payo¤s
Figure 3 (a) shows R’s payo¤s over time in the dynamic games, and its corresponding payo¤s
in the static games (the dots on the vertical axis). As for I and E , R’s ranking of policy
scenarios is the same in the static and dynamic settings; and for any policy scenario, the

                                              15
               s payo¤, Y (xt ), in the four policy scenarios; (b) R’
Figure 3: (a) R’                                                    s payo¤ with twice the
damages (2 )

payo¤ level is similar in the static and dynamic settings. Again, these features re‡      ect the
fact that the static producer and consumer surplus are much more important to R’        s payo¤,
compared to the dynamic pollution cost. R’       s payo¤ is highest when I uses a tax and E
uses a quota. I ’   s use of a tax rather than a quota reduces E ’    s incentive to restrict its
supply, bene…tting R, which in our calibration is an importer. From Figure 1 (b), the stock
trajectory is highest when both I and E use taxes. But I consumes much of that additional
supply. When I continues to use a tax and E switches from a tax to a quota, aggregate
supply falls, lowering R’ s gains from trade (and slightly lowering its damages). E ’   s switch
to a quota causes I to face a less elastic excess supply function, inducing I to increase its
tari¤, and reduce its consumption. The net e¤ect is to increase R’     s supply, thus increasing
its gains from trade, and (because damages are relatively small) increasing its payo¤.
    R’ s payo¤ is higher in the dynamic setting (with damages) compared to the static setting
without damages. In contrast, both I and E have lower payo¤s in the dynamic setting.
Section 3.1’ s discussion of endogenous parameters explains this relation: stock-related dam-
ages cause both I and E to impose tighter trade restrictions, lowering their equilibrium
gains from trade; because I su¤ers directly from the higher stocks, and E su¤ers only indi-
rectly (via the induced tightening of I ’                        s response to the higher stock
                                         s trade restriction), I ’
is greater than E ’ s. Thus, the net e¤ect of the higher stock is to increase supply available
to R, increasing its gains from trade. That increased gain swamps the direct cost to R,
arising from stock-related damages. In the quota-setting game, we noted that R’       s payo¤ is
monotonic in the stock, but this relation does not hold for all games.


                                               16
    The non-monotonicity is easiest to see in Figure 3 (b), where we double R’  s damages by
doubling . Higher damages do not alter the comparison of static and dynamic payo¤s; in
this respect, damages remain small relative to the gains from trade, even when doubles.
However, for higher damages R’    s payo¤ is non-monotonic in time. Because the stock is
monotonically increasing over time, we conclude that R’     s payo¤ is non-monotonic in the
stock. As above, a higher stock decreases I ’      s demand more than it lowers E ’ s supply,
thereby increasing the supply available to R and increasing its gains from trade; and the
higher stock increases R’ s damages. At low stocks, early in the program, the …rst e¤ect
dominates; at high stocks, later in the program, the second e¤ect dominates when we double
the damage parameter . In this case, the relation between R’       s payo¤ is …rst increasing
and then decreasing over both time and over stock levels. When the climate damages are
                                                                   s payo¤ in the ImpTExpT
su¢ ciently important for R, relative to fossil-fuel consumption, R’
game must eventually fall below its payo¤ in the ImpQExpT game, simply because the rate
of accumulation of the carbon stock is greater in the former case.


3.5    Price and policy trajectories
Figure 4 (a) shows the equilibrium world price, p (the price that R pays and E receives) and
Figure 4 (b) shows the importer’   s domestic price P . As the stock increases and I tightens
                                                             s import demand falls, the world
its trade restriction, P rises. As the stock increases and I ’
price falls. E is in the strongest position to exercise market power when it uses a tax and
I uses a quota; therefore, this scenario leads to the highest market fuel world price. I is
in the strongest position to exercise market power when it uses a tax and E uses a quota;
therefore, this scenario leads to the lowest world price. The other scenarios, where both
agents use taxes or both use quotas, result in intermediate levels of the world price.
    Recall that absent R, the equilibrium when both agents use quotas implies that no fuel is
traded. As discussed in the Introduction, the presence of R moderates this extreme result.
With R, it is too costly for the strategic agents to try to capture all of their rival’s quota
rent. Nevertheless, trade between I and E is lowest in the quota setting game, so that
scenario results in the highest domestic price for I . For similar reasons, trade between I
and E is highest when both agents use tari¤s, so that scenario leads to the lowest domestic
price for I . The dots on the vertical axes show the equilibrium prices in the static games,
where damage equal 0.
    Figure 5 graphs the explicit or (in the case where an agent uses a quota) implicit trade
tax. Consistent with our previous discussion, these …gures show that an agent has the


                                             17
                                                                         s domestic price in
Figure 4: (a) The world price, p, in the four scenarios; (b) The importer’
the four scenarios

greatest incentive to exercise market power, and therefore uses the most restrictive trade
policy, when it uses a tax and its rival uses a quantity restriction. The agent’   s trade policy
is aggressive when it uses a quantity restriction and its rival uses a tax. For all policy
combinations, the importer’  s trade tax (or quota price) increases over time, i.e. it increases
with the pollution stock. The exporter’     s implicit or explicit taxes fall slightly over time.
E’ s exports fall over time, with the fall in the price that E receives. As this price falls, a
lower export tax supports reduced levels of exports. (In contrast, at a constant world price,
the export tax would have to increase in order to support reduced exports.)
    Figure 5 also shows the Pigouvian tax trajectory, for comparison with the equilibrium
trade taxes in the di¤erent policy scenarios. The Pigouvian tax supports the …rst best
outcome. Figure 1 shows that the stock trajectory under the social planner who uses a
Pigouvian tax is lower than the trajectory under any of the four combinations of trade
policy. The strategic countries want to improve their terms of trade and, in the case of I , to
control the emissions-related future damages that they su¤er. In pursuit of these objectives,
the strategic countries reduce emissions. Those reductions exceed the reductions achieved
by the social planner who uses a Pigouvian tax imposed on all units on fuel consumption.
Under this tax, consumers in I and R and producers in E and R face the same prices; the
di¤erence between those prices equals the Pigouvian tax. In the absence of R, where one
country (E ) has production but no consumption, and the other (I ) has consumption but no
production, the …rst best output path can be supported with any combination of import and
export tax that sum to the Pigouvian tax. The division of this sum between the import
and export taxes determines the amount of tax revenue that each country collects, but has


                                               18
                      s (explicit or implicit) trade tax; (b) Importer’
Figure 5: (a) Exporter’                                               s (implicit or explicit)
trade tax

no e¤ect on equilibrium sales, and therefore has no e¤ect on e¢ ciency.
    In the presence of R, the …rst best outcome cannot be implemented using only trade
policies for I and E (simply because the …rst best requires that all consumers face the same
price, and all producers face the same price). Therefore, there is no direct way to compare the
Pigouvian tax with the sum of the trade taxes in the di¤erent policy scenarios. However, we
note that in all policy scenarios the sum of the equilibrium trade taxes exceeds the Pigouvian
tax at least for the …rst 50 years (and, except for ImpTExpT, this comparison also holds for
the entire 150 year period that we consider). In order to interpret this comparison, consider
the case of a planner whose objective is to maximize the sum of world welfare, and who is
constrained to use only an export tax for E and an import tax for I (or quota-equivalents to
such taxes). This planner cannot achieve the …rst best. The trade taxes create a distortion
in the process of achieving the desired reduction in the stock; therefore, in general, the sum of
the optimal export and import tax for this planner is less than the Pigouvian tax. The fact
that the sum of the equilibrium trade taxes exceeds the Pigouvian tax re‡       ects the fact that
the trade taxes are set (primarily) in order to improve a country’  s terms of trade, rather than
to correct the environmental distortion (which is the planner’   s sole objective). Comparison
of the two graphs in Figure 1 reinforces this interpretation.


4     Conclusion
This paper extends previous literature on dynamic games between a large bloc of fuel ex-
porters and a large bloc of fuel importers by including a nonstrategic third bloc of countries,
R , representing the group of developing countries with no climate policy nor strategic trade


                                               19
policy. The presence of this nonstrategic bloc means that even if a strategic country uses a
trade quota, the excess supply or demand function facing its trading partner is not perfectly
inelastic. We …nd, under our preferred calibration assumptions, that a tax policy by both
the strategic importer and exporter constitutes the Markov (or subgame) equilibrium to
this game, at any value of the state variable. This result echos results from related models,
especially the static three-bloc model in Strand (2013), and the dynamic two-bloc model
(without the fringe) in Wirl (2012).
    The strategic importer and exporter both use trade policies to improve their terms of
trade. The strategic importer also uses trade policy to control the future stock-related
damages, but does not internalize the damages facing R. The fact that the stock changes
over time renders the importer’    s problem dynamic. Although the exporter has no intrinsic
concern about the stock, its equilibrium trade policy depends on the importer’       s policy and
therefore is also stock dependent. For our calibration, the terms of trade objectives dominate
the environmental objective in explaining policy levels. OPEC countries appear to be
concerned that a uni…ed climate policy among OECD countries might provide both “green
cover”and a coordinating device that would enable the OECD countries to exercise greater
market power in the fuel markets. Our results indicate that OECD countries might indeed
have an incentive to behave strategically in this way; although our model has little to say
about whether uni…ed climate policy would actually induce such behavior. In our calibration,
the strategic countries’terms of trade objectives and concern for country-speci…c damages,
lead to smaller equilibrium pollution stocks than under the social planner who can use a
Pigouvian tax.
    The nonstrategic agent, R, also su¤ers stock-related damages. This country, a net fossil
fuel importer, is a free rider, bene…ting from the importer’ s trade restriction; that restriction
lowers the equilibrium price of fossil fuels and also reduces the equilibrium stock trajectory,
lowering damages to R. R’       s equilibrium payo¤ is higher in the dynamic setting, where it
incurs stock related damages, compared to the static setting where it incurs no damages.
The explanation is that stock-related damages cause the strategic importer to use more
aggressive trade restrictions, bene…ting R. The reduced competition for fossil fuel imports
more than o¤sets the stock-related damages. The social planner’      s optimal solution is to set
a Pigouvian tax applied to all fossil fuel consumption, including by the fringe. In our model,
by contrast, the fringe faces lower fossil fuel prices than the strategic importer.
    Our calibration assumes that, under free trade, the strategic importer accounts for 70%
of fossil fuel imports. This scenario corresponds to a situation where most large countries
cooperate on trade and environmental policy; those two policies are indistinguishable in our

                                               20
setting, where the strategic importer consumes but does not produce fossil fuels. We have
also considered an alternative calibration, where strategic importers account for only 30% of
imports under free trade. The qualitative results in the two cases are similar, although the
smaller importer obviously has less market power and therefore uses less aggressive trade
restrictions.




                                             21
References
Amundsen, E., and R. Schöb (1999): “Environmental taxes on exhaustible resources,”
 European Journal of Political Economy, 15(2), 311–329.

Bergstrom, T. (1982): “On capturing oil rents with a national excise tax,” The American
  Economic Review, 72(1), 194–201.

Brander, J., and S. Djajic (1983): “Rent-extracting tari¤s and the management of
 exhaustible resources,” Canadian Journal of Economics, pp. 288–298.

Dong, Y., and J. Whalley (2009): “A Third Bene…t of Joint Non-OPEC Carbon Taxes:
 Transferring OPEC Monopoly Rent,” CESifo Working Paper Series.

Johnson, H. (1953): “Optimum tari¤s and retaliation,” The Review of Economic Studies,
  21(2), 142–153.

Jørgensen, S., G. Martín-Herrán, and G. Zaccour (2010): “Dynamic games in
  the economics and management of pollution,” Environmental Modeling and Assessment,
  15(6), 433–467.

Kalkuhl, M., and O. Edenhofer (2010): “Prices vs. quantities and the intertemporal
 dynamics of the climate rent,” CESifo Working Paper Series No. 3044.

Karp, L. (1984): “Optimality and consistency in a di¤erential game with non-renewable
 resources,” Journal of Economic Dynamics and Control, 8(1), 73–97.

Karp, L. (1988): “A comparison of tari¤s and quotas in a strategic setting,” Giannini
 Foundation Working 88-6.

Karp, L., and D. Newbery (1991): “Optimal tari¤s on exhaustible resources,” Journal
 of International Economics, 30(3-4), 285–299.

                                                           s rents?,” Journal of
Liski, M., and O. Tahvonen (2004): “Can carbon tax eat OPEC’
  Environmental Economics and Management, 47(1), 1–12.

Long, N. V. (2010): A Survey of Dynamic Games in Economics. World Scienti…c, Singa-
  pore.

Njopmouo, O. (2010): “On Capturing Foreign Oil Rents,” Working Paper, University of
  Montreal.

                                          22
Rubio, S. J. (2005): “Tari¤ Agreements and Non-Renewable Resource International Mo-
 nopolies: Prices Versus Quantities,” Department of Economic Analysis, University of
 Valencia, Discussion paper no 2005-10.

Strand, J. (2011): “Taxes and Caps as Climate Policy Instruments With Domestic and
  Imported Fuels,” Gilbert Metcalf (ed.): U.S. Energy Tax Policy, Cambridge University
  Press.

        (2013): “Strategic Climate Policy with O¤sets and Incomplete Abatement: Carbon
  Taxes Versus Cap-and-Trade,” Journal of Environmental Economics and Management,
  forthcoming, (Published online, April 28, 2013).

Tower, E. (1975): “The optimum quota and retaliation,” The Review of Economic Studies,
 42(4), 623–630.

Wirl, F. (1995): “The exploitation of fossil fuels under the threat of global warming and
 carbon taxes: A dynamic game approach,” Environmental and Resource Economics, 5(4),
 333–352.

        (2012): “Global warming: prices versus quantities from a strategic point of view,”
  Journal of Environmental Economics and Management, 64, 217–    229.

Wirl, F., and E. Dockner (1995): “Leviathan governments and carbon taxes: Costs and
 potential bene…ts,” European Economic Review, 39, 1215–1236.




                                           23
A     Appendix: The calibration of d
Suppose that I believes that if it were to drop out of the market (e.g. use a prohibitive
tari¤ or set its import quota to 0), E would subsequently behave as a monopolist with
respect to R’ s import demand function. In that case (assuming f = 1; g = 0), E would set
q = 2+b , implying that p = ba2+
      a                          ab
                                +2b
                                    . The single period emissions in this case is the constant
      a       a+ab
y    2+b
         + b1 b2 +2b and the equation of motion is xt+1 = xt + y . If I ceases consumption
when the stock reaches z , the stock n periods later, denoted x , equals

                                                      X
                                                      n 1                        n
                                                                                     1
                                        n                   n        n
                                 xn =       z+y                  =       z+y           :
                                                      n=0
                                                                                     1

The present discounted value of the stream of marginal damages, when the stock reaches z ,
is then

                       X
                       1                    X
                                            1
                                                        t                y           dy X
                                                                                          1
                             t                                                                 t
                   d             xt = d           (     )       z+
                       t=0                  t=0
                                                                             1         1 t=0
                                             (1   )z + y
                                   = d                    :
                                            (   1) (   1)

The marginal value to I of consuming the …rst unit is the di¤erence between its choke price
                         A     a+ab
and the monopoly price, B     b2 +2b
                                     . If it is optimal for I to cease consumption, under the
belief that subsequent emissions would be y in each period, then the marginal bene…t of an
additional unit of production equals the present discounted value of the stream of future
marginal damages,
                             A      a + ab        (1    )z + y
                                     2
                                             =d                   :                       (9)
                             B b + 2b            (   1) (      1)
This expression gives d as an implicit function of z , the threshold stock above which it is
optimal for I to cease consumption.
    Under perfect competition, let annual production equals s. Denote N as the number of
years that it would take the stock to reach z units, starting from a zero stock level, given
                                                N
annual emissions s: N is the solution to z = s 11 . We can use this equation to eliminate
z from equation (9), resulting in an implicit expression for d as a function of the previously
de…ned parameters and the new parameter, T . Our choice d = 3:3043 10 4 is equivalent
to setting N = 105. In summary, our choice of d is consistent with a circumstance where
it would be optimal for I to stop consuming the carbon intensive good after approximately
105 years of world consumption at the competitive level, given I ’    s belief that subsequent


                                                            24
consumption would be at the monopoly price with respect to R excess demand.1


B      Appendix: The solution to the model
We …rst explain how we re-write the problem in order to unify the four scenarios This
procedure enables us to solve a single game, and then obtain each of the policy scenarios by
appropriate choice of parameters. We then explain how to solve the uni…ed model.


B.1      The uni…ed model
In all four scenarios, corresponding to the di¤erent policy mixes, we can write the single
period payo¤s of E and I and their “perceived”equation of motion (de…ned below) as

                                                                            dI 2
                         s payo¤: fI Q2 + gI Qx + hI Q + rI x + sI
                        I’                                                  2
                                                                              x
                                                                                                     (10)
                                                        0
                           Equation of motion: x = kI x + mI Q + nI :

                                                                             dE 2
                        s payo¤: fE q 2 + gE qx + hE q + rE x + sE
                       E’                                                     2
                                                                                x
                                                                                                     (11)
                                                       0
                           Equation of motion: x = kE x + mE q + nE :
We intentionally abuse notation here in order to obtain a uni…ed (for all four policy scenarios)
expression of the game, so that we can use a single program to obtain the equilibrium in all
four cases. We now explain the relation between equations (10) and (11).
    Consider …rst the case where both I and E choose quantities, Q and q . In a linear MPE
both agents believe that their rival uses a linear control rule. Suppressing time subscripts, I
believes that E sets q = + x and E believes that I sets Q = + x, where the endogenous
parameters ; ; ; are to be determined. The beliefs are con…rmed in equilibrium. That
            s belief about E ’
is, given I ’                             s optimal policy is Q = + x, and given E ’
                              s policy, I ’                                               s belief
about I ’s policy, E ’s optimal policy is q = + x.
    Using the price under quotas, and I ’   s belief, I expects the equilibrium price to be

                                     Q+a       q       Q+a     ( + x)
                                p=                 =                  :
                                       b                      b
   1
    As noted above, this explanation is intended to provide context for an otherwise hard-to-interpret nu-
merical value, not to represent a plausible outcome. In particular, the calibration described here implies
z = 900. However, world equilibrium production under the monopoly price, when I has exited the market,
would be too little to sustain the stock at that level.


                                                   25
Using this expression and P = ABQ in I ’          s‡ ow payo¤, equation (1), we write that payo¤ as
a quadratic function in q and x, as in the …rst line of equation (10). Equating coe¢ cients of
terms of the same power (e.g., equating the coe¢ cient of x2 in both equations), we obtain the
                                                                      s “perceived”equation of motion
formulae for fI ; gI ; hI ; rI ; sI . Similarly, given its beliefs, I ’
(i.e., its belief about the equation of motion) is

                                  x0 = x + ( + x) + b1 Q+a ( b
                                                               + x)

                                 = +       b1 b x + bb1 Q + + b1 a b ;

which has the same form as the second line in equation (10). Again, equating coe¢ cients
of terms of the same power, we obtain the formulae for kI ; mI ; nI . We obtain the formulae
for the coe¢ cients in equation (11) using the same procedure.
    We use the same method to obtain formulae for the coe¢ cients of the other three control
problems.


B.2      Solution to the uni…ed model
We now work with the control problems de…ned by equations (10) and (11). Each agent’  s
                                                                                      s
equilibrium control rule, q = + x for E and Q = + x for I , appears in the other agent’
control problem. Consider E ’s control problem. Its dynamic programming equation (DPE)
is

                           W (x) = maxq [fE q 2 + gE qx + hE q + rE x + sE
                                                                                                           (12)
                                     dE 2
                                      2
                                        x   + W (kE x + mE q + nE ) ;
where the second line uses the second line in equation (11) to write W (x0 ) as a function of
the current x and the current choice q . Because of our choice of a linear equilibrium, E
solves a linear quadratic control problem, for which it is well known that the unique solution
is a quadratic value function.2 We write this function as W (x) = + x + 2 x2 , where the
parameters ; ; are to be determined. Using this function to eliminate W (x0 ) on the
right side of equation (12), we express the right side as a linear quadratic function of q; x
and the unknown coe¢ cients. We maximize this expression with respect to q to obtain the
   2
     There are non-linear equilibria in this model. However, the linear equilibrium is an obvious choice to
study, because it is the limit of the sequence of equilibria in the …nite horizon game, as the time horizon goes
to in…nity. It is also the only equilibrium that is de…ned for all state space, ignoring inequality constraints.
  The linearity of the equilibrium requires that inequality constraints, e.g. the non-negativity of prices and
quantities, are never binding in equilibrium. For our calibration, these inequality constraints are in fact
never binding for values of the state variable reached in equilibrium.


                                                      26
                 s control rule Q =
coe¢ cients of E ’                                    + x:

                                                         hE +      mE +2 mE nE
                                                    =           (2fE + m2E)
                                                                                                                       (13)
                                                             gE + m E k E
                                                        =    (2fE + m2
                                                                          :
                                                                      E)



    The maximized value of the right side of the DPE (12) is a quadratic function in x, as is
the left side. The DPE holds identically in x if and only if the coe¢ cients of terms of order
of x are equal. We de…ne

                                                                                 2
               E   = 2fE + 2gE mE kE + dE m2
                                           E
                                                                           2
                                                                     2 fE kE              4 m2
                                                                                             E (gE + 2dE fE )          (14)

and equate coe¢ cients of terms of order of x on the two sides of the maximized DPE to
obtain the following formula for the unknown parameters3

                                 1
                           =   2 m2
                                       ( (2fE + 2gE mE kE + dE m2
                                                                E
                                                                                                2
                                                                                         2 f E kE )         E)
                                   E


                                     hE   mE kE +gE     mE nE 2 fE nE kE +gE hE rE (2fE +             m2
                                                                                                       E)
                               =                      2fE + m2                                                         (15)
                                                              E 2 fE kE +gE mE

                                                                        2 2 2
                       1   2       fE n2   2
                                       E +hE +2hE   mE +2hE mE nE +        mE        4    nE 2sE (2fE +      m2
                                                                                                              E)
                   =   2                                 (2fE + m2
                                                                                                                   :
                                                                 E )(     1)


    The importer I solves a similar control problem, where its single period payo¤ is the …rst
line of equation (10) and its perceived equation of motion is the second line of that equation.
Denoting I ’s value function as V (x), we write its DPE as

                                   V (x) = maxQ [fI Q2 + gI Qx + hI Q + rI x + sI
                                            dI 2
                                                                                                                       (16)
                                            2
                                              x + V (kI x + mI Q + nI )

Equations (16) has the same form as the exporter’   s DPE (12), except that the subscript I
replaces the subscript E on parameter coe¢ cients, the function V replaces W , and the control
Q replaces q . Denote the quadratic value function as V (x) = + x + !      2
                                                                             x2 . Substituting
   3
     The equations for and for ! are quadratics. For both of these equations we take the smaller root, leading
to the …rst line of equation (15). The smaller root satis…es the transversality condition. In addition, when
we repeat this procedure for the importer, the smaller root is the only negative root. The coe¢ cient of x2
in the importer’ s value function must be negative, as discussed in the text.
   We con…rmed that the choice of the smaller root for both quadratics is correct by solving these equations
for the other three combinations of roots. For two of these combinations, there was no equilibrium candidate
because there was no solution to the two equations given by the two roots. For the third combination, there
was a solution to these two equations, but it resulted in negative stocks, and thus violates the requirement
that stocks be non-negative.


                                                                27
this function into the DPE (16) we repeat the procedure above to obtain expressions for the
endogenous parameters ; ; !; ; . These formulae are identical to those in equations (13)
and (15), except that the subscript I replaces the subscript E , and the parameters ; ; !; ;
replace the parameters ; ; ; ; ; we also de…ne a function I using an equation analogous
to (14).
    The system consisting of (13) and (15) and the de…nition (14), together with the cor-
responding equations (not shown) for I can be solved recursively. We …rst solve the four
equations that determine !; ; ; . This four dimensional system can be reduced to a two-
dimensional system by noting that for all policy scenarios, gE is a linear function of , and gI
is a linear function of . The second line of equation (13) shows that is a linear function
of gE , and hence a linear function of . Inspection of the analogous equation for I (not
shown), shows that is a linear function of . We can solve this two dimensional linear
system to obtain values of and as functions of ! and . Substituting these expressions
into the equations that determine ! and (the …rst line of equation (15) for ! and the
corresponding equation – not shown – for ), we obtain two cubics in ! and . We can
numerically solve these two cubics to …nd the correct values of ! and .
    Given the values of ! and , we can then obtain and using the the expressions
described in the previous paragraph. With numerical values for !; ; ; , we then use the
equations for and and the corresponding equations (not shown) for and to solve for
these four parameters; this system is linear. We then solve the decoupled equations for
and (again, the equation for is not shown).
    We also need an expression for the present discounted value of the stream of R’    s payo¤.
Equation (2) gives R’  s single period payo¤. Denote p = R x + R and Q = R x + R as the
equilibrium values of p and Q. The parameters of these functions depend on the particular
policy scenario, and their values are obtained from the solution to the di¤erent games. R’    s
‡ ow payo¤ depends on p, which in equilibrium is a function of x, and the evolution of x
depends on both p and Q, via equation (8). R’     s continuation payo¤ is therefore a function
of x, which we denote Y (x). The value of the stream of R’   s payo¤ equals its ‡ow payo¤ plus
its discounted continuation payo¤. Therefore, Y (x) must satisfy the functional equation

                                    1 (a bp)2
                          Y (x) =             + Y ( x + Q + b1 p):                        (17)
                                    2    b

Substituting the quadratic trial solution, Y (x) = 2 x2 + x + & , into equation (17) and
                                                                                          s
equating coe¢ cients of terms in order of x provides the equations for the parameters of R’



                                              28
value function:
                                                          b   2
                           =     2
                                     +2        2 +2
                                                              R
                                                                                    2 b2
                                          R+   R           R b1 +2         R R+     R 1    1

                      a                                                                              2
                          R +b R R +      R+      R R+             R R b1 +       R R b1 +      R R b1
                  =                       (1                      2
                                                      R           R     R b1 )

                                 1            1       2+                              1        2 2
                          a   R+ 2 b R+    R+ 2                   R b1 +     R b1 R + 2        R b1
                      =                               R
                                                      1
                                                                                                      :




                                                   29
C     Appendix: Calculation of a Pigouvian Tax
As in the text, the world price, de…ned as the price that E receives, is p. Consumers in I
pay an additional Pigou Tax ( ) added to the price: p + and consumers in R face the
same price.
    Country I has no domestic production; its demand for imports equals A B (p + ). The
climate-related damages, conditional on x, is d2
                                                 x2 where d is a constant. I ’
                                                                             s single period
payo¤ equals consumer surplus minus environmental damages:
                                                       R   A
                                                                                        d 2
                              s‡
                             I’ ow payo¤:                  B
                                                           p+
                                                                (A        Bz ) dz       2
                                                                                          x
                                                                                                                (18)
                                                         2
                                           1 (A B (p+ ))                  d 2
                                       =   2      B                       2
                                                                            x:

    At price p + , R’ s domestic demand is a b0 (p + ) and its domestic supply is b1 p, so
its net imports equal a bp b0 , with b0 + b1 b. R’   s gains from trade minus its climate
                    2
related damages 2 x equal its ‡ ow payo¤:
                   Z    a                      Z
                       b0
                                                   p
                                                                          1 (a      b0 (p + ))2 b1 p2
 R’
  s‡ow payo¤:               (a   b0 z ) dz +           (b1 z ) dz =                            +            x2 : (19)
                     p+                        0                          2            b0         2     2

   The exporter, E , has no domestic consumption. These producers’marginal cost function,
           s supply function, is f + gp, where f and g are constants. The exporter’
equal to E ’                                                                      s single
period payo¤ equals its domestic pro…ts
                                                                Z   p
                                  s‡
                                 E’ ow payo¤:                           (f + gz ) dz:                           (20)
                                                                0


Each agent has the same constant discount factor, . Welfare for each agent equals the
discounted stream of their single period payo¤.
    The social planner maximizes the sum of the payo¤s plus rents collected through the tax.


              1 (A     B (p + ))2       d 2 1 (a                b0 (p + ))2 b1 p2
                                                                         1
social payo¤:                             x+                     x2 +f p+ gp2 + (f +gp+b1 p)
                                                                           +
              2          B              2    2                 2   b0    2    2
                                                                                  (21)
We can write the total demand equal to total supply (to get p in terms of ) and the




                                                           30
“perceived”equation of motion (de…ned below) as

     Equating Supply with Demand: f + gp + b1 p = a                          b0 (p + ) + A    B (p + )

                                                              a b0 +A B      f
                               which results in p =             g +b1 +b0 +B                               (22)
                                                         0
                     Equation of motion: x = x + f + gp + b1 p
         which results in: x0 = x + f + g ( a b0 +A B
                                              g +b1 +b0 +B
                                                           f
                                                             ) + b1 ( a b0 +A B
                                                                        g +b1 +b0 +B
                                                                                                f
                                                                                                    )

The social planner will choose a tax which in equilibrium is a linear function of the state,
  = + x. The social planner solves the following optimization problem

                 h                 2                      2
                     1 (A B (p+ ))         1 (a b0 (p+ ))         b1 p2
   S (x) = max       2      B
                                       +   2       b0
                                                              +     2
                                                                          + fp + 1
                                                                                 2
                                                                                   gp2 + (f + gp + b1 p)
                                       d
                           +      2
                                           x2 + S ( x + f + gp + b1 p) ;                                   (23)
                                                   s:t:
                                           p= a b 0 +A B
                                                 g +b1 +b0 +B
                                                              f



where the second line uses the equation of motion to write S (x0 ) as a function of the current
x and the current choice . The social planner solves a linear quadratic control problem,
for which it is well known that the unique solution is a quadratic value function. We write
this function as S (x) = + x + 2 x2 , where the parameters ; ; are to be determined.
Using this function to eliminate S (x0 ) on the right side of equation (12), we express the right
side as a linear quadratic function of ; x and the unknown coe¢ cients. We maximize this
expression with respect to to obtain the coe¢ cients of the control rule = + x:
    The maximized value of the right side of the DPE (12) is a quadratic function in x, as
is the left side. The DPE holds identically in x if and only if the coe¢ cients of terms of
order of x are equal. We equate coe¢ cients of terms of order of x on the two sides of the
maximized DPE to obtain the unknown coe¢ cients. Hence, we obtain                 = + x, the
optimal Pigou tax as determined by the social planner.




                                                         31