Keyboard Shortcuts?

×
  • Next step
  • Previous step
  • Skip this slide
  • Previous slide
  • mShow slide thumbnails
  • nShow notes
  • hShow handout latex source
  • NShow talk notes latex source

Click here and press the right key for the next slide (or swipe left)

also ...

Press the left key to go backwards (or swipe right)

Press n to toggle whether notes are shown (or add '?notes' to the url before the #)

Press m or double tap to slide thumbnails (menu)

Press ? at any time to show the keyboard shortcuts

\title {Joint Action \\ Lecture 09}
 
\maketitle
 

Lecture 09

Joint Action

\def \ititle {Lecture 09}
\def \isubtitle {Joint Action}
\begin{center}
{\Large
\textbf{\ititle}: \isubtitle
}
 
\iemail %
\end{center}

distributive vs collective

‘The injections saved her life.’

Consider the statement, ‘The injections saved her life.’ This could be true in virtue of her receiving several injections on different occasions, each of which saved her life. In this case, the injections saving her life is just a matter of each injection individually saving her life; this is the distributive interpretation. But the statement is also true if she was given two injections on a single occasion where each injection was necessary but not sufficient to save her life. In this case the injections saving her life is not, or not just, a matter of each injection individually saving her life; this is the collective interpretation.
The difference between distributive and collective interpretations is clearly substantial, for on the distributive interpretation the statement can only be true if her life has been saved more than once, whereas the truth of the collective interpretation requires only one life-threatening situation.
Just as some injections can be collectively life-saving, so some actions can be collectively directed to a goal. For example, consider this sentence:

‘The goal of their actions is to find a new home.’

This can be interpreted distributively: each of their actions is separately directed to finding a new home. But it can also be interpreted collectively: finding a home is an outcome to which their actions are directed and this is not, or not just, a matter of each of their actions being individually directed to finding a home.
To say that an outcome is a \emph{collective goal} of some actions is just to say that it is an outcome to which the actions are directed and this is not, or not just, a matter of each action being individually directed to that outcome.
Note that collective goals do not plausibly require any kind of intentions or commitments. After all, there is a sense in which some of the actions of swarming bees are directed to finding a nest and this is not, or not just, a matter of each bee’s actions being individually directed to finding a nest. So finding a nest is a collective goal of the bees’ actions.
 
\section{Collective Goals and Motor Representations}
 
\section{Collective Goals and Motor Representations}
Motor representations can ground collective goals in this sense: in some cases, two or more actions involving multiple agents have a collective goal in virtue of the actions being appropriately related to an interagential structure of motor representations.
Motor representations concern not only bodily configurations and movements but also more distal outcomes such as the grasping of a mug or the pressing of a switch \citep{butterfill:2012_intention,hamilton:2008_action,cattaneo:2009_representation}.
Some motor processes are planning-like in that they involve deriving means by which the outcomes could be brought about and in that they involve coordinating subplans \citep{jeannerod_motor_2006,zhang:2007_planning}.
Motor processes concerning actions others will perform occur in observing others act \citep{Gangitano:2001ft}---and even in observing several others act jointly \citep{manera:2013_time}---and enables us to anticipate their actions \citep{ambrosini:2011_grasping,aglioti_action_2008}.
In joint action, motor processes concerning actions another will perform can occur \citep{kourtis:2012_predictive, meyer:2011_joint}, and can inform planning for one's own actions \citep{vesper:2012_jumping,novembre:2013_motor,loehr:2011_temporal}.
In some joint actions, the agents have a single representation of the whole action (not only separate representations of each agent's part) \citep{tsai:2011_groop_effect,loehr:2013_monitoring,Menoret:2013fk}, and may each make a plan for both their actions \citep{meyer:2013_higher-order,kourtis:2014_attention}.
An interagential structure of motor representation: \begin{enumerate} \item there is an outcome to which a joint action could be collectively directed and in each agent there is a motor representation of this outcome; \item these motor representations trigger planning-like processes in each agent which result in plan-like hierarchies of motor representations; \item the plan-like hierarchy in each agent involves motor representations concerning another's actions as well as her own; \item the plan-like hierarchies of motor representations in the agents nonaccidentally match. \end{enumerate}
\section{Motor Representation}
A \emph{goal} is an outcome to which an action is directed.
Motor representations represent goals such as the grasping of an egg or the pressing of a switch. These are outcomes which might, on different occasions, involve very different bodily configurations and joint displacements (see \citealp{rizzolatti_functional_2010} for a selective review).
Motor representations trigger processes which are planning-like insofar as they involve (a) computing means from representations of ends; and (b) satisfying relational constraints on actions.

In virtue of what do actions involving multiple agents ever have collective goals?

Recall how Ayesha takes a glass and holds it up while Beatrice pours prosecco; and unfortunately the prosecco misses the glass, soaking Zachs’s trousers. Ayesha might say, truthfully, ‘The collective goal of our actions was not to soak Zach's trousers in sparkling wine but only to fill this glass.’ What could make Ayesha’s statement true?
light
[Not supported by viewer]
smoke
[Not supported by viewer]
drop
[Not supported by viewer]
throw
[Not supported by viewer]
discard
[Not supported by viewer]
amuse
[Not supported by viewer]
scare
[Not supported by viewer]
freak out
[Not supported by viewer]
block
[Not supported by viewer]
shared intention
or i.a.s.m.r.
coordinates
[Not supported by viewer]
represents
[Not supported by viewer]
As this illustrates, some actions involving multiple agents are purposive in the sense that
among all their actual and possible consequences,
there are outcomes to which they are directed
and the actions are collectively directed to this outcome
so it is not just a matter of each individual action being directed to this outcome.
In such cases we can say that the actions have a collective goal.
As what Ayesha and Beatrice are doing---filling a glass together---is a paradigm case of joint action, it might seem natural to answer the question by invoking a notion of shared (or `collective') intention. Suppose Ayesha and Beatrice have a shared intention that they fill the glass. Then, on many accounts of shared intention,
the shared intention involves each of them intending that they, Ayesha and Beatrice, fill the glass; or each of them being in some other state which picks out this outcome.
The shared intention also provides for the coordination of their actions (so that, for example, Beatrice doesn't start pouring until Ayesha is holding the glass under the bottle). And coordination of this type would normally facilitate occurrences of the type of outcome intended. In this way, invoking a notion of shared intention provides one answer to our question about what it is for some actions to be collectively directed to an outcome.
Are there also ways of answering the question which involve psychological structures other than shared intention? In this paper we shall draw on recent discoveries about how multiple agents coordinate their actions to argue that the collective directedness of some actions to an outcome can be explained in terms of a particular interagential structure of motor representations. Our actions having collective goals is not always only a matter of what we intend: sometimes it constitutively involves motor representation.
Few discussions of joint action have considered the existence of Very Small Scale Shared Agency ...

Very Small Scale
Shared Agency

A \emph{very small scale action} is one that is typically distantly related as a descendent by the means-end relation to the actions which are sometimes described as ‘small scale’ actions, such as playing a sonata, cooking a meal or painting a house \citep[e.g.][p.~8]{bratman:2014_book}.
There are very small scale joint actions like playing a chord together in the course of playing a duet, clinking glasses in the course of toasting our success, or plassing a plate between us in the course of doing the washing up together.

Small Scale
Shared Agency

Very Small Scale
Shared Agency

Playing a piano duet

Playing a chord together

Toasting our success together

Clinking glasses

Washing up together

Passing a plate between us

Philosophers have rarely considered such very small scale joint actions. But at least some such cases seem to involve exercising shared agency no less than larger scale activities like painting a house together.
But very small scale joint actions create a challenge to views like Gilbert’s.
Those views hinge on the roles of intention and practical reasoning.
But in at least some cases, very small scale joint actions are not a consequence of practical reasoning concerning those particular actions, nor need they involve intentions which specify outcomes to which the very small scale joint actions are directed. There is simply no need for practical reasoning, or intention, in many such cases. This is particularly obvious if you think about very small scale joint actions which occur in the context of larger scale activities, such as our playing a chord in the course of playing a piano duet.
Of course, there is no principled bar to having intentions concerning the goals of such very small scale actions (as far as I know), and such intentions may sometimes exist. But in very small scale cases of acting together, intentions and practical reasoning are often superfluous and sometimes absent.
Despite this, these very small scale interactions appear to involve exercises of shared agency no less than small scale activities such as playing a piano duet.
For very small scale cases, in virtue of what might they have collective goals?

motor representations represent outcomes and trigger planning-like processes

Let me mention some almost uncontroversial facts about motor representations and their action-coordinating role.
Suppose you are a cook who needs to take an egg from its box, crack it and put it (except for the shell) into a bowl ready for beating into a carbonara sauce. Even for such mundane, routine actions, the constraints on adequate performance can vary significantly depending on subtle variations in context. For example, the position of a hot pan may require altering the trajectory along which the egg is transported, or time pressures may mean that the action must be performed unusually swiftly on this occasion. Further, many of the constraints on performance involve relations between actions occurring at different times. To illustrate, how tightly you need to grip the egg now depends, among other things, on the forces to which you will subject the egg in lifting it later. It turns out that people reliably grip objects such as eggs just tightly enough across a range of conditions in which the optimal tightness of grip varies. This indicates (along with much other evidence) that information about the cook’s anticipated future hand and arm movements appropriately influences how tightly she initially grips the egg (compare \citealp{kawato:1999_internal}). This anticipatory control of grasp, like several other features of action performance (\citealp[see][chapter 1]{rosenbaum:2010_human} for more examples), is not plausibly a consequence of mindless physiology, nor of intention and practical reasoning. This is one reason for postulating motor representations, which characteristically play a role in coordinating sequences of very small scale actions such as grasping an egg in order to lift it.
The scale of an actual action can be defined in terms of means-end relations. Given two actions which are related as means to ends by the processes and representations involved in their performance, the first is smaller in scale than the second just if the first is a means to the second. Generalising, we associate the scale of an actual action with the depth of the hierarchy of outcomes that are related to it by the transitive closure of the means-ends relation. Then, generalising further, a repeatable action (something that different agents might do independently on several occasions) is associated with a scale characteristic of the things people do when they perform that action. Given that actions such as cooking a meal or painting a house count as small-scale actions, actions such as grasping an egg or dipping a brush into a can of paint are very-small scale. Note that we do not stipulate a tight link between the very small scale and the motoric. In some cases intentions may play a role in coordinating sequences of very small scale purposive actions, and in some cases motor representations may concern actions which are not very small scale. The claim we wish to consider is only that, often enough, explaining the coordination of sequences of very small scale actions appears to involve representations but not, or not only, intentions. To a first approximation, \emph{motor representation} is a label for such representations.% \footnote{% Much more to be said about what motor representations are; for instance, see \citet{butterfill:2012_intention} for the view that motor representations can be distinguished by representational format. }
What do motor representations represent? An initially attractive, conservative view would be that they represent bodily configurations and joint displacements, or perhaps sequences of these, only. However there is now a significant body of evidence that some motor representations do not specify particular sequences of bodily configurations and joint displacements, but rather represent outcomes such as the grasping of an egg or the pressing of a switch. These are outcomes which might, on different occasions, involve very different bodily configurations and joint displacements (see \citealp{rizzolatti_functional_2010} for a selective review).
Such outcomes are abstract relative to bodily configurations and joint displacements in that there are many different ways of achieving them.
Motor representations make very small scale actions the actions they are. Which very small scale action you are performing or attempting to perform---for example, which phoneme you are articulating or attempting to articulate---is a matter of which outcomes are specified by the motor representations controlling your action.
Relatedly, motor representations can trigger processes which are like planning in some respects. These processes are planning-like in that they involve starting with representations of relatively distal outcomes and gradually filling in details, resulting in motor representations that can be hierarchically arranged by the means-end relation \citep{bekkering:2000_imitation,grafton:2007_evidence}. Some processes triggered by motor representations are also planning-like in that they involve meeting constraints on the selection of means by which to bring about one outcome that arise from the need to select means by which, later, to bring about another outcome \citep{jeannerod_motor_2006, zhang:2007_planning, rosenbaum:2012_cognition}.

Conjecture :
collective goals are represented motorically

\begin{enumerate} \item There is one outcome which each agent represents motorically, and \item in each agent this representation triggers planning-like processes \item concerning all the agents’ actions, with the result that \item coordination of their actions is facilitated. \end{enumerate}
Let me explain what this amounts to.

I.e. sometimes, when two or more actions involving multiple agents are, or need to be, coordinated:

  1. Each represents a single outcome motorically, and
  2. in each agent this representation triggers planning-like processes
  3. concerning all the agents’ actions, with the result that
  4. coordination of their actions is facilitated.
  1. Each represents a single outcome motorically, and
  2. in each agent this representation triggers planning-like processes
  3. concerning all the agents’ actions, with the result that
  4. coordination of their actions is facilitated.
What do we need? (i) Evidence that a single outcome to which all the actions are directed is represented motorically.
(ii) Evidence that this triggers planning-like processes,
(iii) where these concern all the agents' actions,
and (iv) the existence of such representations facilitates coordination of the agents' actions.
 
\section{Kourtis et al (2014)}
So far I've been providing some evidence for the conjecture that collective goals are represented motorically.
It’s important to stress that is is only one bit of the evidence. There’s quite a bit more.

Conjecture :
collective goals are represented motorically

I.e. sometimes, when two or more actions involving multiple agents are, or need to be, coordinated:

  1. Each represents a single outcome motorically, and
  2. in each agent this representation triggers planning-like processes
  3. concerning all the agents’ actions, with the result that
  4. coordination of their actions is facilitated.
In particular the evidence I have provided doesn’t address the issue of coordination. There is a little bit of direct evidence for this that I won't mention.
And of course much more evidence would be needed before we could regard the conjecture as established.

In virtue of what do actions involving multiple agents ever have collective goals?

Recall how Ayesha takes a glass and holds it up while Beatrice pours prosecco; and unfortunately the prosecco misses the glass, soaking Zachs’s trousers. Ayesha might say, truthfully, ‘The collective goal of our actions was not to soak Zach's trousers in sparkling wine but only to fill this glass.’ What could make Ayesha’s statement true?
light
[Not supported by viewer]
smoke
[Not supported by viewer]
drop
[Not supported by viewer]
throw
[Not supported by viewer]
discard
[Not supported by viewer]
amuse
[Not supported by viewer]
scare
[Not supported by viewer]
freak out
[Not supported by viewer]
block
[Not supported by viewer]
shared intention
or i.a.s.m.r.
coordinates
[Not supported by viewer]
represents
[Not supported by viewer]

cooperation?

So much for motor representation. How does this relate to cooperation?
Earlier I suggested that

Purposive actions are cooperative to the extent that, for each agent, her performing these actions rather than any other actions depends in part on how good an overall pattern of trade-offs between demandingness and well-suitedness can be achieved for all of the actions.

Now I want to suggest that

Where we each represent a collective goal motorically, our actions will normally be cooperative in this sense.

Why is this true?
Motor representation of collective goal in each of us.
In each of us, the motor representation triggers a process which results in a plan-like structure of motor representations. These motor representations concern all of our actions, yours and mine.
These motor representations are agent-neutral. The motor process constructs the plan-like structure in such a way as to achieve a good pattern of trade offs between demandingness and well-suitedness across all of the actions, yours and mine.
Further, because humans are quite similar motorically, in many situations our motor plans will nonaccidentally match. That is, they will be identical or be such that differences between them do not matter to our coordination.
So the actions you perform will have been selected in part because they enable a good balance of pattern of trade offs between demandingness and well-suitedness across all of our actions, yours and mine. And this is just what cooperation requires.
[ALT: ...]
Because in the sort of parallel-planning triggered by the motor representations in an interagential structure of motor representations, there is no distinction between actions I will perform and actions you will perform.
I am, in effect, just trying to work out the best way to achieve the outcome irrespective of who does what; and so are you; and this is the hallmark of effective cooperation.
To illustrate, consider a case where I grasp a mug and pass it to you. Considering just my action, let’s say it is most comfortable for me to grasp the mug by the handle. But this would force you to grip the mug awkwardly, making the overall awkwardness of our action very high. If our actions are cooperative, I will attempt to reduce the overall awkwardness by grasping the mug in a way that is slightly less awkward for me. And I am likely not to grasp it by the handle \citep{meyer:2013_higher-order}. Instead, I will adopt a more awkward
Where we each represent a collective goal motorically, our actions will normally be cooperative.

Limit: very small scale joint actions

What’s exciting isn’t that all cooperation involves motor representations. It’s that mere motor representation makes any cooperative actions possible at all. Cooperative action isn’t something that depends on terribly sophisticated notions like shared intention or joint commitment. It begins with the motoric.
\textbf{A certain interagential structure of motor representation is among the things which can enable humans to cooperate.}

break

Why motor representation?

If you want to understand how something works, it seems to me that thinking seriously about motor representations is unavoidable.
WHY MOTOR REPRESENTATION? ONE ADVANTAGE OF THEORISING ABOUT MOTOR REPRESENTATIONS IS THAT YOU CAN GENERATE TESTABLE PREDICTIONS. INTENTIONS ARE FICTIONS. YOU CANNOT GENERATE TESTABLE PREDICTIONS FROM A THEORY ABOUT INTENTION. SO IF YOUR QUESTION IS ABOUT WHAT ENABLES COOPERATION, YOU WILL PROBABLY NEED A STORY ABOUT PSYCHOLOGICAL MECHANISMS.
ONTOLOGY : YOU WOULDN’T DO IT WITHOUT THINKING ABOUT PHYSICS, WOULD YOU? SOCIAL ONTOLOGY NEEDS PSYCHOLOGY LIKE ONTOLOGY NEEDS PHYSICS.
Gilbert is explicit about this. What grounds her theorising>

‘informal observation including self-observation’ and my ‘own sense of the matter’.

(Gilbert, 2014 pp. 24, 358)

This is fine. But it doesn't involve anything measurable or repeatable. You will never be able to generate any testable predictions. So you are quite unlikely to be on the path to discovering about how things work.
Which is fine if you’re Gilbert, of course, because she has no such aim. But if you’re me, you want to know how things work.

D'Ausilio et al (2009, figure 1)

‘Double TMS pulses were applied just prior to stimuli presentation to selectively prime the cortical activity specifically in the lip (LipM1) or tongue (TongueM1) area’ \citep[p.~381]{dausilio:2009_motor}
‘We hypothesized that focal stimulation would facilitate the perception of the concordant phonemes ([d] and [t] with TMS to TongueM1), but that there would be inhibition of perception of the discordant items ([b] and [p] in this case). Behavioral effects were measured via reaction times (RTs) and error rates.’ \citep[p.~382]{dausilio:2009_motor}

Conjecture :
collective goals are represented motorically

I.e. sometimes, when two or more actions involving multiple agents are, or need to be, coordinated:

  1. Each represents a single outcome motorically, and
  2. in each agent this representation triggers planning-like processes
  3. concerning all the agents’ actions, with the result that
  4. coordination of their actions is facilitated.
There is more evidence for this conjecture than I have given here, but there is not a lot more converging evidence. This is a conjecture that we hope will be tested further rather than something we take to be established already.
What about coordination? There is a little bit of direct evidence for this that I won't mention. But I do want to take you through why the interagential structure of motor representation might in theory result in the agents actions being coordinated.
Consider this case. An agent fills a glass by holding it in one hand, holding the bottle in the other, bringing the two together and pouring from the bottle into the glass.
[*demonstrate].
It’s a familiar idea that motor representations of outcomes resemble intentions in that they can trigger processes which are like planning in some respects. These processes are like planning in that they involve starting with representations of relatively distal outcomes and gradually filling in details, resulting in a structure of motor representations that can be hierarchically arranged by the means-end relation \citep{bekkering:2000_imitation, grafton:2007_evidence}. Processes triggered by motor representations of outcomes are also planning-like in that they involve selecting means for actions to be performed now in ways that anticipate future actions \citep{jeannerod_motor_2006,zhang:2007_planning,rosenbaum:2012_cognition}.
Now in this action of moving a mug, there is a need, even for the single agent, to coordinate the exchange between her two hands. (If her action is fluid, she may proactively adjust her left hand in advance of the mug’s being lifted by her right hand \citep[compare][]{diedrichsen:2003_anticipatory,hugon:1982_anticipatory, lum:1992_feedforward}.) How could such tight coordination be achieved? Part of the answer involves the fact that motor representations and processes concerning the actions involving each hand are not entirely independent of each other. Rather there is a plan-like structure of motor representation for the whole action and motor representations concerning actions involving each hand are components of this larger plan-like structure. It is in part because they are components of a larger plan-like structure that the movements of one hand constrain and are constrained by the movements of the other hand.
We have just considered what is involved in performing an ordinary, individual action, where an agent fills a glass from a bottle, taking one in each hand and moving them in a carefully coordinated way. Compare this individual action with the same action performed by two agents as a joint action. One agent takes the glass while the other takes the bottle. The joint action is like the individual action in several respects.
First, the goal to which the joint action is directed is the same, namely to move the mug from here to there.
Second, there is a similar coordination problem---the agents’ two hands have to meet.
And, third, the evidence we have mentioned suggests that in joint action, motor representations and processes occur in each agent much like those that would occur if this agent were performing the whole action alone.
Why would this be helpful?
 
Suppose the agents' planning-like motor processes are similar enough that, in this context, they will reliably produce approximately the same plan-like structures of motor representations.
Then having a single planning-like motor process for the whole joint action in each agent means that
\begin{enumerate}
\item in each agent there is a plan-like structure of motor representations concerning each of the others’ actions,
\item each agent's plan-like structure concerning another's actions is approximately the same as any other agent's plan-like structure concerning those actions,
\item each agent's plan-like structure concerning her own actions is constrained by her plan-like structures concerning the other’s actions.
\end{enumerate}
So each agent’s plan-like structure of motor representations concerning her own actions is indirectly constrained by the other agents' plan-like structures concerning their own actions
by virtue of being directly constrained by her plan-like structures concerning their actions.
In this way it is possible to use ordinary planning-like motor processes to achieve coordination in joint action.
What enables the two or more agents' plan-like structures of motor representations to mesh is not that they represent each other's plans but that they processes motorically each other's actions and their own as parts of a single action.
 
So how does the joint action differ from the corresponding individual action?
There are at least two differences.
First, we now have two plan-like structures of motor representations because in each agent there is a planning-like motor process concerning the whole action.
These two structures of motor representations have to be identical or similar enough that the differences don’t matter for the coordination of the agents’ actions---let us abbreviate this by saying that they have to \emph{match}.
The need for matching planning-like structures is not specific to joint action;
it is also required where one agent observing another is able to predict her actions thanks to planning-like motor processes concerning the other’s actions (we mentioned evidence that this occurs above).
A second difference between the joint action and the individual action is this.
In joint action there are planning-like motor processes in each agent concerning some actions which she will not eventually perform.
There must therefore be something that prevents part but not all of the planning-like motor process leading all the way to action.
Exactly how this selective prevention works is an open question.
We expect bodily and environmental constraints are often relevant.
There may also be differences in how others’ actions are processed motorically \citep[compare][]{novembre:2012_distinguishing}.
\footnote{\citep[p.\ 2901]{novembre:2012_distinguishing}: 'in the context of a joint action—the motor control system is particularly sensitive to the identity of the agent (self or other) of a represented action and that (social) contextual information is one means for achieving this distinction'}
And inhibition could be involved too \citep[compare][]{sebanz:2006_twin_peaks}.
My proposal, then, is this. In both practical reasoning and motorically, sometimes agents are able to achieve coordination for joint action not by representing each others’ plans but by treating each other's actions and their own as if they were parts of a single action.
So perhaps joint action is not always only a matter of intention, knowledge or commitment: perhaps sometimes joint action constitutively involves motor representation.

In virtue of what
do very small scale joint actions
have collective goals?

Interagential structures of motor representations.

In virtue of what
are very small scale joint actions
ever trade-off cooperative?

Interagential structures of motor representations.

What about the Simple View Revised?

At this point I want to return to considering Simple View Revised. So far I haven’t offered an objection directly to this view.

Simple View

Two or more agents perform an intentional joint action
exactly when there is an act-type, φ, such that
each agent intends that
they, these agents, φ together
and their intentions are appropriately related to their actions.

Simple View Revised

... and

we engage in parallel planning;

for each of us, the intention that we, you and I, φ together leads to action via our contribution to the parallel planning

(where the intention, the planning and the action are all appropriately related).

[mistake to think that all joint action involves intention]
[link between iasmr and parallel planning (same structure)]

Some intentions specify collective goals.

Some intentions specify collective goals. But how do they do this? In some cases, intentions specify collective goals in virtue of their contents explicitly involving the general notion of a collective goal. For example, I might intend that we, you and I, perform actions which have the collective goal of moving this table.
But note that this is not the only way in which intentions can specify collective goals. I might conceive of an act-type, such as jumping together, in such a way that acts of this type essentially involve collective goals. In that case, my intention that we jump together specifies a collective goal, even though the general notion of a collective goal is not involved in its content.
There may also be other, more primitive ways in which intentions can require for their fulfilment that we perform actions directed to a collective goal.
This thought suggests that we can refine the Simple View ...

The Simple View

As you may recall, the Simple View says that

Two or more agents perform an intentional joint action
exactly when there is an act-type, φ, such that
each agent intends that
they, these agents, φ together
and their intentions are appropriately related to their actions.

This Simple View seemed inadequate for distinguishing shared agency from parallel but merely individual agency. Why? One reason was that the strangers blocking the aisle of an aeroplane are not exercising shared agency but do each intend that they, the two strangers, block the aisle. But we are now in a position to improve on the Simple View. Note that the strangers’ intentions do not require for their fulfilment that they, the strangers, perform actions with the collective goal of blocking the aisle. Indeed, by stipulation each stranger falsely believes that the other is not performing actions directed to blocking the aisle. So what each stranger believes is straightforwardly incompatible with blocking the aisle being a collective goal of their actions. This suggests that we can improve on the simple idea by requiring that the relevant intentions must require for their fulfilment actions with a corresponding collective goal.
What we should require, I think, is that the act type, φ, is one that involves a collective goal. Or at least the intention should require for its fulfilment that we perform actions with a collective goal.
This refinement of the Simple View allows it to distinguish the strangers blocking the aisle (who have no intentions specifying a collective goal) from the sisters blocking the aisle (whose intentions do specify a collective goal).

In virtue of what
do very small scale joint actions
have collective goals?

Interagential structures of motor representations.

In virtue of what
are very small scale joint actions
ever trade-off cooperative?

Interagential structures of motor representations.

break

Question

What distinguishes genuine joint actions from parallel but merely individual actions?

Requirement

Any account of shared agency must draw a line between joint actions and parallel but merely individual actions.

Aim

Which forms of shared agency underpin our social nature?

This is the organising question for our project (the project to be investigated in this series of lectures). Of course there will be lots of further questions, but I like to have something simple to frame our thinking and this question serves that purpose.
My hope is that by answering this seemingly straightforward question, we will be in a position to answer the hard question about which forms of shared agency underpin our social nature.
Let’s start by trying to get a pre-theoretical handle on the notion of joint action. I’ve already given you some examples, but it’s even better to use contrast cases ...
Recall our question, What distinguishes joint actions from parallel but merely individual actions?

shared intention

Consider the shared intention strategy
‘A first step is to say that what distinguishes you and me from you and the Stranger is that you and I share an intention to walk together—we (you and I) intend to walk together—but you and the Stranger do not. In modest sociality, joint activity is explained by such a shared intention; whereas no such explanation is available for the combined activity of you and the Stranger. This does not, however, get us very far; for we do not yet know what a shared intention is, and how it connects up with joint action’ \citep[p.~152]{Bratman:2009lv}.

‘joint activity is explained by such a ; whereas no such explanation is available for the combined activity’ of those acting in parallel but merely individually.

‘This does not, however, get us very far; for we do not yet know what a is’

Bratman, 2009 p. 152

So the idea is that shared intention is to joint action as intention is to ordinary, individual action.
Of course, invoking shared intention doesn’t by itself amount to explaining anything. It simply takes us in a circle. (Which is not necessarily bad.)
To characterise joint action, we have two tasks: (i) characterise shared intention; (ii) characterise a relation between shared intentions and events such that the events are joint actions.

Challenge: Give an account of shared intention.

Three broad strategies: Bratman, Gilbert or aggregate subjects

aggregate subject

Simple View

Two or more agents perform an intentional joint action
exactly when there is an act-type, φ, such that
each agent intends that
they, these agents, φ together
and their intentions are appropriately related to their actions.

Simple View Revised

... and

we engage in parallel planning;

for each of us, the intention that we, you and I, φ together leads to action via our contribution to the parallel planning

(where the intention, the planning and the action are all appropriately related).

Question

What distinguishes genuine joint actions from parallel but merely individual actions?

Requirement

Any account of shared agency must draw a line between joint actions and parallel but merely individual actions.

Aim

Which forms of shared agency underpin our social nature?

There’s a good chance we can find a somewhat satisfying answer to the question somewhere here. But should we be satisfied with answering the question?

a deeper understanding

But I argued that we need a deeper understanding. We are wrong to think that there is a single binary distinction, and we are wrong to think that we can identify a target for theorising by merely pointing to intuitive contrast cases. We need to do some work to turn intuitions into observations; these observations will then provide the basis for a theory.

Separate
the thing to be explained
from
the thing that explains it.

How to ground a theory of joint action?

Step 1: identify features ...

- collective goals

- coordination

- cooperation

- contralateral commitments

- experience

Step 2: ... which generate how questions.

A theory of joint action is then an attempt to answer the how questions.
CHALLENGE : Isn’t there something missing from this approach?
Here’s how I think Bratman would respond:

Bratman on strategic equilibrium: This ‘seems not by itself to ensure the kind of sociality we are after [...] a shared activity of the sort we are trying to understand---[...] in the relevant sense, walking together [... There are] important aspects of such shared activities that seem not to be captured [...] our job is to say what those aspects are and how best to understand them’

\citep[pp.~5--6]{bratman:2014_book}

How to ground a theory of joint action?

Step 1: identify features ...

- collective goals

- coordination

- cooperation

- contralateral commitments

- experience

Step 2: ... which generate how questions.

conclusion

Which forms of shared agency underpin our social nature?

So now we have the Revised Simple View that gives us an account of intentional shared agency, and the motor view that gives us an account of very small-scale, non-intentional shared agency.
More than one form of shared agency!
CHALLENGE 1: If I’m wrong---if there’s one central form of shared agency---then please show me.

Rakoczy et al (in progress)

This is just the beginning ...
\section{Four Challenges}
--- Rakoczy, Pacherie, Butterfill, Tomasello, Schilbach, Kita & Matthews (in progress)
\emph{The Diversity Challenge}
The fundamental challenge for an integrative framework of shared intentionality is to move beyond the 'one size fits all' assumption that one set of simple necessary and sufficient conditions can be given to characterize all forms of shared intentionality. Various dimensions along which different forms of shared intentionality vary need to be taken into account. For instance, cognitive representations and processes related to shared intentionality exist on various levels of complexity, ranging from conscious intentions of the form “I intend that we spend the rest of our lives together” to largely unconscious processes of alignment and coordination that occur when partners sit side-by-side in rocking chairs or play drums together (Sebanz, Knoblich & Butterfill, 2011). Different forms of shared intentionality vary dramatically with regard to both their scale, ranging from two people carrying a table together to nation states of millions of people fighting wars, and their scope, ranging from transient projects such as shaking hands to long-lasting projects such as those implied by jointly adopting a child. From a psychological point of view, such different forms of shared intentionality as, say, two young children rolling a ball back and forth together and two adults writing a paper together, will vary dramatically with regard to the participants’ representations of sharedness behind this activity. Finally, varieties of shared intentionality differ to the degree to which they involve conventionality and rule following: conventions and rules are necessary for marrying or playing chess, but not for carrying a table together (e.g. Searle, 1969, 1995). Recognizing and charting diversity is relatively easy. What is still missing – and what the Diversity Challenge demands – is a systematic, comprehensive and differentiated conceptual apparatus which brings into view what is common to all forms of shared intentionality, what sets them apart from merely individual intentionality, and how various forms relate to each other. Meeting the Diversity Challenge immediately raises three further, no less formidable challenges.
\emph{The Emergence Challenge}
The first challenge concerns emergence: What is involved in the transitions between different forms of shared intentionality? What are the ontogenetic motors, the neural underpinnings, and the phylogenetic mechanisms underlying these transitions?
\emph{The Interface Challenge}
A second challenge that follows from meeting the Diversity Challenge arises from the fact that different forms of shared intentionality will involve representations shared at various levels. These representations typically differ in their formats (from conceptual to motor), specificity of information, exact timing, and availability to consciousness. Thus, we need to understand how shared representations at different levels interface, and how the processes at play at each level interact to yield (or reliably give the appearance of) integrated systems of shared representations. Philosophy and cognitive science have made significant progress concerning such interface issues for individual intentionality (e.g. Pacherie, 2008, 2011; Butterfill & Sinigaglia, 2014). The challenge is now to do the same for shared intentionality.
\emph{The Integration Challenge}
So far the challenges have concerned shared intentionality narrowly. But shared intentionality is arguably the key to understanding a manifold of cognitive capacities. Here we focus on social cognition and communication. How are the different forms of shared intentionality variously integrated with abilities to track others’ mental states and actions, and to communicate with others?

Which forms of shared agency underpin our ?

How to characterise shared agency? I think we can’t. Instead there is a bundle of important features that do not necessarily hand together very tightly: collective goals, coordination, cooperation, commitment, an interpersonal sense of agency, ...
CHALLENGE 2: If I’m wrong---if there’s some shared-ness to be captured---then please show me.
Our social nature includes everything from things which happen in a moment, as when we pass a stone between us cooperatively, to things which take many lifetimes to unfold, like the construction of a sewer system.
We have barely begun to explore that in this course. Perhaps that should be the topic of another module.
CHALLENGE 3: What more is there to explain about humans’ social nature? What are the questions that we have yet to even ask?

Conjecture:

Let me finish with a conjecture ...

Shared agency begins with primitive forms of acting together.

Our actions have a collective goal in virtue of some simple coordination mechanisms which do not require reflection or understanding.

Interagential structures of motor representation provide a basis for insight.

Ultimately, sophistication hinges on our becoming self-reflective aggregate agents.

 

Sharing a Smile

 
\section{Sharing a Smile}
 
\section{Sharing a Smile}
[THIRD POINT: smiling is a goal-directed action, the goal of which is to smile that smile]
My topic is sharing a smile. But first think about ordinary, individual actions like genuine smiles.
What distinguishes a genuine smile from a muscle spasm or the exhalation of wind?
I want to suggest that it's this: the smile is a goal-directed action where the goal is to simile that smile.
But why think of the smile as goal-directed? Because smiling the smile requires considerable motor coordination: it’s not a matter of simple muscle contractions but more like the production of a phonetic gesture where context affects what is needed to realise the smile.
Further, like grasping an object or articulating a particular phoneme, it is an action that can be realised by different bodily movements in different contexts.
This is why I put slides of two quite different but both genuine smiles.
[Objection:]
Now you might say that the smile can't be goal-directed because is isn't explicable by appeal to belief, desire and intention
This is because the genuine smile is spontaneous and not something that can be produced at will (although it could probably be inhibited, at least to some extent); after all, this is what distinguishes the genuine from the polite smile.
\footnote{
From web source: The Duchenne smile involves both voluntary and involuntary contraction from two muscles: the zygomatic major (raising the corners of the mouth) and the orbicularis oculi (raising the cheeks and producing crow's feet around the eyes). The zygomatic major can be voluntarily contracted but many people can't voluntarily contract the orbicularis oculi muscle.
}
So now we might be tempted by the view that a smile is merely caused by an emotion in the way that gasses can cause you to burp.
[Reply:]
Maybe there are smiles like this, but some genuine smiles are sustained.
And what sustains them is a process of controll
How could this be if such smiles are not consequences of beliefs, desires and intentions?
I think a reasonably natural view here is to think that part of what makes an event a smile, a goal-directed action and not just a muscle spasm caused by excess wind, is the way that motor control is involved. Specifically, the genuine smile will involve a motor representation of the outcome, the smile, and this motor representation will lead to movements by way of planning-like motor processes.
But you don't have to buy this to agree with me.
All you have to accept is that actions like some smilings can be goal-directed and controlled even in the absence of relevant beliefs, desires and intentions.
I think smiles fall into the category of actions like graspings, reachings and gesturings which are goal-directed but do not necessarily involve intention.
 
So far, then, I've suggested that smiling is a goal-directed action, the goal of which is to smile that smile.
Now imagine a situation where a single individual encounters and event (a clown’s falling) which causes amusement which causes her to smile
Note that the smile also modulates the emotion; if, for example, she supressed the smile, the quality of her amusement would change.
How could we gain insight into the fine-grained dynamics of others’ emotions?
How could we ever appreciate the unfolding of another’s grief, or the way their engagement leads to an explosion of ecstasy at the climax of a concert?
Part of the answer is obvious: by being there, with them.
[Not that this is the only possibility --- in some cases we might be told.]
But how exactly does being there, in the same situation help?
Merely being in the same situation is surely not enough.
It’s not enough that we each experience amusement, grief or ecstasy.
After all, individuals are different. Different individuals’ feelings don’t unfold in the same way just because they are in the same situation.
It’s just here that collective intentionality is relevant.
\textbf{What is involved in sharing a smile?}
Minimally, I think there have to be two kinds of connection between us for us to share a smile.
First, the way your smile unfolds is shaped by how mine unfolds and conversely.
I also suppose that our smiles can be minutely coordinated with each other.
But it’s not just that our smiles are interdependent in this way ...
It’s also that each of our smiles is shaping the way our amusement unfolds.
So the way your amusement unfolds is being controlled by, and controlling, the way mine unfolds.
In sharing a smile, we are emotionally locked together.
[*todo: remove motor stuff for this talk! Also: don't lose sight of idea that control is a way of knowing.]
[*todo: need slide with control arrows highlighted (my emotion controls yours).]
[*Structure: (i) I know because my emotion controls yours; (ii) But if my emotion controls yours, how can yours be amusement at the clown's falling? because control is partial, and reciprocal; (iii) But the mere fact of control isn't enough for knowledge; rather, control must show up in experience somehow. After all, for all I have said so far, we might, in sharing a smile, be unaware that our emotions are locked together. (iv) There must be an experience that is distinctive of sharing a smile. (iv) Note that I don’t want to say that someone who is sharing a smile needs to understand the situation in the way I’m describing it. All I'm claiming is that the fact of reciprocal control somehow affects our awareness. (v) It may affect in our awareness insofar as we are sensitive to contingencies between our own actions' and others' actions, and between our actions and the causes of them. (vi) So my position is this: the reciprocal control justifies each agent in making judgements about how the others' amusement is unfolding, and this justification is at least indirectly available to the agents by virtue of their having experiences characteristic of sharing a smile. ]
Our being emotionally locked together means that to a significant extent I am feeling what you are feeling, that the way my amusement is unfolding matches they way your amusement is unfolding. So if you know how your own amusement is unfolding and you know that we are emotionally locked together, you can know much about how my amusement is unfolding. So joint expressions of emotion like sharing a smile have the potential to enable us to know not just that others are amused but how their amusement is unfolding.
But the fact of reciprocal control (which means our emotions are locked together) together doesn’t all by itself mean that we can know how each other’s emotions are unfolding. After all, for all I have said so far, we might, in sharing a smile, be unaware that our emotions are locked together. Now you might think this sounds implausible because its hard to imagine sharing a smile without an experience that is distinctive of sharing a smile. And it might be natural to describe this experience as an experience of sharing. But even if that is correct, it’s necessary to say exactly why someone who is sharing a smile is in a position to know things about how the other’s emotion is unfolding.
I don’t want to say that interaction only helps if you know that your emotions are locked together. That is, I don’t want to say that someone who is sharing a smile needs to understand the situation in the way I’m describing it. But minimally the fact of reciprocal control must somehow feature in our awareness.
[*The idea in outline: \begin{enumerate} \item the ways our amusements unfold is locked together \item this is in part because a single motor plan has two functions, production of your smile and prediction of my smile \item the single motor process means that we might experience being locked together in some way (not that our emotions are locked together but that our actions are, in something like (but not exactly) we experience actions when seeing ourselves in a mirror or on CCTV (check Johannes’ discussion of this)). \end{enumerate} ]
Here I want to offer a wild conjecture. In joint expressions of emotion there is a single motor plan with two functions, production and prediction. The motor plan both produces your own smile and enables you to predict the way the other’s smile will unfold. [*missing step about monitoring and experience. (The Haggard idea: motor planning can give rise to experiences concerning one's own actions \citep{Haggard:2005sc}.)] Because your plan has this dual function, your experience of the other’s (my?) smile is special. From your point of view, it is almost as if the other is smiling your smile.% \footnote{ Joel caricatured this idea seeing me eating fruit: ‘it’s almost as if I’m eating that fruit.’ } This means that sharing a smile has characteristic phenomenology.
This odd phenomenological effect means that in sharing a smile we can each think of the situation almost as if there were a single smile. And almost as if there were a single state of amusement. (In thinking of the situation like this it is important that we have a subject-neutral conception of the amusement and an agent-neutral conception of the smile.% \footnote{ Tom Smith asked about this. I clarified that I wasn’t suggesting there was a state of amusement which is ours, nor that the subjects are thinking of the situation in this way. That’s the point of the appeal to subject-neutral amusement. It’s a partial model of the situation. } [*Here I think I’m shifting back from the perspective of the participants in sharing a smile to the perspective of the theorist. Probably what I should say is, first, that a theorist can think of the situation in this way and use this to argue, second, that there is a simple, partial conception of the situation that doesn’t require understanding reciprocal control and interlocking emotions but is sufficient for each smiler to have knowledge of the way the other’s emotion unfolds.] So my suggestion is that in sharing a smile you experience my smile almost as if it were yours (or: you experience me almost as if I were smiling your smile), and so you might also experience our situation almost as if it involved a single state of amusement.
It's more like we each plan a single smile.
But---to reply to the objection---these plans have a dual function. Your plan both produces your own smile and enables you to simulate---to experience---my smile. And likewise for my plan. The interdependence of our smilings means that we could each think of the situation as if it were one in which a single state of amusement were responsible for our actions.