Chapter V: Conclusion and Future Work
B.3 Safety Lemmas
95 If another agent Ag0occupies a state in this set, then execution of that agentβs backup plan will cause it to intersect with the set of grid points that are in agents setZAg(π 0). Let
SR, π΅ π
Ag0 (Ag, π 0) = Γ
π βSπ΅ π
Ag0(Ag)
Rβ1
Ag0(π ).
This is the set of all states backward reachable to the states in Sπ΅ π
Ag0(Ag, π 0). If an agent Ag0 occupies any of these states, it will end up in a state where its backup plan will intersect with agent Agβs potential grid points that are defined inZAg. We project this set of states to a set of grid points as
GR, π΅ π
Ag0 (Ag, π 0) ={Gπ΄π0(π ) | π β Sπ΄ππ΅ π0(π΄π, π 0)}.
Note, this set of grid points is shown in the right-most subfigure in Fig. B.1. The bubble is then defined as the union of all the sets of grid points specified above.
Definition 26(Bubble). Let us consider an agent Ag with stateπ 0 β πAgand agent Ag0be another agent. Then the bubble of Ag with respect to agents of the same type as Ag0is given by
BAg/Ag0(π 0) , ZAg(π 0) βͺ GR
Ag0(Ag, π 0) βͺ GR, π΅ π
Ag0 (Ag, π 0). Note that under almost all circumstances, we should have
ZAg(π 0) β GR
Ag0(π΄π, π 0) β GR, π΅ π
Ag0 (Ag, π 0) so BAg(π 0) is simply equal to GR, π΅ π
Ag0 (Ag, π 0). This holds true for the abstract dynamics we consider in this paper. This means the bubble contains any grid points in which another agent Ag0occupying those grid points can interfere (via its own forward reachable states or the backup plan it would use in any of its forward reachable states) with at least one of agent Agβs next possible actions and the backup plan it would use if it were to take any one of those next actions.
Proof: This follows from the definition of the agent bubble, whose construction is defined in Section B.2.
The following lemma states that an agent Ag following the agent protocol will not take an action for which it violates the safety of its own backup plan.
Lemma 6. If Ag is following the agent protocol, and πAg,π π(π’) = T, Ag will only choose an action π β π΄ππ‘Ag for which the following condition holds: βAg0 β π,
Β¬( (Ag, π)β₯π΄π0), whereπ ={Ag}.
Proof: We prove this by using definitions of elements in the agent protocol.
1. Let us first show that any action π β π΄ππ‘Ag that Ag takes will satisfy the oracles in the top two tiers (safety and traffic rules) of Agβs profile defined in Section. 3.5.
a) According to the Action Selection Strategy defined in Section 3.5, Ag will choose one of three actions: the agentβs intended actionππ, the best straight actionππ π‘, or its backup plan actionππ π.
b) Let us consider the actionsππandππ π‘.
i. Bothππandππ π‘are selected via the behavioral profile and consistent- function evaluator defined in Section 3.5.
ii. Since πAg,π π(π’) = T, the agent will have at least one action (ππ π) for which the top two tiers of specifications are satisfied.
iii. By definition of the behavioral profile and the consistent evaluator function, if πAg,π π(π’) = T, the safety backup plan action ππ π will always be chosen over an action where any of the specifications in the top two tiers of the profile are not satisfied.
iv. By 1(b)ii and 1(b)iii, Ag will have π β π΄ππ‘π΄π and will choose an action for which the top two tiers of the behavioral profile are satisfied and thusππ andππ π‘ are actions where all oracles in the top two tiers of the profile are satisfied.
c) Let us consider the actionππ π.
i. This follows from the assumption that πAg,π π(π’) = T and the defi- nition ofπAg,π π(π’).
97 2. If the oracles in the top two tiers are satisfied by an actionπ, by the definition of the oracles in Section 3.5, this implies that the action π will take Ag to a stateπ 0and the system will be in a new global stateπ’0whereππ΄π,π π(π’0)=T.
3. ππ΄π,π π(π’0) =π means Ag will end up in a state where ππ π will be an action that satisfies traffic rules, avoids inevitable collision with static obstacles, and thus will not violate its own safety backup plan actionΒ¬( (π΄π, ππ)β₯Ag).
The following lemma states that if all Agβπare following the agent protocol, any agent Ag will not take an action for which it collides with or violates the safety backup plan of any agent with higher precedence.
Lemma 7. If Ag is following the agent protocol, and πAg,π π(π’) = T, Ag will only choose an action π β π΄ππ‘Ag for which the following two conditions hold: 1) Gπ΄π(π , π) β© (βͺπ΄π0βπGAg0(π 0, π0)) = β and 2) βAg0 β π, Β¬( (π΄π, π)β₯π΄π0), where the setπ , {Ag0|Ag βΊ Ag0}, i.e. agents with higher precedence than Ag.
Proof: We prove this by using arguments based on the definition of precedence, the agent protocol, and agent dynamics.
1. Let us first consider all Ag0where AgβΊ Ag0and π΄π0βBAg(π ). a) Proof by Lemma 5.
2. Now, let us consider all Ag0where AgβΊ Ag0and Ag0β BAg(π ).
3. According to Lemma 6, Ag will only take an action that satisfies all oracles in the top two tiers, including
πAg, dynamic safety(π , π, π’).
4. Sinceπis such thatπAg, dynamic safety(π , π, π’) =T, by definition of the oracle, π΄π will not cause collision with anyπ΄π0β BAg(π ).
5. For any AgβΊ Ag0, where Ag0has higher precedence than Ag, then projlong(Ag) <
projlong(Ag0), i.e. Ag0is longitudinally ahead of Ag.
6. In order for (Ag, π)β₯Ag0, the action π would have to be such that π π = ππ΄π(π , π), and πΏ π(π π) = πΏ π(π 0) and projlong(Ag) > projlong(Ag0), where Ag is directly in front of Ag0.
7. Because of the agent dynamics defined in Section 3.3, any π such that (Ag, π)β₯Ag0will requireG (Ag, π) β© G (Ag0) β β .
8. Thus, any such actionπwill not satisfy the oracle πAg, dynamic safety(π , π, π’).
9. SinceπAg,π π(π’) =T, by Assumption 6 in Section B.4, the agent will have at least one actionππ πfor which
πAg, dynamic safety(π , π, π’) =T.
10. Since the agent will only choose an action for which
πAg, dynamic safety(π , π, π’) = T and it always has at least one action ππ π that satisfies the oracle, the agent will always choose an action for which
πAg, dynamic safety(π , π, π’) = T and thus will take an action such that βπ΄π0 β πΒ¬( (Ag, π)β₯Ag0).
The following lemma states that if all Agβπare following the agent protocol, any agent Ag will not take an action for which it collides with or violates the safety backup plan of any agent with lower precedence.
Lemma 8. If Ag is following the agent protocol, and πAg,π π(π’) = T, Ag will only choose an action π β π΄ππ‘Ag for which the following two conditions hold: 1) Gπ΄π(π , π) β© (βͺπ΄π0βπGAg0(π 0, π0)) = β and 2) βπ΄π0 β π, Β¬( (π΄π, π)β₯π΄π0), where the setπ , {Ag0|Ag0 βΊAg}, i.e. agents with lower precedence than Ag.
Proof: We prove this by using arguments based on the definition of precedence, the agent protocol, and agent dynamics.
1. Let us first consider all Ag0where AgβΊ Ag0and Ag0βBAg(π ). a) Proof by Lemma 5.
2. Now, let us consider all Ag0where AgβΊ Ag0and Ag0β BAg(π ).
3. According to 3, Ag will only take an action that satisfies all oracles in the top two tiers, includingπAg, dynamic safety(π , π, π’).
4. Sinceπis such thatπAg, dynamic safety(π , π, π’) =T, by definition of the oracle, π΄π will not cause collision with anyπ΄π0β BAg(π ).
99 5. According to the Action Selection Strategy defined in Section 3.5, Ag will choose one of three actions: the agentβs intended action ππ, the best straight actionππ π‘, or its backup plan actionππ π.
6. Let us consider the backup plan actionππ π.
a) By violation of safety backup plan,( (Ag, ππ π)β₯Ag0)only if πΏ π(Ag) = πΏ π(Ag0).
b) W.l.o.g., let us consider Ag0that is directly behind Ag.
c) Sinceπ
Ag0,π π(π , π’) =T, by Assumption 6 in Section B.4,
πAg, dynamic safety(π , ππ π, π’) =T, meaning Ag0will be far enough behind Ag so that if Ag executes its backup plan action ππ π, Ag0 can safely execute its own backup plan action.
d) Thus, by Definition 19,Β¬( (Ag, ππ π)β₯Ag0). 7. Let us consider the best straight actionππ π‘.
a) This follows from the arguments made in 6, since ππ π‘ is a less severe action thanππ π.
8. Let us consider the intended actionππ.
a) Let us consider whenπΎAg={straight}. i. This follows from 6.
b) Let us consider whenπΎAgβ {right-turn,left-turn}.
i. If Ag takes such an action, Ag will end up in a state whereπ΅π’(Ag0) β π΅π’(Ag) and from Definition 19, agents in different bundles cannot violate each othersβ backup plans.
c) Let us consider whenπΎAg β {right-lane change,left-lane change}. i. (Ag, ππ)β₯Ag0 when ππ is a lane change and the agents Ag and
Ag0 are at a state such that π π = π(π , ππ) and π 0
π = π(π 0, ππ π), respectively, where π(π π, π 0
π) < ππ ππ π π, where π(π π, π 0
π) is the π2 distance between π π andπ 0
π.
ii. When this condition holds, the agentβs max-yielding-not-enough flagFAg(π’, ππ)defined in Section 13 will be set.
iii. According to the action-selection strategy, Ag will only takeππwhen FAg(π’, ππ) =F.
iv. Thus, Ag will only takeππ whenΒ¬( (π΄π, ππ)β₯π΄π0).
The following lemma states that if all Agβπare following the agent protocol, any agent Ag will not take an action for which it collides with or violates the safety backup plan of any agent with equal precedence.
Lemma 9. If Ag is following the agent protocol, and πAg,π π(π’) = T, Ag will only choose an action π β π΄ππ‘Ag for which the following two conditions hold:
1)GAg(π , π) β© (βͺAg0βπGAg0(π 0, π0)) =β and 2)βAg0 βπ,Β¬( (Ag, π)β₯Ag0), where the setπ , {Ag0|Ag0βΌAg}, i.e. agents with equivalent precedence as the agent.
Proof: We prove this by using arguments based on the definition of precedence, Agent Dynamics, and the agent protocol.
1. Let us first consider all Ag0where AgβΊ Ag0and Ag0βBAg(π ). a) Proof by Lemma 5.
2. Now, let us consider all Ag0where AgβΊ Ag0and Ag0β BAg(π ).
3. Let us first consider the agent itself, since an agent has equivalent precedence to itself.
a) This is true by Lemma 6.
4. This can be proven for any other agents of equivalent precedence that is not the agent itself as follows.
5. Agents with equal precedence take actions simultaneously so πAg, dynamic safety(π , π, π’) does not guarantee no collision.
6. According to the Action Selection Strategy defined in Section 3.5, Ag will choose one of three actions: the agentβs intended action ππ, the best straight actionππ π‘, or its backup plan actionππ π.
7. By definition of precedence assignment, any Ag0for which Ag0βΌAg will be such thatπΏ π(Ag) β πΏ π(Ag0).
8. Let us show if Ag selectsππ π, it will 1) not collide with any Ag0 β π and 2)
Β¬( (Ag, ππ π)β₯Ag0).
101 a) W.l.o.g., let us consider Ag0where Ag0βΌ π΄π.
b) The flagFAgβ(π’, ππ) =Tif Ag0π intended actionππcauses collision with Ag or (π΄π0, ππ)β₯π΄π, i.e. it collides with or violates the safety of Agβs backup plan action.
c) By the action-selection-strategy, Ag0 will not take the action ππ when FAgβ(π’, ππ) = T, so this guarantees Ag will not collide with Ag0 when Ag takesππ π.
d) By the Agent Dynamics, Agβs backup plan action cannot cause Ag to end up in a position where it can violate Ag0βs backup plan without colliding with itβfor which Ag0βs flagFAg(π’, ππ)would be set.
9. Let us show that Ag will only choose an ππ π‘ if it will 1) not collide with Ag0β πand 2)Β¬( (Ag, ππ π‘)β₯Ag0).
a) Whenππ π‘ =ππ π, then the arguments in 8 hold.
b) Ag selects anππ π‘ that is notππ πonly when 1) its conflict cluster is empty (i.e. πΆAg=β ) or 2) when it has received a conflict request from another agent and it has won its conflict cluster resolution (i.e.πAg =T).
c) IfπΆAg = β , by definition of how conflict clusters are defined in Section 14, the agentβs actionππ π‘ will not cause Ag to collide with any Ag0 βπ, andβAg0β π,Β¬( (Ag, ππ π‘)β₯Ag0).
d) In the case Ag has received a conflict request and has won πAg, by Lemma 3, ifπAg =T, it will be the only agent in its conflict cluster that has won.
e) By definition of the conflict cluster, any Ag0βπΆAgwhere Ag βΌAg0will take a straight action.
f) Since agents of equivalent precedence are initially in separate lanes by 7 and any Ag0 β π will take a straight action, then πΏ π(π Ag,π‘+1) β πΏ π(π Agβ,π‘+1) when Ag takesππ π‘.
g) Thus, by definition of agent dynamics and Definition 19, the action will not cause Ag to collide with any Ag0β π, andβAg0 βπ,Β¬( (Ag, ππ π‘)β₯Ag0).
10. Let us show that Ag will only choose anππ if it will 1) not collide with any Ag0β πand 2)Β¬( (Ag, ππ)β₯Ag0).
a) Let us consider whenπΎAg=straightforππ.
i. This follows from the same arguments presented in 9.
b) Let us consider whenπΎAg β{right-turn,left-turn}forππ.
i. This follows from the fact that all other agents are following the agent protocol and will not take a lane-change action in the intersection, and because of the definition of the Agent Dynamics and Road Network.
c) Let us consider whenπΎAg β {right-lane change,left-lane change}. i. Ag will only take its intended actionππ if the flag
FAg(π’, ππ) =F, and in the case that it is part of a conflict cluster, it is the winner of the conflict cluster resolution, i.e. WAg =T.
ii. By definition ofFAg(π’, ππ), the agent will not takeππwhenππcauses Ag to collide with any agent Ag0β πor when it causesπ΄πto violate the safety of the back up plan of another agent π΄π0, i.e. βπ΄π0s.t.
(Ag, ππ)β₯Ag0.
iii. In the case the agent has received a conflict request and has won WAg, by Lemma 3, if WAg = T, it will be the only agent in its conflict cluster that has won.
iv. By definition of the conflict cluster, any Ag0βπΆAgwhere AgβΌ Ag0 will take its backup plan action ππ π, and thus π π = π(π , ππ π‘), and π 0
π =π(π , ππ π), whereπ(π π, π 0
π) β₯ππ πreq.
v. Thus,ππwill only be selected whenππdoes not cause Ag to collide with any Ag0βπand
βAg0 βπ,Β¬( (Ag, ππ)β₯Ag0).
The following lemma states that if all Agβπare following the agent protocol, any agent Ag will not take an action for which it collides with or violates the safety backup plan of any agent with incomparable precedence to it.
Lemma 10. If Ag is following the agent protocol, and πAg,π π(π’) = T, Ag will only choose an action π β π΄ππ‘Ag for which the following two conditions hold: 1) GAg(π , π) β© (βͺAg0βπGAg0(π 0, π0)) = β and 2) βAg0 β π, Β¬( (Ag, π)β₯Ag0), where the setπ , {Ag0|Ag0 Ag}, i.e. agents with precedence incomparable to the agent.
Proof: We prove this by using arguments based on the definition of precedence, Agent Dynamics, and the agent protocol.
103 1. Let us show when Ag choosesππ π, it will 1) not collide with any Ag0βπand
2)Β¬( (Ag, ππ π)β₯Ag0).
a) Since πAg,π π(π’) = T, the agent will have at least one action (ππ π) for which the top two tiers of specifications are satisfied.
b) By 1a, the actionππ πwill only take Ag into the intersection if the traffic light is green.
c) By Assumption 4, all traffic lights are coordinated so if agents respect traffic light rules, they will not collide.
d) By the assumption that all other Ag0βπare obeying the same protocol, each agent will only take actions that satisfy the top two tiers of their profile.
e) Any Ag0in a perpendicular bundle will not enter the intersection since they have a red light.
f) Thus, Ag cannot collide or violate the backup plan of agents in perpen- dicular bundles.
g) Any Ag0 in an oncoming traffic bundle must only take an unprotected left-turn when it satisfies
πAg, unprotected left-turn(π , π, π’).
h) Thus Ag will not collide or violate the backup plan of agents in bundles of oncoming traffic.
2. Let us show that when Ag choosesππ π‘, it will 1) not collide with any Ag0βπ and 2)Β¬( (Ag, ππ π‘)β₯Ag0).
a) Sinceππ π‘ is chosen according to the behavioral profile, it will only be a straight action that is notππ π as long as it satisfies the top-two tiers of the profile and more.
b) Thus,ππ π‘ will only take Ag into intersection if traffic light is green.
c) By the same arguments in 1, this holds.
3. Let us show that when Ag choosesππ, it will 1) not collide with any Ag0 βπ and 2)Β¬( (Ag, ππ)β₯Ag0).
a) Let us consider whenππis such thatπΎπ΄π=straight.
i. This follows from the same arguments presented in 2.
b) Let us consider whenππis such thatπΎπ΄π β {left-lane change,right-lane change}.
i. Ag will never select such an action at an intersection since πAg, intersection lane-change(π , π, π’)will evaluate toF.
c) Let us consider whenππis such thatπΎπ΄π β {left-turn, right-turn}. i. By the assumption that all other agents are following the agent protocol, all Ag0 that are in bundle perpendicular to π΅π’(π΄π) will not be in the intersection and will not collide with Ag.
ii. Further, the agent will only takeπΎAg =right-turnwhen
πAg, dynamic safety(π , π, π’) =TandπAg, traffic light(π , π, π’) =T. Thus,
Β¬( (Ag, ππ)β₯Ag0).
iii. For an action ππ where πΎAg = left-turn,Ag will only take ππ if πAg, traffic-light(π , π, π’)=Tand
πAg, unprotected left-turn(π , π, π’) =T.
iv. Since all agents are following the traffic laws based on Proof B.4, ππ΄π,traffic light(π , π, π’) =T means action will not cause the agent to collide with or violate the safety of the backup plan in perpendicular bundles.
v. By the definition of the unprotected-left-turn oracle, π΄π will only take the left-turn action when it does not violate the safety of the backup plan of agents in oncoming traffic.