Examples

Block World Problem

In this problem, we require the LLM to generate a plan to control a robotic arm to stack blocks on a plane as specified.

Fill in your gpt api key in the file case1/auto.py. Then in the case1 directory, run the following command to start the tool.

python auto.py

The tool reads the prompt from first_prompt_formula.txt and instructs the LLM to generate FOL specifications in the format required by the Z3 Python API.

After manually checking the correctness of the generated specification, we proceed to request the LLM to generate a plan using prompt from first_prompt_plan.txt.

Important

You can find the correct specification code in case1/draft.py for this example.

The specific problem is as follows.

(define (problem BW-rand-6)
(:domain blocksworld-4ops)
(:objects b1 b2 b3 b4 b5 b6 )
(:init
(arm-empty)
(on b1 b6)
(on b2 b3)
(on-table b3)
(on b4 b1)
(on-table b5)
(on-table b6)
(clear b2)
(clear b4)
(clear b5)
)
(:goal
(and
(on b1 b2)
(on b5 b3)
(on b6 b4))
)
)

The figure below shows the initial state and the goal state of the problem.

The first plan generated by LLM is shown below.

START-PLAN
pick-up b5
stack b5 b3
unstack b1 b6
put-down b1
pick-up b6
stack b6 b4
pick-up b2
stack b2 b1
END-PLAN

After customizing the verifier (inherited from class Fol_verifier) based on the task, the plan is verified against the specification, and the reasoning is provided as follows.

The plan is invalid according to the steps below.
1. pick-up b5
2. stack b5 b3

The reasoning is passed as a new prompt to the LLM to generate a revised plan, which is then verified again. This process continues until a plan that satisfies the specification is obtained or the user-defined iteration limit is reached.

Correct result is shown below.

START-PLAN
unstack b4 b1
put-down b4
unstack b1 b6
put-down b1
pick-up b6
stack b6 b4
unstack b2 b3
put-down b2
pick-up b5
stack b5 b3
pick-up b1
stack b1 b2
END-PLAN

Navigation Problem

In this case study, we address a navigation problem that requires the LLM to devise a plan for a driver while adhering to temporal constraints.

Fill in your gpt api key in the file case2/auto.py. Then in the case2 directory, run the following command to start the tool.

python auto.py

We read the prompt from first_prompt_formula.txt and instruct the LLM to generate LTL formulas.

The temporal constraints in this problem is You should have been to C and D before you go to G. The correct LTL formula is shown below.

G(!(g) U (c & d))

After manually checking the correctness of the generated specification, we proceed to request the LLM to generate a plan using prompt from first_prompt_plan.txt.

The specific problem is as follows.

(define (problem driver-1)
(:cities A B C D E F G)
(:constraints
(You should have been to C and D before you go to G))
(:roads
(A-B)
(A-E)
(E-D)
(B-C)
(B-F)
(F-G)
)
(:init
(A))
(:goal
(G))
)

The figure below shows the cities and roads of the problem.

The first plan generated by LLM is shown below.

START-PLAN
A -> B
B -> C
C -> B
B -> F
F -> G
END-PLAN

After customizing the verifier (inherited from class LTL_verifier) based on the task, the plan is verified against the specification, and the reasoning is provided as follows.

The plan is invalid according to the steps below.
A -> B
B -> C
C -> B
B -> F
F -> G

The reasoning is passed as a new prompt to the LLM to generate a revised plan, which is then verified again. This process continues until a plan that satisfies the specification is obtained or the user-defined iteration limit is reached.

Correct result is shown below.

START-PLAN
A -> B
B -> C
C -> B
B -> A
A -> E
E -> D
D -> E
E -> A
A -> B
B -> F
F -> G
END-PLAN