Generator

`generator = TKGQAGenerator(table_name=args.table_name, host=args.host, port=args.port, user=args.user, password=args.password, db_name=args.db_name, first_draw_size=1000, paraphrased=args.paraphrased, bulk_sql_size=args.bulk_sql_size)` `module-attribute`

Question Types:

Simple: Timeline and One Event Involved
- Ask for the timeline
  - Timeline Position Retrieval
- Ask for the event
  - Temporal Constrained Retrieval
Medium: Timeline and Two Events Involved
- Timeline Position Retrieval => Temporal Constrained Retrieval
- Timeline Position Retrieval + Timeline Position Retrieval
Complex: Timeline and Three Events Involved
- Timeline Position Retrieval + Timeline Position Retrieval + Timeline Position Retrieval
- Timeline Position Retrieval + Timeline Position Retrieval + Timeline Position Retrieval

`TKGQAGenerator`

How human handle the temporal information and answer the temporal questions?

Information Indexing

When we see something, for example, an accident happen near our home in this morning. We need to first index this event into our brain. As we live in a three dimension space together with a time dimension, when we want to store this in our memory, (we will treat our memory as an N dimension space) - Index the spatial dimensions: is this close to my home or close to one of the point of interest in my mind - Index the temporal dimension: Temporal have several aspects - Treat temporal as Straight Homogenous(Objective) Timeline: Exact date when it happens, for example, [2023-05-01 10:00:00, 2023-05-01 10:30:00] - Treat temporal as Cycle Homogenous(Objective) sTimeline: Monday, First day of Month, Spring, 21st Century, etc. (You can also cycle the timeline based on your own requirement) - Treat temporal as Straight Heterogeneous(Subjective) Timeline: If you sleep during night, it will be fast for you in the 8 hours, however, if someone is working overnight, time will be slow for him. - Treat temporal as Cycle Heterogeneous(Subjective) Timeline: Life has different turning points for everyone, until they reach the end of their life. - Then index the information part: What happen, who is involved, what is the impact, etc.

So in summary, we can say that in our mind, if we treat the event as embedding, part of the embedding will represent the temporal dimension information, part of the embedding will represent the spatial dimension information, the rest of the embedding will represent the general information part. This will help us to retrieve the information when we need it.

Information Retrieval

So when we try to retrieval the information, especially the temporal part of the information. Normally we have several types:

Timeline Position Retrieval: When Bush starts his term as president of US?
- First: General Information Retrieval [(Bush, start, president of US), (Bush, term, president of US)]
- Second: Timeline Position Retrieval [(Bush, start, president of US, 2000, 2000), (Bush, term, president of US, 2000, 2008)]
- Third: Answer the question based on the timeline information
Temporal Constrained Retrieval: In 2009, who is the president of US?
- First: General Information Retrieval [(Bush, president of US), (Obama, president of US), (Trump, president of US)]
- Second: Temporal Constraint Retrieval [(Obama, president of US, 2009, 2016)]
- Third: Answer the question based on the temporal constraint information

Three key things here: - General Information Retrieval: Retrieve the general information from the knowledge graph based on the question - Temporal Constrained Retrieval: Filter on general information retrieval, apply the temporal constraint - Timeline Position Retrieval: Based on general information retrieval, recover the timeline information

Temporal Questions

We can try to classify the temporal questions from quite a few perspectives: - Based on Answer: Entity, Temporal - Based on Temporal Relations in Question: Before, After, During , etc. or First, Last, etc. - Based on Temporal Representation Type: Point, Range, Duration, etc. - Based on Complexity of Question: Simple (direct retrieval), Complex (Multiple hops with the three key things we mention above)

There is still no agreement or clear classification here, most of them stays in the first two. However, it is obvious that they have overlaps, so will not be the best way to advance the temporal embedding algorithms development.

We are trying to decompose the question into the three key parts we mentioned above, so we can evaluate the ability of the models for this three key capabilities.

Simple: Timeline and One Event Involved

Timeline Position Retrieval: When Bush starts his term as president of US?
- General Information Retrieval => Timeline Position Retrieval => Answer the question
- Question Focus can be: Timestamp Start, Timestamp End, Duration, Timestamp Start and End
Temporal Constrained Retrieval: In 2009, who is the president of US?
- General Information Retrieval => Temporal Constraint Retrieval => Answer the question
- Question Focus can be: Subject, Object, Predicate. Can be more complex if we want mask out more elements

Medium: Timeline and Two Events Involved

Timeline Position Retrieval + Timeline Position Retrieval: Is Bush president of US when 911 happen?
- (General Information Retrieval => Timeline Position Retrieval) And (General Information Retrieval => Timeline Position Retrieval) => Timeline Operation => Answer the question
- Question Focus can be: A new Time Range, A temporal relation (Before, After, During, etc.), A list of Time Range (Ranking), or Comparison of Duration
Timeline Position Retrieval + Temporal Constrained Retrieval: When Bush is president of US, who is the president of China?
- (General Information Retrieval => Timeline Position Retrieval) => Temporal Constraint Retrieval => Answer the question
- This is same as above, Question Focus can be: Subject, Object

Complex: Timeline and Multiple Events Involved

In general, question focus (answer type) will only be two types when we extend from Medium Level - Timeline Operation - (Subject, Predicate, Object)

So if we say Complex is 3 events and Timeline.

Timeline Position Retrieval + Timeline Position Retrieval + Timeline Position Retrieval: When Bush is president of US and Putin is President of Russian, is Hu the president of China?
- (General Information Retrieval => Timeline Position Retrieval) And (General Information Retrieval => Timeline Position Retrival) And (General Information Retrieval => Timeline Position Retrival) => Timeline Operation => Answer the question
Timeline Position Retrieval + Timeline Position Retrieval + Temporal Constrained Retrieval: When Bush is president of US and Putin is President of Russian, who is the president of China?

input will be a unified knowledge graph, it will be stored in a table subject subject_json predicate predicate_json object object_json start_time end_time

Source code in TimelineKGQA/generator.py

class TKGQAGenerator:
    """
    **How human handle the temporal information and answer the temporal questions?**

    ## Information Indexing
    When we see something, for example, an accident happen near our home in this morning.
    We need to first index this event into our brain.
    As we live in a three dimension space together with a time dimension,
    when we want to store this in our memory, (we will treat our memory as an N dimension space)
    - Index the spatial dimensions: is this close to my home or close to one of the point of interest in my mind
    - Index the temporal dimension: Temporal have several aspects
        - Treat temporal as Straight Homogenous(Objective) Timeline: Exact date when it happens, for example,
            [2023-05-01 10:00:00, 2023-05-01 10:30:00]
        - Treat temporal as Cycle Homogenous(Objective) sTimeline:
            Monday, First day of Month, Spring, 21st Century, etc.
            (You can also cycle the timeline based on your own requirement)
        - Treat temporal as Straight Heterogeneous(Subjective) Timeline:
            If you sleep during night, it will be fast for you in the 8 hours, however,
            if someone is working overnight, time will be slow for him.
        - Treat temporal as Cycle Heterogeneous(Subjective) Timeline: Life has different turning points for everyone,
        until they reach the end of their life.
    - Then index the information part: What happen, who is involved, what is the impact, etc.

    So in summary, we can say that in our mind, if we treat the event as embedding,
    part of the embedding will represent the temporal dimension information,
    part of the embedding will represent the spatial dimension information,
    the rest of the embedding will represent the general information part.
    This will help us to retrieve the information when we need it.

    ## Information Retrieval
    So when we try to retrieval the information, especially the temporal part of the information.
    Normally we have several types:

    - Timeline Position Retrieval: When Bush starts his term as president of US?
        - First: **General Information Retrieval** [(Bush, start, president of US), (Bush, term, president of US)]
        - Second: **Timeline Position Retrieval** [(Bush, start, president of US, 2000, 2000),
                                                   (Bush, term, president of US, 2000, 2008)]
        - Third: Answer the question based on the timeline information
    - Temporal Constrained Retrieval: In 2009, who is the president of US?
        - First: **General Information Retrieval** [(Bush, president of US), (Obama, president of US),
                                                   (Trump, president of US)]
        - Second: **Temporal Constraint Retrieval** [(Obama, president of US, 2009, 2016)]
        - Third: Answer the question based on the temporal constraint information

    Three key things here:
    - **General Information Retrieval**: Retrieve the general information from the knowledge graph based on the question
    - **Temporal Constrained Retrieval**: Filter on general information retrieval, apply the temporal constraint
    - **Timeline Position Retrieval**: Based on general information retrieval, recover the timeline information

    ## Temporal Questions
    We can try to classify the temporal questions from quite a few perspectives:
    - Based on Answer: Entity, Temporal
    - Based on Temporal Relations in Question: Before, After, During , etc. or First, Last, etc.
    - Based on Temporal Representation Type: Point, Range, Duration, etc.
    - Based on Complexity of Question: Simple (direct retrieval), Complex
        (Multiple hops with the three key things we mention above)

    There is still no agreement or clear classification here, most of them stays in the first two.
    However, it is obvious that they have overlaps,
        so will not be the best way to advance the temporal embedding algorithms development.

    We are trying to decompose the question into the three key parts we mentioned above,
        so we can evaluate the ability of the models for this three key capabilities.

    ### Simple: Timeline and One Event Involved
    - Timeline Position Retrieval: When Bush starts his term as president of US?
        - General Information Retrieval => Timeline Position Retrieval => Answer the question
        - Question Focus can be: Timestamp Start, Timestamp End, Duration, Timestamp Start and End
    - Temporal Constrained Retrieval: In 2009, who is the president of US?
        - General Information Retrieval => Temporal Constraint Retrieval => Answer the question
        - Question Focus can be: Subject, Object, Predicate. Can be more complex if we want mask out more elements

    ### Medium: Timeline and Two Events Involved
    - Timeline Position Retrieval + Timeline Position Retrieval: Is Bush president of US when 911 happen?
        - (General Information Retrieval => Timeline Position Retrieval) And (General Information Retrieval
            => Timeline Position Retrieval) => Timeline Operation => Answer the question
        - Question Focus can be: A new Time Range, A temporal relation (Before, After, During, etc.),
            A list of Time Range (Ranking), or Comparison of Duration
    - Timeline Position Retrieval + Temporal Constrained Retrieval:
        When Bush is president of US, who is the president of China?
        - (General Information Retrieval => Timeline Position Retrieval) =>
            Temporal Constraint Retrieval => Answer the question
        - This is same as above, Question Focus can be: Subject, Object

    ### Complex: Timeline and Multiple Events Involved
    In general, question focus (answer type) will only be two types when we extend from Medium Level
    - Timeline Operation
    - (Subject, Predicate, Object)

    So if we say Complex is 3 events and Timeline.

    - Timeline Position Retrieval + Timeline Position Retrieval + Timeline Position Retrieval:
        When Bush is president of US and Putin is President of Russian, is Hu the president of China?
        - (General Information Retrieval => Timeline Position Retrieval) And (General Information Retrieval
            => Timeline Position Retrival) And (General Information Retrieval => Timeline Position Retrival)
                => Timeline Operation => Answer the question
    - Timeline Position Retrieval + Timeline Position Retrieval + Temporal Constrained Retrieval:
        When Bush is president of US and Putin is President of Russian, who is the president of China?

    input will be a unified knowledge graph, it will be stored in a table
        subject
        subject_json
        predicate
        predicate_json
        object
        object_json
        start_time
        end_time
    """

    def __init__(
        self,
        table_name: str,
        host: str,
        port: int,
        user: str,
        password: str,
        db_name: str = "tkgqa",
        first_draw_size: int = 100,
        paraphrased: bool = False,
        bulk_sql_size: int = 100,
    ):
        # set up the db connection
        self.host = host
        self.port = port
        self.user = user
        self.password = password
        self.db_name = db_name
        self.connection = psycopg2.connect(
            host=self.host,
            port=self.port,
            user=self.user,
            password=self.password,
            dbname=self.db_name,
        )
        self.unified_kg_table = table_name
        # we also need to create a new table, we can call it
        self.unified_kg_table_questions = f"{self.unified_kg_table}_questions"
        self.cursor = self.connection.cursor()
        self.bulk_sql_size = bulk_sql_size
        self.first_draw_size = first_draw_size
        # create a table to store retrieval questions
        self.cursor.execute(
            f"""
            CREATE TABLE IF NOT EXISTS {self.unified_kg_table_questions} (
                id SERIAL PRIMARY KEY,
                source_kg_id BIGINT,
                question VARCHAR(1024),
                answer VARCHAR(1024),
                paraphrased_question VARCHAR(1024),
                events VARCHAR(1024)[],
                question_level VARCHAR(1024),
                question_type VARCHAR(1024),
                answer_type VARCHAR(1024),
                temporal_relation VARCHAR(1024) DEFAULT NULL
            );
        """
        )
        self.cursor.connection.commit()
        self.paraphrased = paraphrased
        with timer(the_logger=logger, message="Getting the events from the database"):
            self.cursor.execute(
                f"SELECT * FROM {self.unified_kg_table} ORDER BY RANDOM() LIMIT {self.first_draw_size};"
            )
            events_df = pd.DataFrame(self.cursor.fetchall())
            # set the column names
            columns = [desc[0] for desc in self.cursor.description]
            if len(events_df) > 0:
                events_df.columns = columns
            else:
                events_df = pd.DataFrame(columns=columns)
            self.events_df = events_df
        self.sample_simple_events = []
        self.sample_medium_events = []
        self.sample_complex_events = []

    def simple_question_generation(self):
        """
        ## Types of Questions
        This is used to generate the simple question, we will have two types of questions.

        For each type of questions, based on the answer or the question focus, we can further divide them into
        - Timeline Position Retrival
            - Start TimePoint
            - End TimePoint
            - Time Range
            - Duration
        - Temporal Constrained Retrieval (Ignore predicate for now)
            - Subject
            - Object

        Simple: Timeline and One Event Involved
        - Timeline Position Retrival: When Bush starts his term as president of US?
            - General Information Retrieval => Timeline Position Retrival => Answer the question
            - Question Focus can be: Timestamp Start, Timestamp End, Duration, Timestamp Start and End
        - Temporal Constrained Retrieval: In 2009, who is the president of US?
            - General Information Retrieval => Temporal Constraint Retrieval => Answer the question
            - Question Focus can be: Subject, Object, Predicate. Can be more complex if we want mask out more elements


        ## Templates
        To generate the questions, We can try to feed into the LLM, and generate the questions.
        However, the diversity of the questions is not guaranteed, so we can use the template to generate the questions.
        Then use LLM to paraphrase the questions.

        Template examples:
        - Timeline Position Retrival
            - Start TimePoint: When did {subject} start the term as {object}?
            - End TimePoint: When did {subject} end the term as {object}?
            - Time Range: When did {subject} serve as {object}?
            - Duration: How long did {subject} serve as {object}?
        - Temporal Constrained Retrieval
            - Subject:
                - Who is affiliated to {subject} from {timestamp start} to {timestamp end}?
                - Who is affiliated to {subject} in {timestamp}?
            - Object:
                - {subject} is affiliated to which organisation from {timestamp start} to {timestamp end}?
                - {subject} is affiliated to which organisation during {temporal representation}?

        ## Process
        - Extract {subject}, {predicate}, {object}, {start_time}, {end_time} from the unified graph
        - Generate the questions based on the template for each type
        - Use LLM to paraphrase the questions

        Output format will be:
        - {question}
        - {answer}
        - {paraphrased_question}
        - subject, predicate, object, start_time, end_time
        - {question_level} => Simple
        - {question_type} => Timeline Position Retrival, Temporal Constrained Retrieval
        - {answer_type} => Subject, Object | Timestamp Start, Timestamp End, Duration, Timestamp Start and End
        """
        # get records not yet generated questions

        insert_values_list = []
        bulk_sql_pointer = 0
        for item in self.sample_simple_events:
            event = self.events_df.iloc[item]
            questions = self.simple_question_generation_individual(
                subject=event["subject"],
                predicate=event["predicate"],
                tail_object=event["object"],
                start_time=event["start_time"],
                end_time=event["end_time"],
                template_based=True,
                paraphrased=self.paraphrased,
            )

            # insert each qa into the table, have a flat table
            for question_obj in questions:
                question_obj["source_kg_id"] = int(event["id"])
                # get dict to tuple, sequence should be the same as the sql command
                data = (
                    question_obj["source_kg_id"],
                    question_obj["question"],
                    question_obj["answer"],
                    question_obj["paraphrased_question"],
                    question_obj["events"],
                    question_obj["question_level"],
                    question_obj["question_type"],
                    question_obj["answer_type"],
                    "timeline",
                )
                insert_values_list.append(data)
                bulk_sql_pointer += 1
                if bulk_sql_pointer % self.bulk_sql_size == 0:
                    self.bulk_insert(values=insert_values_list)
                    insert_values_list = []

        self.bulk_insert(values=insert_values_list)

    @staticmethod
    def simple_question_generation_individual(
        subject: str,
        predicate: str,
        tail_object: str,
        start_time: str,
        end_time: str,
        template_based: bool = False,
        paraphrased: bool = False,
    ) -> List[dict]:
        """
        This will try to generate four questions belong to RE type

        The questions will be:
        - ? p o during the time range from start_time to end_time?
        - s p ? during the time range from start_time to end_time?
        - s p o from ? to end_time?
        - s p o from start_time to ?
        - s p o from ? to ?
        - [How long/What's the duration, etc.] ? for the statement s p o

        Args:
            subject (str): The subject
            predicate (str): The predicate
            tail_object (str): The tail_object
            start_time (str): The start time
            end_time (str): The end time
            template_based (bool): Whether you use the template based question generation
            paraphrased (bool): Whether you do the paraphrase for the question, if set to False,
                    then the paraphrased_question will be the same as the question

        Returns:
            dict: The generated questions
                - question
                - answer
                - paraphrased_question
                - events
                - question_level: Simple
                - question_type: The type of the question
                - answer_type: The type of the answer
        """

        questions = [
            {
                "question": f"??? {predicate} {tail_object} during the time range from {start_time} to {end_time}?",
                "answer": f"{subject}",
                "paraphrased_question": None,
                "events": [
                    f"{subject}|{predicate}|{tail_object}|{start_time}|{end_time}"
                ],
                "question_level": "simple",
                "question_type": "temporal_constrained_retrieval",
                "answer_type": "subject",
            },
            {
                "question": f"{subject} {predicate} ??? during the time range from {start_time} to {end_time}?",
                "answer": f"{tail_object}",
                "paraphrased_question": None,
                "events": [
                    f"{subject}|{predicate}|{tail_object}|{start_time}|{end_time}"
                ],
                "question_level": "simple",
                "question_type": "temporal_constrained_retrieval",
                "answer_type": "object",
            },
            {
                "question": f"{subject} {predicate} {tail_object} from ??? to {end_time}?",
                "answer": f"{start_time}",
                "paraphrased_question": None,
                "events": [
                    f"{subject}|{predicate}|{tail_object}|{start_time}|{end_time}"
                ],
                "question_level": "simple",
                "question_type": "timeline_position_retrieval",
                "answer_type": "timestamp_start",
            },
            {
                "question": f"{subject} {predicate} {tail_object} from {start_time} to ???",
                "answer": f"{end_time}",
                "paraphrased_question": None,
                "events": [
                    f"{subject}|{predicate}|{tail_object}|{start_time}|{end_time}"
                ],
                "question_level": "simple",
                "question_type": "timeline_position_retrieval",
                "answer_type": "timestamp_end",
            },
            {
                "question": f"{subject} {predicate} {tail_object} from ??? to ???",
                "answer": f"{start_time} and {end_time}",
                "paraphrased_question": None,
                "events": [
                    f"{subject}|{predicate}|{tail_object}|{start_time}|{end_time}"
                ],
                "question_level": "simple",
                "question_type": "timeline_position_retrieval",
                "answer_type": "timestamp_range",
            },
            {
                "question": f"[How long/What's the duration/etc] ??? for the statement "
                f"{subject} {predicate} {tail_object}",
                "answer": f"{end_time} - {start_time}",
                "paraphrased_question": None,
                "events": [
                    f"{subject}|{predicate}|{tail_object}|{start_time}|{end_time}"
                ],
                "question_level": "simple",
                "question_type": "timeline_position_retrieval",
                "answer_type": "duration",
            },
        ]
        if template_based:
            # we will random pick one from the template
            for question_draft in questions:
                this_type_templates = QUESTION_TEMPLATES[
                    question_draft["question_level"]
                ][question_draft["question_type"]][question_draft["answer_type"]]
                logger.debug(f"this_type_templates: {this_type_templates}")
                random_pick_template = random.choice(this_type_templates)
                # replace {subject}, {predicate}, {tail_object}, {start_time}, {end_time} with the real value
                question_draft["question"] = random_pick_template.format(
                    subject=subject,
                    predicate=predicate,
                    tail_object=tail_object,
                    start_time=start_time,
                    end_time=end_time,
                )

        if paraphrased:
            for question_obj in questions:
                paraphrased_question = paraphrase_simple_question(
                    question=question_obj["question"]
                )
                logger.info(f"paraphrased_question: {paraphrased_question}")
                question_obj["paraphrased_question"] = paraphrased_question

        return questions

    def medium_question_generation(self):
        """
        This will involve mainly two types of questions

        - **Timeline Position Retrival => Temporal Constrained Retrieval**
        - **Timeline Position Retrival + Timeline Position Retrival**

        ---

        - question_level: medium
        - question_type:
            - timeline_recovery_temporal_constrained_retrieval
            - timeline_recovery_timeline_recovery
        - answer_type:
            - entity:
                - subject:
                - object
            - temporal related
                - Infer a new time range: Union/Intersection
                - Infer a temporal relation: Allen
                - Infer duration, and then compare
                - Note: Ranking will be the same as Allen, so it will be in **Complex** level

        """

        insert_values_list = []
        bulk_sql_pointer = 0

        for item in self.sample_medium_events:
            first_event = self.events_df.iloc[item[0]]
            second_event = self.events_df.iloc[item[1]]

            source_kg_id = first_event["id"] * 1000000 + second_event["id"]
            logger.debug(f"Generating question for source_kg_id: {source_kg_id}")
            questions = self.medium_question_generation_individual(
                first_event=first_event.to_dict(),
                second_event=second_event.to_dict(),
                template_based=True,
                paraphrased=self.paraphrased,
            )

            for question_obj in questions:

                question_obj["source_kg_id"] = int(source_kg_id)
                # get dict to tuple, sequence should be the same as the sql command
                data = (
                    question_obj["source_kg_id"],
                    question_obj["question"],
                    question_obj["answer"],
                    question_obj["paraphrased_question"],
                    question_obj["events"],
                    question_obj["question_level"],
                    question_obj["question_type"],
                    question_obj["answer_type"],
                    question_obj["temporal_relation"],
                )
                insert_values_list.append(data)
                bulk_sql_pointer += 1
                if bulk_sql_pointer % self.bulk_sql_size == 0:
                    self.bulk_insert(values=insert_values_list)
                    insert_values_list = []
        self.bulk_insert(values=insert_values_list)

    def medium_question_generation_individual(
        self,
        first_event: dict,
        second_event: dict,
        template_based: bool = True,
        paraphrased: bool = False,
    ) -> List[dict]:
        """

        Args:
            first_event (dict): The first event
            second_event (dict): The second event
            template_based (bool): Whether you use the template based question generation
            paraphrased (bool): Whether you do the paraphrase for the question, if set to False,
                    then the paraphrased_question will be the same as the question

        Returns:
            dict: The generated questions
                - question
                - answer
                - paraphrased_question
                - events
                - question_level: Medium
                - question_type: The type of the question
                - answer_type: The type of the answer

        - question_type:
            - timeline_position_retrieval_temporal_constrained_retrieval
                - For this one, the logic/reasoning/math part will be like: **TimeRange** +
                    Temporal Semantic Operation => **TimeRange**
                - Then the interesting part will be the Timeline Operation,
                    we have mentioned several types of operations below.
                    - There are mostly from numeric to semantic perspective
                    - Here is the reverse process: name it Temporal Semantic Operation
                    - So this is trying to convert the temporal semantic representation to
                        a numeric operation and then get a new operation.
            - timeline_position_retrieval_timeline_position_retrieval
                - For the logic/reasoning/math side, it actually is **TimeRange** vs **TimeRange** => Timeline Operation
                    - Get a way to ask about this comparison relations.
                    - So the question will mainly be about whether this relation is True, or which relation it is.
                    - For duration, we can ask about the duration of the two events, and then compare
                    - Or we can compare the event ranking based on the time range
            - there is another types: Three years before 2019,  who is the president of China? =>
                It is a valid question, but nobody will in this way.
                - It will be normally classified into **simple**: in 2016, who is the president of China?
                - Or it will be something like: Three years before bush end the term, who is the president of China?
                    => This will be classified into **Medium**,
                        and belong to the timeline_position_retrieval_temporal_constrained_retrieval
        - answer_type:
            - subject, object for timeline_position_retrieval_temporal_constrained_retrieval
                - subject
                - object
                - only focus on the first one, as the second will always become the first later
            - temporal related for timeline_position_retrieval_timeline_position_retrieval
                - Infer a new time range: Union/Intersection
                - Infer a temporal relation: Allen
                - Infer a list of time ranges: Ranking
                - Infer duration, and then compare

        Process:

        The quality of the question is not guaranteed by LLM directly if we just mask out the answer.
        So we will use the template to generate the questions, then use LLM to paraphrase the questions.

        """
        first_event_subject = first_event["subject"]
        first_event_predicate = first_event["predicate"]
        first_event_object = first_event["object"]
        first_event_start_time = first_event["start_time"]
        first_event_end_time = first_event["end_time"]

        second_event_subject = second_event["subject"]
        second_event_predicate = second_event["predicate"]
        second_event_object = second_event["object"]
        second_event_start_time = second_event["start_time"]
        second_event_end_time = second_event["end_time"]

        first_event_start_time_dt, first_event_end_time_dt = self.util_str_to_datetime(
            [first_event_start_time, first_event_end_time]
        )
        second_event_start_time_dt, second_event_end_time_dt = (
            self.util_str_to_datetime([second_event_start_time, second_event_end_time])
        )

        # first generate
        # timeline_position_retrieval => timeline_position_retrieval
        # this will ask for the subject or object in one of the event

        medium_type_1_a_questions = []
        questions = []
        """
        Timeline Position Retrieval => Temporal Constrained Retrieval Questions
        """
        # NOTES: question here actually is not used, because we will replace it with the template.
        # It is putting there to get the idea about the types of questions we are generating
        """
        The key part of this type is:
        We need to cover as many temporal semantic operations as possible
        - Before, After, During, this is the most common one and shown in the literature
        - Starts from the same time, Ends at the same time, Meets, Overlap, this is another way to add the
            temporal condition (inspired by the allen logic)
        - Above are from allen temporal logic and intersection/union
        - We can also add the ranking ones, however, before/after is the same as first/last, under this category
        - Then the rest is the one for duration, question like 3 years before, 3 years after, etc.

        So we have main two types of questions here:
        - Relation: Before, After, During ｜ Starts from the same time, Ends at the same time, Meets, Overlap
            - calculate the relation first, then generated based on template
            - Before: Who is the president of US before the end of Bush's term?
            - After: Who is the president of US after the start of Bush's term?
            - Starts from the same time: Who and Bush start their term as father and
                President of US respectively at the same time?
            - Ends at the same time: Who and Bush end their term as father and
                President of US respectively at the same time?
            - Meets: ?
            - During: Who is the president of US during Bush's term?
            - Overlap: Bush as the president of US meets who when the guy become the father?
        - Duration: 3 years before, 3 years after, 3 years after the end, etc.
            - calculate the duration first, then generated based on template
            - 3 years before: Who is the president of US 3 years before the end of Bush's term?
            - 3 years after: Who is the president of US 3 years after the start of Bush's term?
            - 3 years after the end: Who is the president of US 3 years after the end of Bush's term?
            - 3 years after the start: Who is the president of US 3 years after the start of Bush's term?
            - meets/during/overlap hard to get a time point, so not considered here.
        """
        # ask for first subject
        medium_type_1_a_questions.append(
            {
                "question": f"??? {first_event_predicate} {first_event_object} [Timeline Operation on "
                f"({first_event_start_time}, {first_event_end_time}) vs ({second_event_start_time}, "
                f"{second_event_end_time})] {second_event_subject}"
                f"{second_event_predicate} {second_event_object}?",
                "answer": f"{first_event_subject}",
                "paraphrased_question": None,
                "events": [
                    f"{first_event_subject}|{first_event_predicate}|{first_event_object}|"
                    f"{first_event_start_time}|{first_event_end_time}",
                    f"{second_event_subject}|{second_event_predicate}|{second_event_object}|"
                    f"{second_event_start_time}|{second_event_end_time}",
                ],
                "question_level": "medium",
                "question_type": "timeline_position_retrieval_temporal_constrained_retrieval",
                "answer_type": "subject",
                "temporal_relation": None,
            }
        )

        # ask for first object
        medium_type_1_a_questions.append(
            {
                "question": f"{first_event_subject} {first_event_predicate} ??? [Timeline Operation on "
                f"({first_event_start_time}, {first_event_end_time}) vs ({second_event_start_time},"
                f"{second_event_end_time})] {second_event_subject}"
                f" {second_event_predicate} {second_event_object}?",
                "answer": f"{first_event_object}",
                "paraphrased_question": None,
                "events": [
                    f"{first_event_subject}|{first_event_predicate}|{first_event_object}|"
                    f"{first_event_start_time}|{first_event_end_time}",
                    f"{second_event_subject}|{second_event_predicate}|{second_event_object}|"
                    f"{second_event_start_time}|{second_event_end_time}",
                ],
                "question_level": "medium",
                "question_type": "timeline_position_retrieval_temporal_constrained_retrieval",
                "answer_type": "object",
                "temporal_relation": None,
            }
        )
        questions += medium_type_1_a_questions

        """
        For duration before, duration after type  question
        """
        medium_type_1_b_questions = []
        # this will be added later when we process the questions with template

        """
        Timeline Position Retrieval + Timeline Position Retrieval Questions

        This one is mainly from numeric to temporal semantic

        - Infer a new time range: Union/Intersection
        - Infer a temporal relation: Allen
        - Infer a list of time ranges: Ranking (not considered here)
        - Infer duration, and then compare
        """
        # Timeline Position Retrieval + Timeline Position Retrieval

        # ask for union/intersection of the time range

        medium_type_2_questions = [
            {
                "question": f"{first_event_subject} {first_event_predicate} {first_event_object} ???"
                f"[Timeline Operation on ({first_event_start_time}, {first_event_end_time}) vs "
                f"({second_event_start_time}, {second_event_end_time})]??? "
                f"{second_event_subject} {second_event_predicate} {second_event_object}?",
                "answer": "Union/Intersection of the time range",
                "paraphrased_question": None,
                "events": [
                    f"{first_event_subject}|{first_event_predicate}|{first_event_object}|"
                    f"{first_event_start_time}|{first_event_end_time}",
                    f"{second_event_subject}|{second_event_predicate}|{second_event_object}|"
                    f"{second_event_start_time}|{second_event_end_time}",
                ],
                "question_level": "medium",
                "question_type": "timeline_position_retrieval_timeline_position_retrieval",
                "answer_type": "relation_union_or_intersection",
                "temporal_relation": "intersection",
            },
            {
                "question": f"{first_event_subject} {first_event_predicate} {first_event_object} "
                f"???[Timeline Operation on ({first_event_start_time}, {first_event_end_time}) "
                f"vs ({second_event_start_time}, {second_event_end_time})]??? "
                f"{second_event_subject} {second_event_predicate} {second_event_object}?",
                "answer": "Union/Intersection of the time range",
                "paraphrased_question": None,
                "events": [
                    f"{first_event_subject}|{first_event_predicate}|{first_event_object}|"
                    f"{first_event_start_time}|{first_event_end_time}",
                    f"{second_event_subject}|{second_event_predicate}|{second_event_object}|"
                    f"{second_event_start_time}|{second_event_end_time}",
                ],
                "question_level": "medium",
                "question_type": "timeline_position_retrieval_timeline_position_retrieval",
                "answer_type": "relation_union_or_intersection",
                "temporal_relation": "union",
            },
            {
                "question": f"{first_event_subject} {first_event_predicate} {first_event_object} "
                f"???[Timeline Operation on ({first_event_start_time}, "
                f"{first_event_end_time}) vs ({second_event_start_time}, "
                f"{second_event_end_time})]??? {second_event_subject} "
                f"{second_event_predicate} {second_event_object}?",
                "answer": "Duration",
                "paraphrased_question": None,
                "events": [
                    f"{first_event_subject}|{first_event_predicate}|{first_event_object}|"
                    f"{first_event_start_time}|{first_event_end_time}",
                    f"{second_event_subject}|{second_event_predicate}|{second_event_object}|"
                    f"{second_event_start_time}|{second_event_end_time}",
                ],
                "question_level": "medium",
                "question_type": "timeline_position_retrieval_timeline_position_retrieval",
                "answer_type": "relation_duration",
                "temporal_relation": None,
            },
        ]
        questions += medium_type_2_questions
        temporal_answer = None
        if template_based:
            for question_draft in questions:
                this_type_templates = QUESTION_TEMPLATES[
                    question_draft["question_level"]
                ][question_draft["question_type"]][question_draft["answer_type"]]

                if (
                    question_draft["answer_type"] == "subject"
                    or question_draft["answer_type"] == "object"
                ):
                    """
                    Handle the Medium Type 1 Questions here: Both a and b
                    First calculate the relations, then based on relations to select the template
                    """
                    temporal_relation = self.relation_allen_time_range(
                        time_range_a=[
                            first_event_start_time_dt,
                            first_event_end_time_dt,
                        ],
                        time_range_b=[
                            second_event_start_time_dt,
                            second_event_end_time_dt,
                        ],
                    )
                    temporal_relation_semantic = temporal_relation.get("semantic")
                    question_draft["temporal_relation"] = temporal_relation["relation"]
                    random_pick_template = random.choice(
                        this_type_templates[temporal_relation_semantic]
                    )

                    question_draft["question"] = random_pick_template.format(
                        first_event_subject=first_event_subject,
                        first_event_predicate=first_event_predicate,
                        first_event_object=first_event_object,
                        temporal_relation=temporal_relation,
                        second_event_subject=second_event_subject,
                        second_event_predicate=second_event_predicate,
                        second_event_object=second_event_object,
                    )
                    # this will generate the basic temporal relation questions.
                    # TODO: we also need to generate the one duration_before, duration_after
                    # If relation is before or after, then we can generate the duration_before, duration_after
                    # Add it to variable medium_type_1_b_questions
                    if temporal_relation_semantic in ["before", "after"]:
                        random_pick_template = random.choice(
                            this_type_templates[
                                f"duration_{temporal_relation_semantic}"
                            ]
                        )
                        # get the duration year
                        # Example: 3 years before Bush as the president of US, who is the president of China?
                        # The duration is calculated based on first_end_time - second_start time
                        # NOTE: It can be extended further later to calculate first_start - second_start
                        duration = self.relation_duration_calculation(
                            time_range_a=[
                                first_event_start_time_dt,
                                first_event_end_time_dt,
                            ],
                            time_range_b=[
                                second_event_start_time_dt,
                                second_event_end_time_dt,
                            ],
                            temporal_operator=f"duration_{temporal_relation_semantic}",
                        )
                        # copy a new question draft
                        duration_question_draft = copy.deepcopy(question_draft)
                        duration_question_draft["question"] = (
                            random_pick_template.format(
                                first_event_subject=first_event_subject,
                                first_event_predicate=first_event_predicate,
                                first_event_object=first_event_object,
                                temporal_relation=f"{duration} {temporal_relation}",
                                second_event_subject=second_event_subject,
                                second_event_predicate=second_event_predicate,
                                second_event_object=second_event_object,
                            )
                        )
                        duration_question_draft["temporal_relation"] = (
                            f"duration_{temporal_relation_semantic}"
                        )
                        medium_type_1_b_questions.append(duration_question_draft)
                else:
                    """
                    Handle in theory four types of questions here
                    """
                    if (
                        question_draft["answer_type"]
                        == "relation_union_or_intersection"
                    ):
                        temporal_relation = question_draft["temporal_relation"]
                        random_pick_template = random.choice(
                            this_type_templates[temporal_relation]
                        )
                        temporal_answer = self.relation_union_or_intersection(
                            time_ranges=[
                                (first_event_start_time_dt, first_event_end_time_dt),
                                (second_event_start_time_dt, second_event_end_time_dt),
                            ],
                            temporal_operator=temporal_relation,
                        )

                        question_draft["question"] = random_pick_template.format(
                            first_event_subject=first_event_subject,
                            first_event_predicate=first_event_predicate,
                            first_event_object=first_event_object,
                            second_event_subject=second_event_subject,
                            second_event_predicate=second_event_predicate,
                            second_event_object=second_event_object,
                        )
                        if temporal_answer is None:
                            temporal_answer = "No Answer"
                        question_draft["answer"] = temporal_answer
                    elif question_draft["answer_type"] == "relation_allen":
                        temporal_allen_relation = self.relation_allen_time_range(
                            time_range_a=[
                                first_event_start_time_dt,
                                first_event_end_time_dt,
                            ],
                            time_range_b=[
                                second_event_start_time_dt,
                                second_event_end_time_dt,
                            ],
                        )
                        question_draft["temporal_relation"] = temporal_allen_relation[
                            "relation"
                        ]
                        # random select from [choices, true_false]
                        question_format = random.choice(["choice", "true_false"])
                        if question_format == "choice":
                            random_pick_template = random.choice(
                                this_type_templates["choice"]
                            )
                            temporal_answer = temporal_allen_relation["relation"]
                            question_draft["question"] = random_pick_template.format(
                                first_event_subject=first_event_subject,
                                first_event_predicate=first_event_predicate,
                                first_event_object=first_event_object,
                                second_event_subject=second_event_subject,
                                second_event_predicate=second_event_predicate,
                                second_event_object=second_event_object,
                            )
                            question_draft["answer"] = temporal_answer

                        else:
                            random_pick_template = random.choice(
                                this_type_templates["true_false"]
                            )
                            random_yes_no_answer = random.choice(["True", "False"])
                            if random_yes_no_answer == "True":
                                temporal_relation = temporal_allen_relation["relation"]
                            else:
                                temporal_relation = random.choice(
                                    list(
                                        set(self.allen_relations)
                                        - {temporal_allen_relation["relation"]}
                                    )
                                )
                            question_draft["question"] = random_pick_template.format(
                                first_event_subject=first_event_subject,
                                first_event_predicate=first_event_predicate,
                                first_event_object=first_event_object,
                                second_event_subject=second_event_subject,
                                second_event_predicate=second_event_predicate,
                                second_event_object=second_event_object,
                                temporal_relation=temporal_relation,
                            )
                            question_draft["answer"] = random_yes_no_answer
                    elif question_draft["answer_type"] == "relation_duration":
                        """There are four types in this category
                        - duration => which is the intersection of the two time range
                        - duration_compare => longer shorter equal
                        - sum => total duration of the two time range, which is actually the union
                        - average => average duration of the two time range
                        """
                        temporal_relation = random.choice(
                            [
                                "duration",
                                "duration_compare",
                                "sum",
                                "average",
                            ]
                        )
                        random_pick_template = random.choice(
                            this_type_templates[temporal_relation]
                        )
                        question_draft["temporal_relation"] = temporal_relation
                        if temporal_relation == "duration":
                            temporal_answer = self.relation_union_or_intersection(
                                time_ranges=[
                                    (
                                        first_event_start_time_dt,
                                        first_event_end_time_dt,
                                    ),
                                    (
                                        second_event_start_time_dt,
                                        second_event_end_time_dt,
                                    ),
                                ],
                                temporal_operator="intersection",
                            )
                            question_draft["question"] = random_pick_template.format(
                                first_event_subject=first_event_subject,
                                first_event_predicate=first_event_predicate,
                                first_event_object=first_event_object,
                                second_event_subject=second_event_subject,
                                second_event_predicate=second_event_predicate,
                                second_event_object=second_event_object,
                            )
                        elif temporal_relation == "duration_compare":
                            temporal_relation_duration = (
                                self.relation_allen_time_duration(
                                    time_range_a=[
                                        first_event_start_time_dt,
                                        first_event_end_time_dt,
                                    ],
                                    time_range_b=[
                                        second_event_start_time_dt,
                                        second_event_end_time_dt,
                                    ],
                                )
                            )
                            temporal_answer = temporal_relation_duration["semantic"]
                            question_draft["question"] = random_pick_template.format(
                                first_event_subject=first_event_subject,
                                first_event_predicate=first_event_predicate,
                                first_event_object=first_event_object,
                                temporal_relation=temporal_answer,
                                second_event_subject=second_event_subject,
                                second_event_predicate=second_event_predicate,
                                second_event_object=second_event_object,
                            )
                        elif temporal_relation == "sum":
                            temporal_answer = self.util_average_duration_calculation(
                                time_ranges=[
                                    [
                                        first_event_start_time_dt,
                                        first_event_end_time_dt,
                                    ],
                                    [
                                        second_event_start_time_dt,
                                        second_event_end_time_dt,
                                    ],
                                ],
                                temporal_operator="sum",
                            )
                            question_draft["question"] = random_pick_template.format(
                                first_event_subject=first_event_subject,
                                first_event_predicate=first_event_predicate,
                                first_event_object=first_event_object,
                                second_event_subject=second_event_subject,
                                second_event_predicate=second_event_predicate,
                                second_event_object=second_event_object,
                            )
                        elif temporal_relation == "average":
                            temporal_answer = self.util_average_duration_calculation(
                                time_ranges=[
                                    [
                                        first_event_start_time_dt,
                                        first_event_end_time_dt,
                                    ],
                                    [
                                        second_event_start_time_dt,
                                        second_event_end_time_dt,
                                    ],
                                ],
                                temporal_operator="average",
                            )
                            question_draft["question"] = random_pick_template.format(
                                first_event_subject=first_event_subject,
                                first_event_predicate=first_event_predicate,
                                first_event_object=first_event_object,
                                second_event_subject=second_event_subject,
                                second_event_predicate=second_event_predicate,
                                second_event_object=second_event_object,
                            )
                        question_draft["answer"] = temporal_answer
                        question_draft["temporal_relation"] = temporal_relation

        questions += medium_type_1_b_questions
        if paraphrased:
            for question_obj in questions:
                paraphrased_question = paraphrase_medium_question(
                    question=question_obj["question"],
                )
                logger.info(f"paraphrased_question: {paraphrased_question}")
                question_obj["paraphrased_question"] = paraphrased_question

        return questions

    def complex_question_generation(self):
        """
        This is to generate the complex question, which will involve three events and timeline

        - Type 1: Before Bush, after Kennedy, who is the president of US?
        - Type 2: Who is the first president of US among Bush, Kennedy, and Obama?
        """

        # next step is to construct three events
        insert_values_list = []
        bulk_sql_pointer = 0

        for item in self.sample_complex_events:
            first_event = self.events_df.iloc[item[0]]
            second_event = self.events_df.iloc[item[1]]
            third_event = self.events_df.iloc[item[2]]
            source_kg_id = (
                first_event["id"] * 1000000 * 1000000
                + second_event["id"] * 1000000
                + third_event["id"]
            )

            questions = self.complex_question_generation_individual(
                first_event=first_event.to_dict(),
                second_event=second_event.to_dict(),
                third_event=third_event.to_dict(),
                template_based=True,
                paraphrased=self.paraphrased,
            )
            for question_obj in questions:
                question_obj["source_kg_id"] = int(source_kg_id)
                # get dict to tuple, sequence should be the same as the sql command
                data = (
                    question_obj["source_kg_id"],
                    question_obj["question"],
                    question_obj["answer"],
                    question_obj["paraphrased_question"],
                    question_obj["events"],
                    question_obj["question_level"],
                    question_obj["question_type"],
                    question_obj["answer_type"],
                    question_obj["temporal_relation"],
                )
                insert_values_list.append(data)
                bulk_sql_pointer += 1
                if bulk_sql_pointer % self.bulk_sql_size == 0:
                    self.bulk_insert(insert_values_list)
                    insert_values_list = []

        self.bulk_insert(insert_values_list)

    def complex_question_generation_individual(
        self,
        first_event: dict,
        second_event: dict,
        third_event: dict,
        template_based: bool = True,
        paraphrased: bool = True,
    ) -> List[dict]:
        """
        Args:
            first_event (dict): The first event
            second_event (dict): The second event
            third_event (dict): The third event
            template_based (bool): Whether you use the template based question generation
            paraphrased (bool): Whether you do the paraphrase for the question, if set to False,
                    then the paraphrased_question will be the same as the question

        Returns:
            dict: The generated questions
                - question
                - answer
                - paraphrased_question
                - events
                - question_level: Complex
                - question_type: The type of the question
                - answer_type: The type of the answer


        - question_type:
            - timeline_position_retrieval *2 + temporal constrained retrieval
            - timeline_position_retrieval *3
        - answer_type:
            - type1:
                - subject
                    - trel b, trel c, ? predicate object
                - object
                    - trel b, trel c, subject predicate ?
            - type2:
                - Infer a new time range: Union/Intersection
                    - trel b, trel c, from when to when the subject predicate object? (intersection)
                    - within (trel b, trel c), who is the subject predicate object? (union)
                - Infer a temporal relation: Allen
                    - ? More making sense one is ranking
                    - hard to justify the question that
                    - If we ask for choice question, it will be between two events
                    - If we ask for true/false, event a,b,c; ab, ac, bc;  Question, ab+ac => is bc relation True
                - Infer a list of time ranges: Ranking
                    - Who is the {} among a,b,c? => an
                - Infer duration, and then compare
                    - Who is the president of US for the longest time among a, b, c? => a

        """

        first_event_subject = first_event["subject"]
        first_event_predicate = first_event["predicate"]
        first_event_object = first_event["object"]
        first_event_start_time = first_event["start_time"]
        first_event_end_time = first_event["end_time"]

        second_event_subject = second_event["subject"]
        second_event_predicate = second_event["predicate"]
        second_event_object = second_event["object"]
        second_event_start_time = second_event["start_time"]
        second_event_end_time = second_event["end_time"]

        third_event_subject = third_event["subject"]
        third_event_predicate = third_event["predicate"]
        third_event_object = third_event["object"]
        third_event_start_time = third_event["start_time"]
        third_event_end_time = third_event["end_time"]

        first_event_start_time_dt, first_event_end_time_dt = self.util_str_to_datetime(
            [first_event_start_time, first_event_end_time]
        )
        second_event_start_time_dt, second_event_end_time_dt = (
            self.util_str_to_datetime([second_event_start_time, second_event_end_time])
        )
        third_event_start_time_dt, third_event_end_time_dt = self.util_str_to_datetime(
            [third_event_start_time, third_event_end_time]
        )

        # first generate
        complex_type_1_a_questions = []
        questions = []

        # timeline_position_retrieval *2 + temporal constrained retrieval
        # ask for the first subject
        complex_type_1_a_questions.append(
            {
                "question": f"??? {first_event_predicate} {first_event_object} {second_event_predicate} "
                f"{second_event_object} {third_event_predicate} {third_event_object}?",
                "answer": f"{first_event_subject}",
                "paraphrased_question": None,
                "events": [
                    f"{first_event_subject}|{first_event_predicate}|{first_event_object}|"
                    f"{first_event_start_time}|{first_event_end_time}",
                    f"{second_event_subject}|{second_event_predicate}|{second_event_object}|"
                    f"{second_event_start_time}|{second_event_end_time}",
                    f"{third_event_subject}|{third_event_predicate}|{third_event_object}|"
                    f"{third_event_start_time}|{third_event_end_time}",
                ],
                "question_level": "complex",
                "question_type": "timeline_position_retrieval*2+temporal_constrained_retrieval",
                "answer_type": "subject",
                "temporal_relation": None,
            }
        )
        # ask for first object
        complex_type_1_a_questions.append(
            {
                "question": f"{first_event_subject} {first_event_predicate} ??? {second_event_predicate} "
                f"{second_event_object} {third_event_predicate} {third_event_object}?",
                "answer": f"{first_event_object}",
                "paraphrased_question": None,
                "events": [
                    f"{first_event_subject}|{first_event_predicate}|{first_event_object}|"
                    f"{first_event_start_time}|{first_event_end_time}",
                    f"{second_event_subject}|{second_event_predicate}|{second_event_object}|"
                    f"{second_event_start_time}|{second_event_end_time}",
                    f"{third_event_subject}|{third_event_predicate}|{third_event_object}|"
                    f"{third_event_start_time}|{third_event_end_time}",
                ],
                "question_level": "complex",
                "question_type": "timeline_position_retrieval*2+temporal_constrained_retrieval",
                "answer_type": "object",
                "temporal_relation": None,
            }
        )

        questions += complex_type_1_a_questions

        """
        For duration before, duration after type question
        """

        complex_type_1_b_questions = []
        # this will be added later when we process the questions with template

        """
        Timeline Position Retrieval + Timeline Position Retrieval + Timeline Position Retrieval
        """

        complex_type_2_questions = [
            {
                "question": f"{first_event_subject} {first_event_predicate} {first_event_object} "
                f"{second_event_predicate} {second_event_object} "
                f"{third_event_predicate} {third_event_object}?",
                "answer": "Union/Intersection of the time range",
                "paraphrased_question": None,
                "events": [
                    f"{first_event_subject}|{first_event_predicate}|{first_event_object}|"
                    f"{first_event_start_time}|{first_event_end_time}",
                    f"{second_event_subject}|{second_event_predicate}|{second_event_object}|"
                    f"{second_event_start_time}|{second_event_end_time}",
                    f"{third_event_subject}|{third_event_predicate}|{third_event_object}|"
                    f"{third_event_start_time}|{third_event_end_time}",
                ],
                "question_level": "complex",
                "question_type": "timeline_position_retrieval*3",
                "answer_type": "relation_union_or_intersection",
                "temporal_relation": "intersection",
            },
            {
                "question": f"{first_event_subject} {first_event_predicate} {first_event_object} "
                f"{second_event_predicate} {second_event_object} "
                f"{third_event_predicate} {third_event_object}?",
                "answer": "Union/Intersection of the time range",
                "paraphrased_question": None,
                "events": [
                    f"{first_event_subject}|{first_event_predicate}|{first_event_object}|"
                    f"{first_event_start_time}|{first_event_end_time}",
                    f"{second_event_subject}|{second_event_predicate}|{second_event_object}|"
                    f"{second_event_start_time}|{second_event_end_time}",
                    f"{third_event_subject}|{third_event_predicate}|{third_event_object}|"
                    f"{third_event_start_time}|{third_event_end_time}",
                ],
                "question_level": "complex",
                "question_type": "timeline_position_retrieval*3",
                "answer_type": "relation_union_or_intersection",
                "temporal_relation": "union",
            },
            {
                "question": f"{first_event_subject} {first_event_predicate} {first_event_object} "
                f"{second_event_predicate} {second_event_object} "
                f"{third_event_predicate} {third_event_object}?",
                "answer": "Duration",
                "paraphrased_question": None,
                "events": [
                    f"{first_event_subject}|{first_event_predicate}|{first_event_object}|"
                    f"{first_event_start_time}|{first_event_end_time}",
                    f"{second_event_subject}|{second_event_predicate}|{second_event_object}|"
                    f"{second_event_start_time}|{second_event_end_time}",
                    f"{third_event_subject}|{third_event_predicate}|{third_event_object}|"
                    f"{third_event_start_time}|{third_event_end_time}",
                ],
                "question_level": "complex",
                "question_type": "timeline_position_retrieval*3",
                "answer_type": "relation_duration",
                "temporal_relation": None,
            },
            # add ranking one
            {
                "question": f"Who is the xxx among {first_event_subject}, {second_event_subject}, "
                f"and {third_event_subject}?",
                "answer": "Ranking",
                "paraphrased_question": None,
                "events": [
                    f"{first_event_subject}|{first_event_predicate}|{first_event_object}|"
                    f"{first_event_start_time}|{first_event_end_time}",
                    f"{second_event_subject}|{second_event_predicate}|{second_event_object}|"
                    f"{second_event_start_time}|{second_event_end_time}",
                    f"{third_event_subject}|{third_event_predicate}|{third_event_object}|"
                    f"{third_event_start_time}|{third_event_end_time}",
                ],
                "question_level": "complex",
                "question_type": "timeline_position_retrieval*3",
                "answer_type": "relation_ranking",
                "temporal_relation": "ranking",
            },
        ]

        questions += complex_type_2_questions
        temporal_answer = None

        if template_based:
            for question_draft in questions:
                this_type_templates = QUESTION_TEMPLATES[
                    question_draft["question_level"]
                ][question_draft["question_type"]][question_draft["answer_type"]]

                if (
                    question_draft["answer_type"] == "subject"
                    or question_draft["answer_type"] == "object"
                ):
                    """
                    Handle the Complex Type 1 Questions here:
                    a,b,c, will ask for a information.
                    Get temporal_relation_12, temporal_relation_13, then generate the question
                    """
                    temporal_relation_12 = self.relation_allen_time_range(
                        time_range_a=[
                            first_event_start_time_dt,
                            first_event_end_time_dt,
                        ],
                        time_range_b=[
                            second_event_start_time_dt,
                            second_event_end_time_dt,
                        ],
                    )
                    temporal_relation_13 = self.relation_allen_time_range(
                        time_range_a=[
                            first_event_start_time_dt,
                            first_event_end_time_dt,
                        ],
                        time_range_b=[
                            third_event_start_time_dt,
                            third_event_end_time_dt,
                        ],
                    )

                    temporal_relation_12_semantic = temporal_relation_12.get("semantic")
                    temporal_relation_13_semantic = temporal_relation_13.get("semantic")
                    question_draft["temporal_relation"] = (
                        f"{temporal_relation_12['relation']}&{temporal_relation_13['relation']}"
                    )
                    random_pick_template = random.choice(this_type_templates)

                    question_draft["question"] = random_pick_template.format(
                        first_event_subject=first_event_subject,
                        first_event_predicate=first_event_predicate,
                        first_event_object=first_event_object,
                        temporal_relation_12=temporal_relation_12_semantic,
                        second_event_subject=second_event_subject,
                        second_event_predicate=second_event_predicate,
                        second_event_object=second_event_object,
                        temporal_relation_13=temporal_relation_13_semantic,
                        third_event_subject=third_event_subject,
                        third_event_predicate=third_event_predicate,
                        third_event_object=third_event_object,
                    )

                    # this will generate the basic temporal relation questions.
                    # then we will want to generate the duration_before, duration_after
                    can_generate_duration_question = False
                    if temporal_relation_12_semantic in ["before", "after"]:
                        duration = self.relation_duration_calculation(
                            time_range_a=[
                                first_event_start_time_dt,
                                first_event_end_time_dt,
                            ],
                            time_range_b=[
                                second_event_start_time_dt,
                                second_event_end_time_dt,
                            ],
                            temporal_operator=f"duration_{temporal_relation_12_semantic}",
                        )
                        temporal_relation_12_semantic = (
                            f"{duration} {temporal_relation_12_semantic}"
                        )
                        logger.debug(temporal_relation_12_semantic)
                        can_generate_duration_question = True
                    if temporal_relation_13_semantic in ["before", "after"]:
                        duration = self.relation_duration_calculation(
                            time_range_a=[
                                first_event_start_time_dt,
                                first_event_end_time_dt,
                            ],
                            time_range_b=[
                                third_event_start_time_dt,
                                third_event_end_time_dt,
                            ],
                            temporal_operator=f"duration_{temporal_relation_13_semantic}",
                        )
                        temporal_relation_13_semantic = (
                            f"{duration} {temporal_relation_13_semantic}"
                        )
                        logger.debug(temporal_relation_13_semantic)
                        can_generate_duration_question = True
                    if can_generate_duration_question:
                        # copy a new question draft
                        duration_question_draft = copy.deepcopy(question_draft)
                        duration_question_draft["question"] = (
                            random_pick_template.format(
                                first_event_subject=first_event_subject,
                                first_event_predicate=first_event_predicate,
                                first_event_object=first_event_object,
                                temporal_relation_12=temporal_relation_12_semantic,
                                second_event_subject=second_event_subject,
                                second_event_predicate=second_event_predicate,
                                second_event_object=second_event_object,
                                temporal_relation_13=temporal_relation_13_semantic,
                                third_event_subject=third_event_subject,
                                third_event_predicate=third_event_predicate,
                                third_event_object=third_event_object,
                            )
                        )
                        duration_question_draft["temporal_relation"] = (
                            f"duration_{temporal_relation_12_semantic}&duration_{temporal_relation_13_semantic}"
                        )
                        complex_type_1_b_questions.append(duration_question_draft)
                else:
                    # handle the Timeline Position Retrieval + Timeline Position Retrieval + Timeline Position Retrieval
                    if (
                        question_draft["answer_type"]
                        == "relation_union_or_intersection"
                    ):
                        temporal_relation = question_draft["temporal_relation"]
                        random_pick_template = random.choice(
                            this_type_templates[temporal_relation]
                        )
                        temporal_answer = self.relation_union_or_intersection(
                            time_ranges=[
                                (first_event_start_time_dt, first_event_end_time_dt),
                                (second_event_start_time_dt, second_event_end_time_dt),
                                (third_event_start_time_dt, third_event_end_time_dt),
                            ],
                            temporal_operator=temporal_relation,
                        )

                        question_draft["question"] = random_pick_template.format(
                            first_event_subject=first_event_subject,
                            first_event_predicate=first_event_predicate,
                            first_event_object=first_event_object,
                            second_event_subject=second_event_subject,
                            second_event_predicate=second_event_predicate,
                            second_event_object=second_event_object,
                            third_event_subject=third_event_subject,
                            third_event_predicate=third_event_predicate,
                            third_event_object=third_event_object,
                        )
                        logger.debug(question_draft["question"])
                        logger.debug(temporal_answer)
                        if temporal_answer is None:
                            temporal_answer = "No Answer"
                        question_draft["answer"] = temporal_answer
                    elif question_draft["answer_type"] == "relation_duration":
                        """
                        There are four types in this category
                        - duration => which is the intersection of the two time range
                        - duration_compare => longer shorter equal
                        - sum => total duration of the two time range, which is actually the union
                        - average => average duration of the two time range
                        """
                        temporal_relation = random.choice(
                            [
                                "duration",
                                "duration_compare",
                                "sum",
                                "average",
                            ]
                        )
                        random_pick_template = random.choice(
                            this_type_templates[temporal_relation]
                        )
                        question_draft["temporal_relation"] = temporal_relation
                        if temporal_relation == "duration":
                            temporal_answer = self.relation_union_or_intersection(
                                time_ranges=[
                                    (
                                        first_event_start_time_dt,
                                        first_event_end_time_dt,
                                    ),
                                    (
                                        second_event_start_time_dt,
                                        second_event_end_time_dt,
                                    ),
                                    (
                                        third_event_start_time_dt,
                                        third_event_end_time_dt,
                                    ),
                                ],
                                temporal_operator="intersection",
                            )
                            question_draft["question"] = random_pick_template.format(
                                first_event_subject=first_event_subject,
                                first_event_predicate=first_event_predicate,
                                first_event_object=first_event_object,
                                second_event_subject=second_event_subject,
                                second_event_predicate=second_event_predicate,
                                second_event_object=second_event_object,
                                third_event_subject=third_event_subject,
                                third_event_predicate=third_event_predicate,
                                third_event_object=third_event_object,
                            )
                        elif temporal_relation == "duration_compare":
                            # we do duration ranking here
                            duration_rank_by_index = self.relation_duration(
                                time_ranges=[
                                    [
                                        first_event_start_time_dt,
                                        first_event_end_time_dt,
                                    ],
                                    [
                                        second_event_start_time_dt,
                                        second_event_end_time_dt,
                                    ],
                                    [
                                        third_event_start_time_dt,
                                        third_event_end_time_dt,
                                    ],
                                ],
                                agg_temporal_operator="ranking",
                            )
                            logger.debug(duration_rank_by_index)
                            temporal_answer = duration_rank_by_index[0] + 1
                            question_draft["question"] = random_pick_template.format(
                                first_event_subject=first_event_subject,
                                first_event_predicate=first_event_predicate,
                                first_event_object=first_event_object,
                                second_event_subject=second_event_subject,
                                second_event_predicate=second_event_predicate,
                                second_event_object=second_event_object,
                                third_event_subject=third_event_subject,
                                third_event_predicate=third_event_predicate,
                                third_event_object=third_event_object,
                                temporal_duration_rank=temporal_answer,
                            )
                        elif temporal_relation == "sum":
                            temporal_answer = self.util_average_duration_calculation(
                                time_ranges=[
                                    [
                                        first_event_start_time_dt,
                                        first_event_end_time_dt,
                                    ],
                                    [
                                        second_event_start_time_dt,
                                        second_event_end_time_dt,
                                    ],
                                    [
                                        third_event_start_time_dt,
                                        third_event_end_time_dt,
                                    ],
                                ],
                                temporal_operator="sum",
                            )
                            question_draft["question"] = random_pick_template.format(
                                first_event_subject=first_event_subject,
                                first_event_predicate=first_event_predicate,
                                first_event_object=first_event_object,
                                second_event_subject=second_event_subject,
                                second_event_predicate=second_event_predicate,
                                second_event_object=second_event_object,
                                third_event_subject=third_event_subject,
                                third_event_predicate=third_event_predicate,
                                third_event_object=third_event_object,
                            )
                        elif temporal_relation == "average":
                            temporal_answer = self.util_average_duration_calculation(
                                time_ranges=[
                                    [
                                        first_event_start_time_dt,
                                        first_event_end_time_dt,
                                    ],
                                    [
                                        second_event_start_time_dt,
                                        second_event_end_time_dt,
                                    ],
                                    [
                                        third_event_start_time_dt,
                                        third_event_end_time_dt,
                                    ],
                                ],
                                temporal_operator="average",
                            )
                            logger.debug(temporal_answer)
                            question_draft["question"] = random_pick_template.format(
                                first_event_subject=first_event_subject,
                                first_event_predicate=first_event_predicate,
                                first_event_object=first_event_object,
                                second_event_subject=second_event_subject,
                                second_event_predicate=second_event_predicate,
                                second_event_object=second_event_object,
                                third_event_subject=third_event_subject,
                                third_event_predicate=third_event_predicate,
                                third_event_object=third_event_object,
                            )
                        question_draft["answer"] = temporal_answer
                        question_draft["temporal_relation"] = temporal_relation
                    elif question_draft["answer_type"] == "relation_ranking":
                        # random select, ranking based on start time or end time
                        rank_by_what = random.choice(
                            ["rank_start_time", "rank_end_time"]
                        )
                        rank_by_index = self.relation_ordinal_time_range(
                            time_ranges=[
                                [first_event_start_time_dt, first_event_end_time_dt],
                                [second_event_start_time_dt, second_event_end_time_dt],
                                [third_event_start_time_dt, third_event_end_time_dt],
                            ],
                            agg_temporal_operator=rank_by_what,
                        )

                        random_pick_template = random.choice(
                            this_type_templates[rank_by_what]
                        )
                        question_draft["question"] = random_pick_template.format(
                            first_event_subject=first_event_subject,
                            first_event_predicate=first_event_predicate,
                            first_event_object=first_event_object,
                            second_event_subject=second_event_subject,
                            second_event_predicate=second_event_predicate,
                            second_event_object=second_event_object,
                            third_event_subject=third_event_subject,
                            third_event_predicate=third_event_predicate,
                            third_event_object=third_event_object,
                        )
                        temporal_answer = rank_by_index[0] + 1
                        question_draft["answer"] = temporal_answer
                        question_draft["temporal_relation"] = rank_by_what

        questions += complex_type_1_b_questions
        if paraphrased:
            for question_obj in questions:
                paraphrased_question = paraphrase_medium_question(
                    question=question_obj["question"],
                )
                logger.info(f"paraphrased_question: {paraphrased_question}")
                question_obj["paraphrased_question"] = paraphrased_question

        return questions

    @staticmethod
    def relation_allen_time_range(time_range_a: list, time_range_b: list) -> dict:
        """
        This function will return the allen temporal relation between two time ranges

        We have the 26 possible relations
        - 13 for time range operation
        - 10 for time point and time range operation
        - 3 for time point operation

        We will need to extract them from the quantitatively time range

        We will have "beginning of time" or "end of time" to represent the infinite time range
        We will need to convert it to a numerical value in np.inf
        Others will be converted to a numerical value in the timestamp

        Args:
            time_range_a (list[datetime, datetime]): The first time range
            time_range_b (list[datetime, datetime]): The second time range

        Returns:
            dict: The allen temporal relation between the two time ranges
        """
        start_time1, end_time1 = time_range_a
        start_time2, end_time2 = time_range_b

        logger.debug(
            f"start_time1: {start_time1}, end_time1: {end_time1}, start_time2: {start_time2}, end_time2: {end_time2}"
        )

        # 13 for time range operation
        time_range_a_datetime = [start_time1, end_time1]
        time_range_b_datetime = [start_time2, end_time2]
        # then we will do the operation for the time range, get the allen temporal relation
        """
        x_start <= x_end
        y_start <= y_end
        allen_operator = [
            x_start - x_end,  # 0, or -1, which means a is a point or a range
            y_start - y_end, # 0, or -1
            x_start - y_start,
            x_start - y_end,
            x_end - y_start,
            x_end - y_end,
        ]

        After this do a operation for the allen_operator, if value = 0, keep it, < 0, set it to -1, > 0, set it to 1

        Then we will have:
        13 for time range operation, which means x_start < x_end, y_start < y_end
            - X <  Y => [-1, -1, -1, -1, -1, -1]
            - X m  Y => [-1, -1, -1, -1,  0, -1]
            - X o  Y => [-1, -1, -1, -1,  1, -1]
            - X fi Y => [-1, -1, -1, -1,  1,  0]
            - X di Y => [-1, -1, -1, -1,  1,  1]
            - X s  Y => [-1, -1,  0, -1,  1, -1]
            - X =  Y => [-1, -1,  0, -1,  1,  0]
            - X si Y => [-1, -1,  0, -1,  1,  1]
            - X d  Y => [-1, -1,  1, -1,  1, -1]
            - X f  Y => [-1, -1,  1, -1,  1,  0]
            - X oi Y => [-1, -1,  1, -1,  1,  1]
            - X mi Y => [-1, -1,  1,  0,  1,  1]
            - X >  Y => [-1, -1,  1,  1,  1,  1]

        10 for time point and time range operation
        Among the 10, 5 for X is a point, 5 for Y is a point
        5 for X is a point, Y is a range, which means x_start = x_end, y_start < y_end
            - X <  Y => [0, -1, -1, -1, -1, -1]
            - X s  Y => [0, -1,  0, -1,  0, -1]
            - X d  Y => [0, -1,  1, -1,  1, -1]
            - X f  Y => [0, -1,  1,  0,  1,  0]
            - X >  Y => [0, -1,  1,  1,  1,  1]
        5 for X is a range, Y is a point, which means x_start < x_end, y_start = y_end
            - X <  Y => [-1, 0, -1, -1, -1, -1]
            - X fi Y => [-1, 0, -1，-1,  0,  0]
            - X di Y => [-1, 0, -1, -1,  1,  1]
            - X si Y => [-1, 0,  0,  0,  1,  1]
            - X >  Y => [-1, 0,  1,  1,  1,  1]

        3 for time point operation, which means x_start = x_end, y_start = y_end
            - X < Y => [0, 0, -1, -1, -1, -1]
            - X = Y => [0, 0,  0,  0,  0,  0]
            - X > Y => [0, 0,  1,  1,  1,  1]
        """

        allen_operator = [
            time_range_a_datetime[0] - time_range_a_datetime[1],
            time_range_b_datetime[0] - time_range_b_datetime[1],
            time_range_a_datetime[0] - time_range_b_datetime[0],
            time_range_a_datetime[0] - time_range_b_datetime[1],
            time_range_a_datetime[1] - time_range_b_datetime[0],
            time_range_a_datetime[1] - time_range_b_datetime[1],
        ]

        # do the operation for the allen_operator
        for index, value in enumerate(allen_operator):
            if value == 0:
                allen_operator[index] = 0
            elif value < 0:
                allen_operator[index] = -1
            else:
                allen_operator[index] = 1

        # logger.critical(f"allen_operator: {allen_operator}")
        # get it to be a tuple
        allen_operator = tuple(allen_operator)
        logger.debug(f"allen_operator: {allen_operator}")
        try:
            logger.debug(f"ALLEN_OPERATOR_DICT: {ALLEN_OPERATOR_DICT[allen_operator]}")
            return ALLEN_OPERATOR_DICT[allen_operator]
        except KeyError:
            logger.info(f"allen_operator: {allen_operator}")
            logger.info(f"time_range_a: {time_range_a}")
            logger.info(f"time_range_b: {time_range_b}")
            raise ValueError("The allen operator is not found")

    @staticmethod
    def relation_allen_time_duration(time_range_a: list, time_range_b: list) -> dict:
        """

        Args:
            time_range_a (list[datetime, datetime]): The first time range
            time_range_b (list[datetime, datetime]): The second time range

        Returns:
            dict: The allen temporal relation between the two time ranges
        """
        duration_a = abs(time_range_a[1] - time_range_a[0])
        duration_b = abs(time_range_b[1] - time_range_b[0])
        if duration_a < duration_b:
            return {
                "relation": "X < Y",
                "description": "X is shorter Y",
                "category": "td",
                "code": "td-1",
                "semantic": "shorter",
            }
        elif duration_a == duration_b:
            return {
                "relation": "X = Y",
                "description": "X equals Y",
                "category": "td",
                "code": "td-2",
                "semantic": "equals",
            }
        else:
            return {
                "relation": "X > Y",
                "description": "X is longer Y",
                "category": "td",
                "code": "td-3",
                "semantic": "longer",
            }

    @staticmethod
    def relation_union_or_intersection(
        time_ranges: List[Tuple[np.datetime64, np.datetime64]],
        temporal_operator: str = "intersection",
    ) -> Optional[str]:
        """
        This function will return the temporal operator between multiple time ranges
        The temporal operator can be:
            - 'intersection'
            - 'union'

        Args:
            time_ranges (List[Tuple[datetime, datetime]]): A list of time ranges
            temporal_operator (str): The temporal operator

        Returns:
            str: A string representation of the new time range, or None if no valid range exists.

        """
        if temporal_operator not in ["intersection", "union"]:
            raise ValueError(
                "temporal_operator should be either 'intersection' or 'union'"
            )

        if not time_ranges:
            return None

        # Start with the first time range
        result = time_ranges[0]

        for current in time_ranges[1:]:
            if temporal_operator == "intersection":
                # Find the latest start time and earliest end time
                start = max(result[0], current[0])
                end = min(result[1], current[1])
                if start >= end:
                    return None  # No intersection
                result = (start, end)
            elif temporal_operator == "union":
                # Find the earliest start time and latest end time
                start = min(result[0], current[0])
                end = max(result[1], current[1])
                # Check if there is a gap between the ranges
                if result[1] < current[0] or current[1] < result[0]:
                    return None  # No continuous union possible
                result = (start, end)

        return f"({result[0]}, {result[1]})"

    @staticmethod
    def relation_ordinal_time_range(
        time_ranges: list[[datetime, datetime]], agg_temporal_operator: str = None
    ) -> list:
        """
        For the time range, it will do the rank operation, sort it

        Aggregation operator can be:
        - ranking(min, max)
            - ranking_start
            - ranking_end

        Args:
            time_ranges (list): The list of time ranges
            agg_temporal_operator (str): The aggregation temporal operator

        Returns:
            list: the list of sorted index for the time range

        For example, we have the time range:

        ```
        time_ranges = [
            (datetime(2023, 5, 1, 12, 0), datetime(2023, 5, 1, 15, 0)),  # 3 hours
            (datetime(2023, 5, 1, 9, 30), datetime(2023, 5, 1, 14, 0)),  # 4.5 hours
            (datetime(2023, 5, 1, 8, 0), datetime(2023, 5, 1, 11, 30)),  # 3.5 hours
            (datetime(2023, 5, 2, 9, 30), datetime(2023, 5, 2, 12, 0)),  # 2.5 hours
            (datetime(2023, 5, 1, 10, 30), datetime(2023, 5, 1, 13, 0))  # 2.5 hours
        ]

        result_start = TKGQAGenerator.aggregate_tr_temporal_operator(time_ranges, "ranking_start")
        [3,1,0,4,2]

        result_end = TKGQAGenerator.aggregate_tr_temporal_operator(time_ranges, "ranking_end")
        [2,4,3,0,1]
        ```
        """

        # Create a list of indices paired with time ranges
        indexed_time_ranges = list(enumerate(time_ranges))

        if agg_temporal_operator == "rank_start_time":
            # Sort by start time, but maintain original indices
            indexed_time_ranges.sort(key=lambda x: x[1][0])
        elif agg_temporal_operator == "rank_end_time":
            # Sort by end time, but maintain original indices
            indexed_time_ranges.sort(key=lambda x: x[1][1])
        else:
            raise ValueError(
                "Unsupported aggregation temporal operator. Please use 'rank_start_time' or 'rank_end_time'."
            )

        # After sorting, create a new list that maps the original index to its new rank
        rank_by_index = [0] * len(time_ranges)  # Pre-initialize a list of zeros
        for rank, (original_index, _) in enumerate(indexed_time_ranges):
            rank_by_index[original_index] = rank

        return rank_by_index

    @staticmethod
    def relation_duration(
        time_ranges: list[[datetime, datetime]], agg_temporal_operator: str = None
    ):
        """
        For the time range, it will do the rank operation, sort it

        First calculate the duration of the time range, then do the rank operation based on the duration

        Args:
            time_ranges (list): The list of time ranges
            agg_temporal_operator (str): The aggregation temporal operator

        Returns:
            list: the list of sorted index for the time range


        Example:
        ```
        time_ranges = [
            (datetime(2023, 5, 1, 12, 0), datetime(2023, 5, 1, 15, 0)),  # 3 hours
            (datetime(2023, 5, 1, 9, 30), datetime(2023, 5, 1, 14, 0)),  # 4.5 hours
            (datetime(2023, 5, 1, 8, 0), datetime(2023, 5, 1, 11, 30)),  # 3.5 hours
            (datetime(2023, 5, 2, 9, 30), datetime(2023, 5, 2, 12, 0)),  # 2.5 hours
            (datetime(2023, 5, 1, 10, 30), datetime(2023, 5, 1, 13, 0))  # 2.5 hours
        ]
        ```

        The output will be:
        ```
        [2, 4, 3, 0, 1]
        ```
        """
        # Create a list of indices paired with time ranges
        if agg_temporal_operator == "ranking":
            indexed_time_ranges = list(enumerate(time_ranges))

            indexed_time_ranges.sort(key=lambda x: abs(x[1][1] - x[1][0]))
            rank_by_index = [0] * len(time_ranges)  # Pre-initialize a list of zeros
            for index, (original_index, _) in enumerate(indexed_time_ranges):
                rank_by_index[original_index] = index
            return rank_by_index
        if agg_temporal_operator == "sum":
            # total value of the time range
            durations = [
                abs(time_range[1] - time_range[0]) for time_range in time_ranges
            ]
            return sum(durations)
        if agg_temporal_operator == "average":
            # average value of the time range
            durations = [
                abs(time_range[1] - time_range[0]) for time_range in time_ranges
            ]
            return sum(durations) / len(durations)
        raise ValueError(
            "Unsupported aggregation temporal operator. Please use 'ranking', 'sum' or 'average'."
        )

    @staticmethod
    def relation_duration_calculation(
        time_range_a: list,
        time_range_b: list,
        temporal_operator: str = None,
    ) -> Optional[timedelta]:
        """
        We will calculate the time difference between two time ranges

        However, there are several combination we can do here

        - duration_before => abs(time_range_a[1] - time_range_b[0])
        - duration_after => abs(time_range_b[1] - time_range_a[0])
        We also have other combinations, but we will not consider them here

        Args:
            time_range_a (list[datetime, datetime]): The first time range
            time_range_b (list[datetime, datetime]): The second time range
            temporal_operator (str): The temporal operator

        Returns:
            timedelta: The time difference between two time ranges

        """
        if temporal_operator is None or temporal_operator not in [
            "duration_before",
            "duration_after",
        ]:
            raise ValueError(
                "temporal_operator should be one of the following: duration_before, duration_after"
            )
        if temporal_operator == "duration_before":
            return abs(time_range_a[1] - time_range_b[0])
        if temporal_operator == "duration_after":
            return abs(time_range_b[1] - time_range_a[0])
        return None

    @staticmethod
    def util_str_to_datetime(time_range: list) -> Tuple[np.datetime64, np.datetime64]:
        """
        Convert the string to datetime

        Args:
            time_range (list[str, str]): The time range in string format

        Returns:
            list[datetime, datetime]: The time range in datetime format

        """
        start_time, end_time = time_range
        if start_time == "beginning of time":
            start_time = datetime.min.replace(year=1)
        if end_time == "end of time":
            end_time = datetime.max.replace(year=9999)

        # convert the time to numerical value, format is like this: 1939-04-25
        start_time = np.datetime64(start_time)
        end_time = np.datetime64(end_time)

        return start_time, end_time

    def util_average_duration_calculation(
        self, time_ranges: list[[datetime, datetime]], temporal_operator: str = None
    ):
        try:
            if temporal_operator == "average":
                durations = [
                    abs(time_range[1] - time_range[0]) for time_range in time_ranges
                ]
                average_d = sum(durations) / len(durations)
                return self.utils_format_np_datetime(average_d)

            if temporal_operator == "sum":
                durations = [
                    abs(time_range[1] - time_range[0]) for time_range in time_ranges
                ]
                total = sum(durations)
                return self.utils_format_np_datetime(total)
            return None
        except Exception as e:
            logger.error(f"Error in util_average_duration_calculation: {e}")
            return None

    @staticmethod
    def utils_format_np_datetime(np_date_delta):
        td = timedelta(seconds=np_date_delta / (np.timedelta64(1, "s")))
        days = td.days
        seconds = td.seconds
        # Compute hours, minutes, and remaining seconds
        hours, remainder = divmod(seconds, 3600)
        minutes, seconds = divmod(remainder, 60)

        # also get how many years
        years = days // 365
        if years > 2000:
            return "forever"
        months = (days % 365) // 30
        days = (days % 365) % 30
        human_readable_format = (
            f"{years} years, {months} months, "
            f"{days} days, {hours} hours, {minutes} minutes, {seconds} seconds"
        )
        return human_readable_format

    @property
    def allen_relations(self):
        return [
            "X < Y",
            "X m Y",
            "X o Y",
            "X fi Y",
            "X di Y",
            "X s Y",
            "X = Y",
            "X si Y",
            "X d Y",
            "X f Y",
            "X oi Y",
            "X mi Y",
            "X > Y",
        ]

    def bulk_insert(self, values: List[Tuple]):
        """
        This function will insert the values into the table

        """
        values_str = ",\n".join(["(%s, %s, %s, %s, %s, %s, %s, %s, %s)"] * len(values))
        # Flatten the list of values tuples into a single tuple for execution
        flat_values = [item for sublist in values for item in sublist]

        bulk_insert_query = f"""
        INSERT INTO {self.unified_kg_table_questions} (
                                                    source_kg_id,
                                                      question,
                                                      answer,
                                                      paraphrased_question,
                                                      events,
                                                      question_level,
                                                      question_type,
                                                      answer_type,
                                                      temporal_relation
                                                      )
        VALUES {values_str}
        """
        if len(values) == 0:
            return
        # Execute the bulk insert command
        try:
            self.cursor.execute(bulk_insert_query, flat_values)
            self.connection.commit()
            logger.info(f"Successfully inserted {len(values)} rows into the table.")
        except Exception as e:
            logger.exception(f"Error: {e}")
            logger.info(flat_values)

    def sampling_events(
        self,
        sample_percentage: Union[float, dict, int] = 0.1,
        sample_strategy: str = "temporal_close",
    ):
        """
        This function will sample the events from the list

        Args:
            sample_percentage (float): The sample percentage
            sample_strategy (str): The sample strategy, can be random, temporal_close, degree_high, both

        Returns:
            List[Tuple]: The list Tuples (event1_id, event2_id, event3_id)

        """

        if sample_strategy not in ("random", "temporal_close", "degree_high", "both"):
            raise ValueError(
                "sample_strategy should be random, temporal_close, degree_high, or both"
            )

        """
        Do matrix sampling based on the element value
        If the dimension is 2, we will have a nxn matrix
            - value of the matrix is 1 for random
            - value will be calculated based on the temporal information if it is temporal_close
            - value will be calculated based on the degree information if it is degree_high
            - value will be calculated based on both temporal and degree information if it is both

        And then use the value as weight to do the sampling over the matrix
        """

        num_events = len(self.events_df)
        # for dimension 1 generate, first construct a matrix with len(event_df), using numpy,
        # and it is always sampling randomly
        if sample_strategy == "degree_high" or sample_strategy == "both":
            degree_scores = self.calculate_degree_scores(self.events_df)
        else:
            degree_scores = None

        with timer(the_logger=logger, message="Generating the matrix D1"):
            dimension_1_matrix = np.ones(num_events)

            # make sure every element in the matrix sum to 1
            dimension_1_matrix = dimension_1_matrix / dimension_1_matrix.sum()

        with timer(the_logger=logger, message="Generating the matrix D2"):
            # for dimension 2 generate, first construct a matrix with len(event_df), using numpy
            dimension_2_matrix = np.zeros((num_events, num_events))

            if sample_strategy == "random":
                dimension_2_matrix = np.ones((num_events, num_events))
            elif sample_strategy == "temporal_close":
                start_times = self.events_df["start_time"].values
                end_times = self.events_df["end_time"].values

                for x in range(num_events):
                    for y in range(x + 1, num_events):  # y > x to avoid redundancy
                        score = self.temporal_close_score(
                            time_ranges=[
                                [start_times[x], end_times[x]],
                                [start_times[y], end_times[y]],
                            ]
                        )
                        dimension_2_matrix[x, y] = score
                        dimension_2_matrix[y, x] = score  # Leverage symmetry

            elif sample_strategy == "degree_high":

                for x in range(num_events):
                    for y in range(x + 1, num_events):
                        score = degree_scores[x] = degree_scores[y]
                        dimension_2_matrix[x, y] = score
                        dimension_2_matrix[y, x] = score  # Leverage symmetry

            elif sample_strategy == "both":
                # park here
                start_times = self.events_df["start_time"].values
                end_times = self.events_df["end_time"].values
                for x in tqdm(range(num_events), desc="Processing events"):
                    for y in range(x + 1, num_events):
                        degree_score = degree_scores[x] + degree_scores[y]
                        temporal_score = self.temporal_close_score(
                            time_ranges=[
                                [start_times[x], end_times[x]],
                                [start_times[y], end_times[y]],
                            ]
                        )
                        score = degree_score + temporal_score
                        dimension_2_matrix[x, y] = score
                        dimension_2_matrix[y, x] = score

            # make sure every element in the matrix sum to 1
            dimension_2_matrix = dimension_2_matrix / dimension_2_matrix.sum()

        with timer(the_logger=logger, message="Generating the matrix D3"):
            # for dimension 3 generate, first construct a matrix with len(event_df), using numpy
            dimension_3_matrix = np.zeros((num_events, num_events, num_events))
            if sample_strategy == "random":
                dimension_3_matrix = np.ones((num_events, num_events, num_events))
            elif sample_strategy == "temporal_close":
                start_times = self.events_df["start_time"].values
                end_times = self.events_df["end_time"].values

                for x in tqdm(range(num_events), desc="Processing 3D events"):
                    for y in range(x + 1, num_events):  # y > x to avoid redundancy
                        for z in range(y + 1, num_events):  # z > y to avoid redundancy
                            score = self.temporal_close_score(
                                time_ranges=[
                                    [start_times[x], end_times[x]],
                                    [start_times[y], end_times[y]],
                                    [start_times[z], end_times[z]],
                                ]
                            )
                            dimension_3_matrix[x, y, z] = score
                            dimension_3_matrix[x, z, y] = score
                            dimension_3_matrix[y, x, z] = score
                            dimension_3_matrix[y, z, x] = score
                            dimension_3_matrix[z, x, y] = score
                            dimension_3_matrix[z, y, x] = score
            elif sample_strategy == "degree_high":
                for x in range(num_events):
                    for y in range(x + 1, num_events):
                        for z in range(y + 1, num_events):
                            score = (
                                degree_scores[x] + degree_scores[y] + degree_scores[z]
                            )
                            dimension_3_matrix[x, y, z] = score
                            dimension_3_matrix[x, z, y] = score
                            dimension_3_matrix[y, x, z] = score
                            dimension_3_matrix[y, z, x] = score
                            dimension_3_matrix[z, x, y] = score
                            dimension_3_matrix[z, y, x] = score
            elif sample_strategy == "both":
                # park here
                start_times = self.events_df["start_time"].values
                end_times = self.events_df["end_time"].values
                for x in tqdm(range(num_events), desc="Processing 3D events"):
                    for y in range(x + 1, num_events):
                        for z in range(y + 1, num_events):
                            degree_score = (
                                degree_scores[x] + degree_scores[y] + degree_scores[z]
                            )
                            temporal_score = self.temporal_close_score(
                                time_ranges=[
                                    [start_times[x], end_times[x]],
                                    [start_times[y], end_times[y]],
                                    [start_times[z], end_times[z]],
                                ]
                            )
                            score = degree_score + temporal_score
                            dimension_3_matrix[x, y, z] = score
                            dimension_3_matrix[x, z, y] = score
                            dimension_3_matrix[y, x, z] = score
                            dimension_3_matrix[y, z, x] = score
                            dimension_3_matrix[z, x, y] = score
                            dimension_3_matrix[z, y, x] = score

            # make sure every element in the matrix sum to 1
            dimension_3_matrix = dimension_3_matrix / dimension_3_matrix.sum()
        # do the sampling based on the matrix
        # if sampling is a float value, then it means all three dimensions will be sampled based on the rate
        # if samping is an int value, then it means all three dimension will have that many questions
        # if sampling is a dict value, then it means the sampling rate for each dimension

        with timer(the_logger=logger, message="Sampling the events"):
            if isinstance(sample_percentage, float):
                # sample based on the rate, and the value (weight) is the matrix value
                dimension_1_samples = np.random.choice(
                    num_events,
                    int(num_events * sample_percentage),
                    p=dimension_1_matrix,
                )
                # sample it from dimension 2 matrix
                dimension_2_samples = self.random_selection(
                    dimension_2_matrix,
                    int(dimension_2_matrix.size * sample_percentage),
                )
                # sample it from dimension 3 matrix
                dimension_3_samples = self.random_selection(
                    dimension_3_matrix,
                    int(dimension_3_matrix.size * sample_percentage),
                )

            elif isinstance(sample_percentage, int):
                dimension_1_samples = np.random.choice(
                    num_events, sample_percentage, p=dimension_1_matrix
                )
                dimension_2_samples = self.random_selection(
                    dimension_2_matrix,
                    sample_percentage,
                )
                dimension_3_samples = self.random_selection(
                    dimension_3_matrix,
                    sample_percentage,
                )

            elif isinstance(sample_percentage, dict):
                dimension_1_samples = sample_percentage.get("dimension_1", 0)
                dimension_2_samples = sample_percentage.get("dimension_2", 0)
                dimension_3_samples = sample_percentage.get("dimension_3", 0)
                if (
                    dimension_1_samples == 0
                    or dimension_2_samples == 0
                    or dimension_3_samples == 0
                ):
                    raise ValueError(
                        "The sample_percentage should have all three dimensions"
                    )
                # if all types of
                # dimension_1_sample_percentage,
                # dimension_2_sample_percentage,
                # dimension_3_sample_percentage are float
                # then we will sample based on the rate
                if all(
                    isinstance(i, float)
                    for i in [
                        dimension_1_samples,
                        dimension_2_samples,
                        dimension_3_samples,
                    ]
                ):
                    dimension_1_samples = np.random.choice(
                        len(self.events_df),
                        int(len(self.events_df) * dimension_1_samples),
                        p=dimension_1_matrix,
                    )
                    dimension_2_samples = self.random_selection(
                        dimension_2_matrix,
                        int(dimension_2_matrix.size * dimension_2_samples),
                    )
                    dimension_3_samples = self.random_selection(
                        dimension_3_matrix,
                        int(dimension_3_matrix.size * dimension_3_samples),
                    )
                # if all types of
                # dimension_1_sample_percentage,
                # dimension_2_sample_percentage,
                # dimension_3_sample_percentage are int
                # then we will sample based on the number
                elif all(
                    isinstance(i, int)
                    for i in [
                        dimension_1_samples,
                        dimension_2_samples,
                        dimension_3_samples,
                    ]
                ):
                    dimension_1_samples = np.random.choice(
                        len(self.events_df), dimension_1_samples, p=dimension_1_matrix
                    )
                    dimension_2_samples = self.random_selection(
                        dimension_2_matrix,
                        dimension_2_samples,
                    )
                    dimension_3_samples = self.random_selection(
                        dimension_3_matrix,
                        dimension_3_samples,
                    )
                else:
                    raise ValueError(
                        "The sample_percentage should have all three dimensions"
                    )
            else:
                raise ValueError(
                    "The sample_percentage should be either float, int, or dict"
                )

        logger.info(len(dimension_1_samples))
        logger.info(len(dimension_2_samples))
        logger.info(len(dimension_3_samples))

        """
        Examples of the output:

        ```
        [  0  10  20  30  40  50  60  70  80  90 100]
        [(0, 10), (20, 30), (40, 50), (60, 70), (80, 90), (100, 110)]
        [(0, 10, 20), (30, 40, 50), (60, 70, 80), (90, 100, 110)]
        ```
        """
        self.sample_simple_events = dimension_1_samples
        self.sample_medium_events = dimension_2_samples
        self.sample_complex_events = dimension_3_samples
        return dimension_1_samples, dimension_2_samples, dimension_3_samples

    def temporal_close_score(self, time_ranges: List) -> float:
        """
        This function will calculate the temporal close score between two/three time ranges.

        range_a = [start_time_a, end_time_a]
        range_b = [start_time_b, end_time_b]
        range_c = [start_time_c, end_time_c] (if three ranges are provided)

        score = 1 / ((start_time_b - start_time_a)**2 + (end_time_b - end_time_a)**2)
        (for two ranges)

        score = 1 / ((start_time_b - start_time_a)**2 + (end_time_b - end_time_a)**2 +
                     (start_time_c - start_time_a)**2 + (end_time_c - end_time_a)**2)
        (for three ranges)

        Args:
            time_ranges (List[Tuple[datetime, datetime]]): The list of time ranges

        Returns:
            temporal_close_score (float): The temporal close score between two/three time ranges, if it is close
            to 1, it means the time ranges are close to each other, if it is close to 0, it means the time ranges
            are far from each other
        """
        if len(time_ranges) not in [2, 3]:
            raise ValueError("The function only supports two or three time ranges")

        # Utility function to convert string to datetime
        start_time_a_dt, end_time_a_dt = self.util_str_to_datetime(time_ranges[0])
        start_time_b_dt, end_time_b_dt = self.util_str_to_datetime(time_ranges[1])

        # Calculate the differences in days
        start_diff_days = (start_time_b_dt - start_time_a_dt) / np.timedelta64(1, "D")
        end_diff_days = (end_time_b_dt - end_time_a_dt) / np.timedelta64(1, "D")

        # Calculate the score using days difference
        score = start_diff_days**2 + end_diff_days**2

        if len(time_ranges) == 3:
            start_time_c_dt, end_time_c_dt = self.util_str_to_datetime(time_ranges[2])
            start_diff_days_c = (start_time_c_dt - start_time_a_dt) / np.timedelta64(
                1, "D"
            )
            end_diff_days_c = (end_time_c_dt - end_time_a_dt) / np.timedelta64(1, "D")
            score += start_diff_days_c**2 + end_diff_days_c**2
        if score == 0:
            return 0
        return 1 / score

    @staticmethod
    def calculate_degree_scores(event_df: pd.DataFrame):
        """
        group by subject, each subject will have a degree
        group by object, each object will have a degree
        merge subject_degree and object_degree back to the event_df, it will have extract two columns
        - subject_degree
        - object_degree
        then we will calculate the degree score based on the two columns

        In this way, we have the degree score for each event

        Args:
            event_df (pd.DataFrame): The dataframe of the events

        Returns:
            Tuple: The tuple of the degree scores
        """

        subject_degree_df = (
            event_df.groupby("subject").size().reset_index(name="subject_degree")
        )
        object_degree_df = (
            event_df.groupby("object").size().reset_index(name="object_degree")
        )

        event_df = event_df.merge(subject_degree_df, on="subject", how="left")
        event_df = event_df.merge(object_degree_df, on="object", how="left")

        event_df["degree_score"] = (
            event_df["subject_degree"] + event_df["object_degree"]
        )
        # show the summary of the degree score
        logger.debug(event_df["degree_score"].describe())

        # flat the score to 1,2,3,4 based on the quartile
        event_df["degree_score"] += np.random.normal(
            0, 1e-5, size=event_df["degree_score"].shape
        )
        event_df["degree_score"] = pd.qcut(event_df["degree_score"], q=4, labels=False)
        event_df["degree_score"] = event_df["degree_score"] + 1

        # normalize the score to 0-1
        event_df["degree_score"] = event_df["degree_score"] / 4

        return event_df["degree_score"].values

    @staticmethod
    def random_selection(matrix, sample_num):

        if sample_num > matrix.size:
            raise ValueError(
                "The sample number should be less than the length of the matrix"
            )

        if matrix.shape[0] != matrix.shape[1]:
            raise ValueError("The matrix should be a square matrix")
        # if it is one dimension, then we will sample based on the rate
        if matrix.shape[0] == 1:
            raise ValueError("The matrix should be at least two dimensions")

        flatten_matrix = matrix.flatten()
        sample_indices = np.random.choice(
            np.arange(len(flatten_matrix)),
            size=sample_num,
            replace=False,
            p=flatten_matrix,
        )
        samples = [np.unravel_index(i, matrix.shape) for i in sample_indices]
        return samples

`bulk_insert(values)`

This function will insert the values into the table

Source code in TimelineKGQA/generator.py

def bulk_insert(self, values: List[Tuple]):
    """
    This function will insert the values into the table

    """
    values_str = ",\n".join(["(%s, %s, %s, %s, %s, %s, %s, %s, %s)"] * len(values))
    # Flatten the list of values tuples into a single tuple for execution
    flat_values = [item for sublist in values for item in sublist]

    bulk_insert_query = f"""
    INSERT INTO {self.unified_kg_table_questions} (
                                                source_kg_id,
                                                  question,
                                                  answer,
                                                  paraphrased_question,
                                                  events,
                                                  question_level,
                                                  question_type,
                                                  answer_type,
                                                  temporal_relation
                                                  )
    VALUES {values_str}
    """
    if len(values) == 0:
        return
    # Execute the bulk insert command
    try:
        self.cursor.execute(bulk_insert_query, flat_values)
        self.connection.commit()
        logger.info(f"Successfully inserted {len(values)} rows into the table.")
    except Exception as e:
        logger.exception(f"Error: {e}")
        logger.info(flat_values)

`calculate_degree_scores(event_df)` `staticmethod`

group by subject, each subject will have a degree group by object, each object will have a degree merge subject_degree and object_degree back to the event_df, it will have extract two columns - subject_degree - object_degree then we will calculate the degree score based on the two columns

In this way, we have the degree score for each event

Parameters:

Name	Type	Description	Default
`event_df`	`DataFrame`	The dataframe of the events	required

Returns:

Name	Type	Description
`Tuple`		The tuple of the degree scores

Source code in TimelineKGQA/generator.py

@staticmethod
def calculate_degree_scores(event_df: pd.DataFrame):
    """
    group by subject, each subject will have a degree
    group by object, each object will have a degree
    merge subject_degree and object_degree back to the event_df, it will have extract two columns
    - subject_degree
    - object_degree
    then we will calculate the degree score based on the two columns

    In this way, we have the degree score for each event

    Args:
        event_df (pd.DataFrame): The dataframe of the events

    Returns:
        Tuple: The tuple of the degree scores
    """

    subject_degree_df = (
        event_df.groupby("subject").size().reset_index(name="subject_degree")
    )
    object_degree_df = (
        event_df.groupby("object").size().reset_index(name="object_degree")
    )

    event_df = event_df.merge(subject_degree_df, on="subject", how="left")
    event_df = event_df.merge(object_degree_df, on="object", how="left")

    event_df["degree_score"] = (
        event_df["subject_degree"] + event_df["object_degree"]
    )
    # show the summary of the degree score
    logger.debug(event_df["degree_score"].describe())

    # flat the score to 1,2,3,4 based on the quartile
    event_df["degree_score"] += np.random.normal(
        0, 1e-5, size=event_df["degree_score"].shape
    )
    event_df["degree_score"] = pd.qcut(event_df["degree_score"], q=4, labels=False)
    event_df["degree_score"] = event_df["degree_score"] + 1

    # normalize the score to 0-1
    event_df["degree_score"] = event_df["degree_score"] / 4

    return event_df["degree_score"].values

`complex_question_generation()`

This is to generate the complex question, which will involve three events and timeline

Type 1: Before Bush, after Kennedy, who is the president of US?
Type 2: Who is the first president of US among Bush, Kennedy, and Obama?

Source code in TimelineKGQA/generator.py

def complex_question_generation(self):
    """
    This is to generate the complex question, which will involve three events and timeline

    - Type 1: Before Bush, after Kennedy, who is the president of US?
    - Type 2: Who is the first president of US among Bush, Kennedy, and Obama?
    """

    # next step is to construct three events
    insert_values_list = []
    bulk_sql_pointer = 0

    for item in self.sample_complex_events:
        first_event = self.events_df.iloc[item[0]]
        second_event = self.events_df.iloc[item[1]]
        third_event = self.events_df.iloc[item[2]]
        source_kg_id = (
            first_event["id"] * 1000000 * 1000000
            + second_event["id"] * 1000000
            + third_event["id"]
        )

        questions = self.complex_question_generation_individual(
            first_event=first_event.to_dict(),
            second_event=second_event.to_dict(),
            third_event=third_event.to_dict(),
            template_based=True,
            paraphrased=self.paraphrased,
        )
        for question_obj in questions:
            question_obj["source_kg_id"] = int(source_kg_id)
            # get dict to tuple, sequence should be the same as the sql command
            data = (
                question_obj["source_kg_id"],
                question_obj["question"],
                question_obj["answer"],
                question_obj["paraphrased_question"],
                question_obj["events"],
                question_obj["question_level"],
                question_obj["question_type"],
                question_obj["answer_type"],
                question_obj["temporal_relation"],
            )
            insert_values_list.append(data)
            bulk_sql_pointer += 1
            if bulk_sql_pointer % self.bulk_sql_size == 0:
                self.bulk_insert(insert_values_list)
                insert_values_list = []

    self.bulk_insert(insert_values_list)

`complex_question_generation_individual(first_event, second_event, third_event, template_based=True, paraphrased=True)`

Parameters:

Name	Type	Description	Default
`first_event`	`dict`	The first event	required
`second_event`	`dict`	The second event	required
`third_event`	`dict`	The third event	required
`template_based`	`bool`	Whether you use the template based question generation	`True`
`paraphrased`	`bool`	Whether you do the paraphrase for the question, if set to False, then the paraphrased_question will be the same as the question	`True`

Returns:

Name	Type	Description
`dict`	`List[dict]`	The generated questions - question - answer - paraphrased_question - events - question_level: Complex - question_type: The type of the question - answer_type: The type of the answer

question_type:
- timeline_position_retrieval *2 + temporal constrained retrieval
- timeline_position_retrieval *3
answer_type:
- type1:
  - subject
    - trel b, trel c, ? predicate object
  - object
    - trel b, trel c, subject predicate ?
- type2:
  - Infer a new time range: Union/Intersection
    - trel b, trel c, from when to when the subject predicate object? (intersection)
    - within (trel b, trel c), who is the subject predicate object? (union)
  - Infer a temporal relation: Allen
    - ? More making sense one is ranking
    - hard to justify the question that
    - If we ask for choice question, it will be between two events
    - If we ask for true/false, event a,b,c; ab, ac, bc; Question, ab+ac => is bc relation True
  - Infer a list of time ranges: Ranking
    - Who is the {} among a,b,c? => an
  - Infer duration, and then compare
    - Who is the president of US for the longest time among a, b, c? => a

Source code in TimelineKGQA/generator.py

def complex_question_generation_individual(
    self,
    first_event: dict,
    second_event: dict,
    third_event: dict,
    template_based: bool = True,
    paraphrased: bool = True,
) -> List[dict]:
    """
    Args:
        first_event (dict): The first event
        second_event (dict): The second event
        third_event (dict): The third event
        template_based (bool): Whether you use the template based question generation
        paraphrased (bool): Whether you do the paraphrase for the question, if set to False,
                then the paraphrased_question will be the same as the question

    Returns:
        dict: The generated questions
            - question
            - answer
            - paraphrased_question
            - events
            - question_level: Complex
            - question_type: The type of the question
            - answer_type: The type of the answer


    - question_type:
        - timeline_position_retrieval *2 + temporal constrained retrieval
        - timeline_position_retrieval *3
    - answer_type:
        - type1:
            - subject
                - trel b, trel c, ? predicate object
            - object
                - trel b, trel c, subject predicate ?
        - type2:
            - Infer a new time range: Union/Intersection
                - trel b, trel c, from when to when the subject predicate object? (intersection)
                - within (trel b, trel c), who is the subject predicate object? (union)
            - Infer a temporal relation: Allen
                - ? More making sense one is ranking
                - hard to justify the question that
                - If we ask for choice question, it will be between two events
                - If we ask for true/false, event a,b,c; ab, ac, bc;  Question, ab+ac => is bc relation True
            - Infer a list of time ranges: Ranking
                - Who is the {} among a,b,c? => an
            - Infer duration, and then compare
                - Who is the president of US for the longest time among a, b, c? => a

    """

    first_event_subject = first_event["subject"]
    first_event_predicate = first_event["predicate"]
    first_event_object = first_event["object"]
    first_event_start_time = first_event["start_time"]
    first_event_end_time = first_event["end_time"]

    second_event_subject = second_event["subject"]
    second_event_predicate = second_event["predicate"]
    second_event_object = second_event["object"]
    second_event_start_time = second_event["start_time"]
    second_event_end_time = second_event["end_time"]

    third_event_subject = third_event["subject"]
    third_event_predicate = third_event["predicate"]
    third_event_object = third_event["object"]
    third_event_start_time = third_event["start_time"]
    third_event_end_time = third_event["end_time"]

    first_event_start_time_dt, first_event_end_time_dt = self.util_str_to_datetime(
        [first_event_start_time, first_event_end_time]
    )
    second_event_start_time_dt, second_event_end_time_dt = (
        self.util_str_to_datetime([second_event_start_time, second_event_end_time])
    )
    third_event_start_time_dt, third_event_end_time_dt = self.util_str_to_datetime(
        [third_event_start_time, third_event_end_time]
    )

    # first generate
    complex_type_1_a_questions = []
    questions = []

    # timeline_position_retrieval *2 + temporal constrained retrieval
    # ask for the first subject
    complex_type_1_a_questions.append(
        {
            "question": f"??? {first_event_predicate} {first_event_object} {second_event_predicate} "
            f"{second_event_object} {third_event_predicate} {third_event_object}?",
            "answer": f"{first_event_subject}",
            "paraphrased_question": None,
            "events": [
                f"{first_event_subject}|{first_event_predicate}|{first_event_object}|"
                f"{first_event_start_time}|{first_event_end_time}",
                f"{second_event_subject}|{second_event_predicate}|{second_event_object}|"
                f"{second_event_start_time}|{second_event_end_time}",
                f"{third_event_subject}|{third_event_predicate}|{third_event_object}|"
                f"{third_event_start_time}|{third_event_end_time}",
            ],
            "question_level": "complex",
            "question_type": "timeline_position_retrieval*2+temporal_constrained_retrieval",
            "answer_type": "subject",
            "temporal_relation": None,
        }
    )
    # ask for first object
    complex_type_1_a_questions.append(
        {
            "question": f"{first_event_subject} {first_event_predicate} ??? {second_event_predicate} "
            f"{second_event_object} {third_event_predicate} {third_event_object}?",
            "answer": f"{first_event_object}",
            "paraphrased_question": None,
            "events": [
                f"{first_event_subject}|{first_event_predicate}|{first_event_object}|"
                f"{first_event_start_time}|{first_event_end_time}",
                f"{second_event_subject}|{second_event_predicate}|{second_event_object}|"
                f"{second_event_start_time}|{second_event_end_time}",
                f"{third_event_subject}|{third_event_predicate}|{third_event_object}|"
                f"{third_event_start_time}|{third_event_end_time}",
            ],
            "question_level": "complex",
            "question_type": "timeline_position_retrieval*2+temporal_constrained_retrieval",
            "answer_type": "object",
            "temporal_relation": None,
        }
    )

    questions += complex_type_1_a_questions

    """
    For duration before, duration after type question
    """

    complex_type_1_b_questions = []
    # this will be added later when we process the questions with template

    """
    Timeline Position Retrieval + Timeline Position Retrieval + Timeline Position Retrieval
    """

    complex_type_2_questions = [
        {
            "question": f"{first_event_subject} {first_event_predicate} {first_event_object} "
            f"{second_event_predicate} {second_event_object} "
            f"{third_event_predicate} {third_event_object}?",
            "answer": "Union/Intersection of the time range",
            "paraphrased_question": None,
            "events": [
                f"{first_event_subject}|{first_event_predicate}|{first_event_object}|"
                f"{first_event_start_time}|{first_event_end_time}",
                f"{second_event_subject}|{second_event_predicate}|{second_event_object}|"
                f"{second_event_start_time}|{second_event_end_time}",
                f"{third_event_subject}|{third_event_predicate}|{third_event_object}|"
                f"{third_event_start_time}|{third_event_end_time}",
            ],
            "question_level": "complex",
            "question_type": "timeline_position_retrieval*3",
            "answer_type": "relation_union_or_intersection",
            "temporal_relation": "intersection",
        },
        {
            "question": f"{first_event_subject} {first_event_predicate} {first_event_object} "
            f"{second_event_predicate} {second_event_object} "
            f"{third_event_predicate} {third_event_object}?",
            "answer": "Union/Intersection of the time range",
            "paraphrased_question": None,
            "events": [
                f"{first_event_subject}|{first_event_predicate}|{first_event_object}|"
                f"{first_event_start_time}|{first_event_end_time}",
                f"{second_event_subject}|{second_event_predicate}|{second_event_object}|"
                f"{second_event_start_time}|{second_event_end_time}",
                f"{third_event_subject}|{third_event_predicate}|{third_event_object}|"
                f"{third_event_start_time}|{third_event_end_time}",
            ],
            "question_level": "complex",
            "question_type": "timeline_position_retrieval*3",
            "answer_type": "relation_union_or_intersection",
            "temporal_relation": "union",
        },
        {
            "question": f"{first_event_subject} {first_event_predicate} {first_event_object} "
            f"{second_event_predicate} {second_event_object} "
            f"{third_event_predicate} {third_event_object}?",
            "answer": "Duration",
            "paraphrased_question": None,
            "events": [
                f"{first_event_subject}|{first_event_predicate}|{first_event_object}|"
                f"{first_event_start_time}|{first_event_end_time}",
                f"{second_event_subject}|{second_event_predicate}|{second_event_object}|"
                f"{second_event_start_time}|{second_event_end_time}",
                f"{third_event_subject}|{third_event_predicate}|{third_event_object}|"
                f"{third_event_start_time}|{third_event_end_time}",
            ],
            "question_level": "complex",
            "question_type": "timeline_position_retrieval*3",
            "answer_type": "relation_duration",
            "temporal_relation": None,
        },
        # add ranking one
        {
            "question": f"Who is the xxx among {first_event_subject}, {second_event_subject}, "
            f"and {third_event_subject}?",
            "answer": "Ranking",
            "paraphrased_question": None,
            "events": [
                f"{first_event_subject}|{first_event_predicate}|{first_event_object}|"
                f"{first_event_start_time}|{first_event_end_time}",
                f"{second_event_subject}|{second_event_predicate}|{second_event_object}|"
                f"{second_event_start_time}|{second_event_end_time}",
                f"{third_event_subject}|{third_event_predicate}|{third_event_object}|"
                f"{third_event_start_time}|{third_event_end_time}",
            ],
            "question_level": "complex",
            "question_type": "timeline_position_retrieval*3",
            "answer_type": "relation_ranking",
            "temporal_relation": "ranking",
        },
    ]

    questions += complex_type_2_questions
    temporal_answer = None

    if template_based:
        for question_draft in questions:
            this_type_templates = QUESTION_TEMPLATES[
                question_draft["question_level"]
            ][question_draft["question_type"]][question_draft["answer_type"]]

            if (
                question_draft["answer_type"] == "subject"
                or question_draft["answer_type"] == "object"
            ):
                """
                Handle the Complex Type 1 Questions here:
                a,b,c, will ask for a information.
                Get temporal_relation_12, temporal_relation_13, then generate the question
                """
                temporal_relation_12 = self.relation_allen_time_range(
                    time_range_a=[
                        first_event_start_time_dt,
                        first_event_end_time_dt,
                    ],
                    time_range_b=[
                        second_event_start_time_dt,
                        second_event_end_time_dt,
                    ],
                )
                temporal_relation_13 = self.relation_allen_time_range(
                    time_range_a=[
                        first_event_start_time_dt,
                        first_event_end_time_dt,
                    ],
                    time_range_b=[
                        third_event_start_time_dt,
                        third_event_end_time_dt,
                    ],
                )

                temporal_relation_12_semantic = temporal_relation_12.get("semantic")
                temporal_relation_13_semantic = temporal_relation_13.get("semantic")
                question_draft["temporal_relation"] = (
                    f"{temporal_relation_12['relation']}&{temporal_relation_13['relation']}"
                )
                random_pick_template = random.choice(this_type_templates)

                question_draft["question"] = random_pick_template.format(
                    first_event_subject=first_event_subject,
                    first_event_predicate=first_event_predicate,
                    first_event_object=first_event_object,
                    temporal_relation_12=temporal_relation_12_semantic,
                    second_event_subject=second_event_subject,
                    second_event_predicate=second_event_predicate,
                    second_event_object=second_event_object,
                    temporal_relation_13=temporal_relation_13_semantic,
                    third_event_subject=third_event_subject,
                    third_event_predicate=third_event_predicate,
                    third_event_object=third_event_object,
                )

                # this will generate the basic temporal relation questions.
                # then we will want to generate the duration_before, duration_after
                can_generate_duration_question = False
                if temporal_relation_12_semantic in ["before", "after"]:
                    duration = self.relation_duration_calculation(
                        time_range_a=[
                            first_event_start_time_dt,
                            first_event_end_time_dt,
                        ],
                        time_range_b=[
                            second_event_start_time_dt,
                            second_event_end_time_dt,
                        ],
                        temporal_operator=f"duration_{temporal_relation_12_semantic}",
                    )
                    temporal_relation_12_semantic = (
                        f"{duration} {temporal_relation_12_semantic}"
                    )
                    logger.debug(temporal_relation_12_semantic)
                    can_generate_duration_question = True
                if temporal_relation_13_semantic in ["before", "after"]:
                    duration = self.relation_duration_calculation(
                        time_range_a=[
                            first_event_start_time_dt,
                            first_event_end_time_dt,
                        ],
                        time_range_b=[
                            third_event_start_time_dt,
                            third_event_end_time_dt,
                        ],
                        temporal_operator=f"duration_{temporal_relation_13_semantic}",
                    )
                    temporal_relation_13_semantic = (
                        f"{duration} {temporal_relation_13_semantic}"
                    )
                    logger.debug(temporal_relation_13_semantic)
                    can_generate_duration_question = True
                if can_generate_duration_question:
                    # copy a new question draft
                    duration_question_draft = copy.deepcopy(question_draft)
                    duration_question_draft["question"] = (
                        random_pick_template.format(
                            first_event_subject=first_event_subject,
                            first_event_predicate=first_event_predicate,
                            first_event_object=first_event_object,
                            temporal_relation_12=temporal_relation_12_semantic,
                            second_event_subject=second_event_subject,
                            second_event_predicate=second_event_predicate,
                            second_event_object=second_event_object,
                            temporal_relation_13=temporal_relation_13_semantic,
                            third_event_subject=third_event_subject,
                            third_event_predicate=third_event_predicate,
                            third_event_object=third_event_object,
                        )
                    )
                    duration_question_draft["temporal_relation"] = (
                        f"duration_{temporal_relation_12_semantic}&duration_{temporal_relation_13_semantic}"
                    )
                    complex_type_1_b_questions.append(duration_question_draft)
            else:
                # handle the Timeline Position Retrieval + Timeline Position Retrieval + Timeline Position Retrieval
                if (
                    question_draft["answer_type"]
                    == "relation_union_or_intersection"
                ):
                    temporal_relation = question_draft["temporal_relation"]
                    random_pick_template = random.choice(
                        this_type_templates[temporal_relation]
                    )
                    temporal_answer = self.relation_union_or_intersection(
                        time_ranges=[
                            (first_event_start_time_dt, first_event_end_time_dt),
                            (second_event_start_time_dt, second_event_end_time_dt),
                            (third_event_start_time_dt, third_event_end_time_dt),
                        ],
                        temporal_operator=temporal_relation,
                    )

                    question_draft["question"] = random_pick_template.format(
                        first_event_subject=first_event_subject,
                        first_event_predicate=first_event_predicate,
                        first_event_object=first_event_object,
                        second_event_subject=second_event_subject,
                        second_event_predicate=second_event_predicate,
                        second_event_object=second_event_object,
                        third_event_subject=third_event_subject,
                        third_event_predicate=third_event_predicate,
                        third_event_object=third_event_object,
                    )
                    logger.debug(question_draft["question"])
                    logger.debug(temporal_answer)
                    if temporal_answer is None:
                        temporal_answer = "No Answer"
                    question_draft["answer"] = temporal_answer
                elif question_draft["answer_type"] == "relation_duration":
                    """
                    There are four types in this category
                    - duration => which is the intersection of the two time range
                    - duration_compare => longer shorter equal
                    - sum => total duration of the two time range, which is actually the union
                    - average => average duration of the two time range
                    """
                    temporal_relation = random.choice(
                        [
                            "duration",
                            "duration_compare",
                            "sum",
                            "average",
                        ]
                    )
                    random_pick_template = random.choice(
                        this_type_templates[temporal_relation]
                    )
                    question_draft["temporal_relation"] = temporal_relation
                    if temporal_relation == "duration":
                        temporal_answer = self.relation_union_or_intersection(
                            time_ranges=[
                                (
                                    first_event_start_time_dt,
                                    first_event_end_time_dt,
                                ),
                                (
                                    second_event_start_time_dt,
                                    second_event_end_time_dt,
                                ),
                                (
                                    third_event_start_time_dt,
                                    third_event_end_time_dt,
                                ),
                            ],
                            temporal_operator="intersection",
                        )
                        question_draft["question"] = random_pick_template.format(
                            first_event_subject=first_event_subject,
                            first_event_predicate=first_event_predicate,
                            first_event_object=first_event_object,
                            second_event_subject=second_event_subject,
                            second_event_predicate=second_event_predicate,
                            second_event_object=second_event_object,
                            third_event_subject=third_event_subject,
                            third_event_predicate=third_event_predicate,
                            third_event_object=third_event_object,
                        )
                    elif temporal_relation == "duration_compare":
                        # we do duration ranking here
                        duration_rank_by_index = self.relation_duration(
                            time_ranges=[
                                [
                                    first_event_start_time_dt,
                                    first_event_end_time_dt,
                                ],
                                [
                                    second_event_start_time_dt,
                                    second_event_end_time_dt,
                                ],
                                [
                                    third_event_start_time_dt,
                                    third_event_end_time_dt,
                                ],
                            ],
                            agg_temporal_operator="ranking",
                        )
                        logger.debug(duration_rank_by_index)
                        temporal_answer = duration_rank_by_index[0] + 1
                        question_draft["question"] = random_pick_template.format(
                            first_event_subject=first_event_subject,
                            first_event_predicate=first_event_predicate,
                            first_event_object=first_event_object,
                            second_event_subject=second_event_subject,
                            second_event_predicate=second_event_predicate,
                            second_event_object=second_event_object,
                            third_event_subject=third_event_subject,
                            third_event_predicate=third_event_predicate,
                            third_event_object=third_event_object,
                            temporal_duration_rank=temporal_answer,
                        )
                    elif temporal_relation == "sum":
                        temporal_answer = self.util_average_duration_calculation(
                            time_ranges=[
                                [
                                    first_event_start_time_dt,
                                    first_event_end_time_dt,
                                ],
                                [
                                    second_event_start_time_dt,
                                    second_event_end_time_dt,
                                ],
                                [
                                    third_event_start_time_dt,
                                    third_event_end_time_dt,
                                ],
                            ],
                            temporal_operator="sum",
                        )
                        question_draft["question"] = random_pick_template.format(
                            first_event_subject=first_event_subject,
                            first_event_predicate=first_event_predicate,
                            first_event_object=first_event_object,
                            second_event_subject=second_event_subject,
                            second_event_predicate=second_event_predicate,
                            second_event_object=second_event_object,
                            third_event_subject=third_event_subject,
                            third_event_predicate=third_event_predicate,
                            third_event_object=third_event_object,
                        )
                    elif temporal_relation == "average":
                        temporal_answer = self.util_average_duration_calculation(
                            time_ranges=[
                                [
                                    first_event_start_time_dt,
                                    first_event_end_time_dt,
                                ],
                                [
                                    second_event_start_time_dt,
                                    second_event_end_time_dt,
                                ],
                                [
                                    third_event_start_time_dt,
                                    third_event_end_time_dt,
                                ],
                            ],
                            temporal_operator="average",
                        )
                        logger.debug(temporal_answer)
                        question_draft["question"] = random_pick_template.format(
                            first_event_subject=first_event_subject,
                            first_event_predicate=first_event_predicate,
                            first_event_object=first_event_object,
                            second_event_subject=second_event_subject,
                            second_event_predicate=second_event_predicate,
                            second_event_object=second_event_object,
                            third_event_subject=third_event_subject,
                            third_event_predicate=third_event_predicate,
                            third_event_object=third_event_object,
                        )
                    question_draft["answer"] = temporal_answer
                    question_draft["temporal_relation"] = temporal_relation
                elif question_draft["answer_type"] == "relation_ranking":
                    # random select, ranking based on start time or end time
                    rank_by_what = random.choice(
                        ["rank_start_time", "rank_end_time"]
                    )
                    rank_by_index = self.relation_ordinal_time_range(
                        time_ranges=[
                            [first_event_start_time_dt, first_event_end_time_dt],
                            [second_event_start_time_dt, second_event_end_time_dt],
                            [third_event_start_time_dt, third_event_end_time_dt],
                        ],
                        agg_temporal_operator=rank_by_what,
                    )

                    random_pick_template = random.choice(
                        this_type_templates[rank_by_what]
                    )
                    question_draft["question"] = random_pick_template.format(
                        first_event_subject=first_event_subject,
                        first_event_predicate=first_event_predicate,
                        first_event_object=first_event_object,
                        second_event_subject=second_event_subject,
                        second_event_predicate=second_event_predicate,
                        second_event_object=second_event_object,
                        third_event_subject=third_event_subject,
                        third_event_predicate=third_event_predicate,
                        third_event_object=third_event_object,
                    )
                    temporal_answer = rank_by_index[0] + 1
                    question_draft["answer"] = temporal_answer
                    question_draft["temporal_relation"] = rank_by_what

    questions += complex_type_1_b_questions
    if paraphrased:
        for question_obj in questions:
            paraphrased_question = paraphrase_medium_question(
                question=question_obj["question"],
            )
            logger.info(f"paraphrased_question: {paraphrased_question}")
            question_obj["paraphrased_question"] = paraphrased_question

    return questions

`medium_question_generation()`

This will involve mainly two types of questions

Timeline Position Retrival => Temporal Constrained Retrieval
Timeline Position Retrival + Timeline Position Retrival

question_level: medium
question_type:
- timeline_recovery_temporal_constrained_retrieval
- timeline_recovery_timeline_recovery
answer_type:
- entity:
  - subject:
  - object
- temporal related
  - Infer a new time range: Union/Intersection
  - Infer a temporal relation: Allen
  - Infer duration, and then compare
  - Note: Ranking will be the same as Allen, so it will be in Complex level

Source code in TimelineKGQA/generator.py

def medium_question_generation(self):
    """
    This will involve mainly two types of questions

    - **Timeline Position Retrival => Temporal Constrained Retrieval**
    - **Timeline Position Retrival + Timeline Position Retrival**

    ---

    - question_level: medium
    - question_type:
        - timeline_recovery_temporal_constrained_retrieval
        - timeline_recovery_timeline_recovery
    - answer_type:
        - entity:
            - subject:
            - object
        - temporal related
            - Infer a new time range: Union/Intersection
            - Infer a temporal relation: Allen
            - Infer duration, and then compare
            - Note: Ranking will be the same as Allen, so it will be in **Complex** level

    """

    insert_values_list = []
    bulk_sql_pointer = 0

    for item in self.sample_medium_events:
        first_event = self.events_df.iloc[item[0]]
        second_event = self.events_df.iloc[item[1]]

        source_kg_id = first_event["id"] * 1000000 + second_event["id"]
        logger.debug(f"Generating question for source_kg_id: {source_kg_id}")
        questions = self.medium_question_generation_individual(
            first_event=first_event.to_dict(),
            second_event=second_event.to_dict(),
            template_based=True,
            paraphrased=self.paraphrased,
        )

        for question_obj in questions:

            question_obj["source_kg_id"] = int(source_kg_id)
            # get dict to tuple, sequence should be the same as the sql command
            data = (
                question_obj["source_kg_id"],
                question_obj["question"],
                question_obj["answer"],
                question_obj["paraphrased_question"],
                question_obj["events"],
                question_obj["question_level"],
                question_obj["question_type"],
                question_obj["answer_type"],
                question_obj["temporal_relation"],
            )
            insert_values_list.append(data)
            bulk_sql_pointer += 1
            if bulk_sql_pointer % self.bulk_sql_size == 0:
                self.bulk_insert(values=insert_values_list)
                insert_values_list = []
    self.bulk_insert(values=insert_values_list)

`medium_question_generation_individual(first_event, second_event, template_based=True, paraphrased=False)`

Parameters:

Name	Type	Description	Default
`first_event`	`dict`	The first event	required
`second_event`	`dict`	The second event	required
`template_based`	`bool`	Whether you use the template based question generation	`True`
`paraphrased`	`bool`	Whether you do the paraphrase for the question, if set to False, then the paraphrased_question will be the same as the question	`False`

Returns:

Name	Type	Description
`dict`	`List[dict]`	The generated questions - question - answer - paraphrased_question - events - question_level: Medium - question_type: The type of the question - answer_type: The type of the answer

question_type:
- timeline_position_retrieval_temporal_constrained_retrieval
  - For this one, the logic/reasoning/math part will be like: TimeRange + Temporal Semantic Operation => TimeRange
  - Then the interesting part will be the Timeline Operation, we have mentioned several types of operations below.
    - There are mostly from numeric to semantic perspective
    - Here is the reverse process: name it Temporal Semantic Operation
    - So this is trying to convert the temporal semantic representation to a numeric operation and then get a new operation.
- timeline_position_retrieval_timeline_position_retrieval
  - For the logic/reasoning/math side, it actually is TimeRange vs TimeRange => Timeline Operation
    - Get a way to ask about this comparison relations.
    - So the question will mainly be about whether this relation is True, or which relation it is.
    - For duration, we can ask about the duration of the two events, and then compare
    - Or we can compare the event ranking based on the time range
- there is another types: Three years before 2019, who is the president of China? => It is a valid question, but nobody will in this way.
  - It will be normally classified into simple: in 2016, who is the president of China?
  - Or it will be something like: Three years before bush end the term, who is the president of China? => This will be classified into Medium, and belong to the timeline_position_retrieval_temporal_constrained_retrieval
answer_type:
- subject, object for timeline_position_retrieval_temporal_constrained_retrieval
  - subject
  - object
  - only focus on the first one, as the second will always become the first later
- temporal related for timeline_position_retrieval_timeline_position_retrieval
  - Infer a new time range: Union/Intersection
  - Infer a temporal relation: Allen
  - Infer a list of time ranges: Ranking
  - Infer duration, and then compare

Process:

The quality of the question is not guaranteed by LLM directly if we just mask out the answer. So we will use the template to generate the questions, then use LLM to paraphrase the questions.

Source code in TimelineKGQA/generator.py

def medium_question_generation_individual(
    self,
    first_event: dict,
    second_event: dict,
    template_based: bool = True,
    paraphrased: bool = False,
) -> List[dict]:
    """

    Args:
        first_event (dict): The first event
        second_event (dict): The second event
        template_based (bool): Whether you use the template based question generation
        paraphrased (bool): Whether you do the paraphrase for the question, if set to False,
                then the paraphrased_question will be the same as the question

    Returns:
        dict: The generated questions
            - question
            - answer
            - paraphrased_question
            - events
            - question_level: Medium
            - question_type: The type of the question
            - answer_type: The type of the answer

    - question_type:
        - timeline_position_retrieval_temporal_constrained_retrieval
            - For this one, the logic/reasoning/math part will be like: **TimeRange** +
                Temporal Semantic Operation => **TimeRange**
            - Then the interesting part will be the Timeline Operation,
                we have mentioned several types of operations below.
                - There are mostly from numeric to semantic perspective
                - Here is the reverse process: name it Temporal Semantic Operation
                - So this is trying to convert the temporal semantic representation to
                    a numeric operation and then get a new operation.
        - timeline_position_retrieval_timeline_position_retrieval
            - For the logic/reasoning/math side, it actually is **TimeRange** vs **TimeRange** => Timeline Operation
                - Get a way to ask about this comparison relations.
                - So the question will mainly be about whether this relation is True, or which relation it is.
                - For duration, we can ask about the duration of the two events, and then compare
                - Or we can compare the event ranking based on the time range
        - there is another types: Three years before 2019,  who is the president of China? =>
            It is a valid question, but nobody will in this way.
            - It will be normally classified into **simple**: in 2016, who is the president of China?
            - Or it will be something like: Three years before bush end the term, who is the president of China?
                => This will be classified into **Medium**,
                    and belong to the timeline_position_retrieval_temporal_constrained_retrieval
    - answer_type:
        - subject, object for timeline_position_retrieval_temporal_constrained_retrieval
            - subject
            - object
            - only focus on the first one, as the second will always become the first later
        - temporal related for timeline_position_retrieval_timeline_position_retrieval
            - Infer a new time range: Union/Intersection
            - Infer a temporal relation: Allen
            - Infer a list of time ranges: Ranking
            - Infer duration, and then compare

    Process:

    The quality of the question is not guaranteed by LLM directly if we just mask out the answer.
    So we will use the template to generate the questions, then use LLM to paraphrase the questions.

    """
    first_event_subject = first_event["subject"]
    first_event_predicate = first_event["predicate"]
    first_event_object = first_event["object"]
    first_event_start_time = first_event["start_time"]
    first_event_end_time = first_event["end_time"]

    second_event_subject = second_event["subject"]
    second_event_predicate = second_event["predicate"]
    second_event_object = second_event["object"]
    second_event_start_time = second_event["start_time"]
    second_event_end_time = second_event["end_time"]

    first_event_start_time_dt, first_event_end_time_dt = self.util_str_to_datetime(
        [first_event_start_time, first_event_end_time]
    )
    second_event_start_time_dt, second_event_end_time_dt = (
        self.util_str_to_datetime([second_event_start_time, second_event_end_time])
    )

    # first generate
    # timeline_position_retrieval => timeline_position_retrieval
    # this will ask for the subject or object in one of the event

    medium_type_1_a_questions = []
    questions = []
    """
    Timeline Position Retrieval => Temporal Constrained Retrieval Questions
    """
    # NOTES: question here actually is not used, because we will replace it with the template.
    # It is putting there to get the idea about the types of questions we are generating
    """
    The key part of this type is:
    We need to cover as many temporal semantic operations as possible
    - Before, After, During, this is the most common one and shown in the literature
    - Starts from the same time, Ends at the same time, Meets, Overlap, this is another way to add the
        temporal condition (inspired by the allen logic)
    - Above are from allen temporal logic and intersection/union
    - We can also add the ranking ones, however, before/after is the same as first/last, under this category
    - Then the rest is the one for duration, question like 3 years before, 3 years after, etc.

    So we have main two types of questions here:
    - Relation: Before, After, During ｜ Starts from the same time, Ends at the same time, Meets, Overlap
        - calculate the relation first, then generated based on template
        - Before: Who is the president of US before the end of Bush's term?
        - After: Who is the president of US after the start of Bush's term?
        - Starts from the same time: Who and Bush start their term as father and
            President of US respectively at the same time?
        - Ends at the same time: Who and Bush end their term as father and
            President of US respectively at the same time?
        - Meets: ?
        - During: Who is the president of US during Bush's term?
        - Overlap: Bush as the president of US meets who when the guy become the father?
    - Duration: 3 years before, 3 years after, 3 years after the end, etc.
        - calculate the duration first, then generated based on template
        - 3 years before: Who is the president of US 3 years before the end of Bush's term?
        - 3 years after: Who is the president of US 3 years after the start of Bush's term?
        - 3 years after the end: Who is the president of US 3 years after the end of Bush's term?
        - 3 years after the start: Who is the president of US 3 years after the start of Bush's term?
        - meets/during/overlap hard to get a time point, so not considered here.
    """
    # ask for first subject
    medium_type_1_a_questions.append(
        {
            "question": f"??? {first_event_predicate} {first_event_object} [Timeline Operation on "
            f"({first_event_start_time}, {first_event_end_time}) vs ({second_event_start_time}, "
            f"{second_event_end_time})] {second_event_subject}"
            f"{second_event_predicate} {second_event_object}?",
            "answer": f"{first_event_subject}",
            "paraphrased_question": None,
            "events": [
                f"{first_event_subject}|{first_event_predicate}|{first_event_object}|"
                f"{first_event_start_time}|{first_event_end_time}",
                f"{second_event_subject}|{second_event_predicate}|{second_event_object}|"
                f"{second_event_start_time}|{second_event_end_time}",
            ],
            "question_level": "medium",
            "question_type": "timeline_position_retrieval_temporal_constrained_retrieval",
            "answer_type": "subject",
            "temporal_relation": None,
        }
    )

    # ask for first object
    medium_type_1_a_questions.append(
        {
            "question": f"{first_event_subject} {first_event_predicate} ??? [Timeline Operation on "
            f"({first_event_start_time}, {first_event_end_time}) vs ({second_event_start_time},"
            f"{second_event_end_time})] {second_event_subject}"
            f" {second_event_predicate} {second_event_object}?",
            "answer": f"{first_event_object}",
            "paraphrased_question": None,
            "events": [
                f"{first_event_subject}|{first_event_predicate}|{first_event_object}|"
                f"{first_event_start_time}|{first_event_end_time}",
                f"{second_event_subject}|{second_event_predicate}|{second_event_object}|"
                f"{second_event_start_time}|{second_event_end_time}",
            ],
            "question_level": "medium",
            "question_type": "timeline_position_retrieval_temporal_constrained_retrieval",
            "answer_type": "object",
            "temporal_relation": None,
        }
    )
    questions += medium_type_1_a_questions

    """
    For duration before, duration after type  question
    """
    medium_type_1_b_questions = []
    # this will be added later when we process the questions with template

    """
    Timeline Position Retrieval + Timeline Position Retrieval Questions

    This one is mainly from numeric to temporal semantic

    - Infer a new time range: Union/Intersection
    - Infer a temporal relation: Allen
    - Infer a list of time ranges: Ranking (not considered here)
    - Infer duration, and then compare
    """
    # Timeline Position Retrieval + Timeline Position Retrieval

    # ask for union/intersection of the time range

    medium_type_2_questions = [
        {
            "question": f"{first_event_subject} {first_event_predicate} {first_event_object} ???"
            f"[Timeline Operation on ({first_event_start_time}, {first_event_end_time}) vs "
            f"({second_event_start_time}, {second_event_end_time})]??? "
            f"{second_event_subject} {second_event_predicate} {second_event_object}?",
            "answer": "Union/Intersection of the time range",
            "paraphrased_question": None,
            "events": [
                f"{first_event_subject}|{first_event_predicate}|{first_event_object}|"
                f"{first_event_start_time}|{first_event_end_time}",
                f"{second_event_subject}|{second_event_predicate}|{second_event_object}|"
                f"{second_event_start_time}|{second_event_end_time}",
            ],
            "question_level": "medium",
            "question_type": "timeline_position_retrieval_timeline_position_retrieval",
            "answer_type": "relation_union_or_intersection",
            "temporal_relation": "intersection",
        },
        {
            "question": f"{first_event_subject} {first_event_predicate} {first_event_object} "
            f"???[Timeline Operation on ({first_event_start_time}, {first_event_end_time}) "
            f"vs ({second_event_start_time}, {second_event_end_time})]??? "
            f"{second_event_subject} {second_event_predicate} {second_event_object}?",
            "answer": "Union/Intersection of the time range",
            "paraphrased_question": None,
            "events": [
                f"{first_event_subject}|{first_event_predicate}|{first_event_object}|"
                f"{first_event_start_time}|{first_event_end_time}",
                f"{second_event_subject}|{second_event_predicate}|{second_event_object}|"
                f"{second_event_start_time}|{second_event_end_time}",
            ],
            "question_level": "medium",
            "question_type": "timeline_position_retrieval_timeline_position_retrieval",
            "answer_type": "relation_union_or_intersection",
            "temporal_relation": "union",
        },
        {
            "question": f"{first_event_subject} {first_event_predicate} {first_event_object} "
            f"???[Timeline Operation on ({first_event_start_time}, "
            f"{first_event_end_time}) vs ({second_event_start_time}, "
            f"{second_event_end_time})]??? {second_event_subject} "
            f"{second_event_predicate} {second_event_object}?",
            "answer": "Duration",
            "paraphrased_question": None,
            "events": [
                f"{first_event_subject}|{first_event_predicate}|{first_event_object}|"
                f"{first_event_start_time}|{first_event_end_time}",
                f"{second_event_subject}|{second_event_predicate}|{second_event_object}|"
                f"{second_event_start_time}|{second_event_end_time}",
            ],
            "question_level": "medium",
            "question_type": "timeline_position_retrieval_timeline_position_retrieval",
            "answer_type": "relation_duration",
            "temporal_relation": None,
        },
    ]
    questions += medium_type_2_questions
    temporal_answer = None
    if template_based:
        for question_draft in questions:
            this_type_templates = QUESTION_TEMPLATES[
                question_draft["question_level"]
            ][question_draft["question_type"]][question_draft["answer_type"]]

            if (
                question_draft["answer_type"] == "subject"
                or question_draft["answer_type"] == "object"
            ):
                """
                Handle the Medium Type 1 Questions here: Both a and b
                First calculate the relations, then based on relations to select the template
                """
                temporal_relation = self.relation_allen_time_range(
                    time_range_a=[
                        first_event_start_time_dt,
                        first_event_end_time_dt,
                    ],
                    time_range_b=[
                        second_event_start_time_dt,
                        second_event_end_time_dt,
                    ],
                )
                temporal_relation_semantic = temporal_relation.get("semantic")
                question_draft["temporal_relation"] = temporal_relation["relation"]
                random_pick_template = random.choice(
                    this_type_templates[temporal_relation_semantic]
                )

                question_draft["question"] = random_pick_template.format(
                    first_event_subject=first_event_subject,
                    first_event_predicate=first_event_predicate,
                    first_event_object=first_event_object,
                    temporal_relation=temporal_relation,
                    second_event_subject=second_event_subject,
                    second_event_predicate=second_event_predicate,
                    second_event_object=second_event_object,
                )
                # this will generate the basic temporal relation questions.
                # TODO: we also need to generate the one duration_before, duration_after
                # If relation is before or after, then we can generate the duration_before, duration_after
                # Add it to variable medium_type_1_b_questions
                if temporal_relation_semantic in ["before", "after"]:
                    random_pick_template = random.choice(
                        this_type_templates[
                            f"duration_{temporal_relation_semantic}"
                        ]
                    )
                    # get the duration year
                    # Example: 3 years before Bush as the president of US, who is the president of China?
                    # The duration is calculated based on first_end_time - second_start time
                    # NOTE: It can be extended further later to calculate first_start - second_start
                    duration = self.relation_duration_calculation(
                        time_range_a=[
                            first_event_start_time_dt,
                            first_event_end_time_dt,
                        ],
                        time_range_b=[
                            second_event_start_time_dt,
                            second_event_end_time_dt,
                        ],
                        temporal_operator=f"duration_{temporal_relation_semantic}",
                    )
                    # copy a new question draft
                    duration_question_draft = copy.deepcopy(question_draft)
                    duration_question_draft["question"] = (
                        random_pick_template.format(
                            first_event_subject=first_event_subject,
                            first_event_predicate=first_event_predicate,
                            first_event_object=first_event_object,
                            temporal_relation=f"{duration} {temporal_relation}",
                            second_event_subject=second_event_subject,
                            second_event_predicate=second_event_predicate,
                            second_event_object=second_event_object,
                        )
                    )
                    duration_question_draft["temporal_relation"] = (
                        f"duration_{temporal_relation_semantic}"
                    )
                    medium_type_1_b_questions.append(duration_question_draft)
            else:
                """
                Handle in theory four types of questions here
                """
                if (
                    question_draft["answer_type"]
                    == "relation_union_or_intersection"
                ):
                    temporal_relation = question_draft["temporal_relation"]
                    random_pick_template = random.choice(
                        this_type_templates[temporal_relation]
                    )
                    temporal_answer = self.relation_union_or_intersection(
                        time_ranges=[
                            (first_event_start_time_dt, first_event_end_time_dt),
                            (second_event_start_time_dt, second_event_end_time_dt),
                        ],
                        temporal_operator=temporal_relation,
                    )

                    question_draft["question"] = random_pick_template.format(
                        first_event_subject=first_event_subject,
                        first_event_predicate=first_event_predicate,
                        first_event_object=first_event_object,
                        second_event_subject=second_event_subject,
                        second_event_predicate=second_event_predicate,
                        second_event_object=second_event_object,
                    )
                    if temporal_answer is None:
                        temporal_answer = "No Answer"
                    question_draft["answer"] = temporal_answer
                elif question_draft["answer_type"] == "relation_allen":
                    temporal_allen_relation = self.relation_allen_time_range(
                        time_range_a=[
                            first_event_start_time_dt,
                            first_event_end_time_dt,
                        ],
                        time_range_b=[
                            second_event_start_time_dt,
                            second_event_end_time_dt,
                        ],
                    )
                    question_draft["temporal_relation"] = temporal_allen_relation[
                        "relation"
                    ]
                    # random select from [choices, true_false]
                    question_format = random.choice(["choice", "true_false"])
                    if question_format == "choice":
                        random_pick_template = random.choice(
                            this_type_templates["choice"]
                        )
                        temporal_answer = temporal_allen_relation["relation"]
                        question_draft["question"] = random_pick_template.format(
                            first_event_subject=first_event_subject,
                            first_event_predicate=first_event_predicate,
                            first_event_object=first_event_object,
                            second_event_subject=second_event_subject,
                            second_event_predicate=second_event_predicate,
                            second_event_object=second_event_object,
                        )
                        question_draft["answer"] = temporal_answer

                    else:
                        random_pick_template = random.choice(
                            this_type_templates["true_false"]
                        )
                        random_yes_no_answer = random.choice(["True", "False"])
                        if random_yes_no_answer == "True":
                            temporal_relation = temporal_allen_relation["relation"]
                        else:
                            temporal_relation = random.choice(
                                list(
                                    set(self.allen_relations)
                                    - {temporal_allen_relation["relation"]}
                                )
                            )
                        question_draft["question"] = random_pick_template.format(
                            first_event_subject=first_event_subject,
                            first_event_predicate=first_event_predicate,
                            first_event_object=first_event_object,
                            second_event_subject=second_event_subject,
                            second_event_predicate=second_event_predicate,
                            second_event_object=second_event_object,
                            temporal_relation=temporal_relation,
                        )
                        question_draft["answer"] = random_yes_no_answer
                elif question_draft["answer_type"] == "relation_duration":
                    """There are four types in this category
                    - duration => which is the intersection of the two time range
                    - duration_compare => longer shorter equal
                    - sum => total duration of the two time range, which is actually the union
                    - average => average duration of the two time range
                    """
                    temporal_relation = random.choice(
                        [
                            "duration",
                            "duration_compare",
                            "sum",
                            "average",
                        ]
                    )
                    random_pick_template = random.choice(
                        this_type_templates[temporal_relation]
                    )
                    question_draft["temporal_relation"] = temporal_relation
                    if temporal_relation == "duration":
                        temporal_answer = self.relation_union_or_intersection(
                            time_ranges=[
                                (
                                    first_event_start_time_dt,
                                    first_event_end_time_dt,
                                ),
                                (
                                    second_event_start_time_dt,
                                    second_event_end_time_dt,
                                ),
                            ],
                            temporal_operator="intersection",
                        )
                        question_draft["question"] = random_pick_template.format(
                            first_event_subject=first_event_subject,
                            first_event_predicate=first_event_predicate,
                            first_event_object=first_event_object,
                            second_event_subject=second_event_subject,
                            second_event_predicate=second_event_predicate,
                            second_event_object=second_event_object,
                        )
                    elif temporal_relation == "duration_compare":
                        temporal_relation_duration = (
                            self.relation_allen_time_duration(
                                time_range_a=[
                                    first_event_start_time_dt,
                                    first_event_end_time_dt,
                                ],
                                time_range_b=[
                                    second_event_start_time_dt,
                                    second_event_end_time_dt,
                                ],
                            )
                        )
                        temporal_answer = temporal_relation_duration["semantic"]
                        question_draft["question"] = random_pick_template.format(
                            first_event_subject=first_event_subject,
                            first_event_predicate=first_event_predicate,
                            first_event_object=first_event_object,
                            temporal_relation=temporal_answer,
                            second_event_subject=second_event_subject,
                            second_event_predicate=second_event_predicate,
                            second_event_object=second_event_object,
                        )
                    elif temporal_relation == "sum":
                        temporal_answer = self.util_average_duration_calculation(
                            time_ranges=[
                                [
                                    first_event_start_time_dt,
                                    first_event_end_time_dt,
                                ],
                                [
                                    second_event_start_time_dt,
                                    second_event_end_time_dt,
                                ],
                            ],
                            temporal_operator="sum",
                        )
                        question_draft["question"] = random_pick_template.format(
                            first_event_subject=first_event_subject,
                            first_event_predicate=first_event_predicate,
                            first_event_object=first_event_object,
                            second_event_subject=second_event_subject,
                            second_event_predicate=second_event_predicate,
                            second_event_object=second_event_object,
                        )
                    elif temporal_relation == "average":
                        temporal_answer = self.util_average_duration_calculation(
                            time_ranges=[
                                [
                                    first_event_start_time_dt,
                                    first_event_end_time_dt,
                                ],
                                [
                                    second_event_start_time_dt,
                                    second_event_end_time_dt,
                                ],
                            ],
                            temporal_operator="average",
                        )
                        question_draft["question"] = random_pick_template.format(
                            first_event_subject=first_event_subject,
                            first_event_predicate=first_event_predicate,
                            first_event_object=first_event_object,
                            second_event_subject=second_event_subject,
                            second_event_predicate=second_event_predicate,
                            second_event_object=second_event_object,
                        )
                    question_draft["answer"] = temporal_answer
                    question_draft["temporal_relation"] = temporal_relation

    questions += medium_type_1_b_questions
    if paraphrased:
        for question_obj in questions:
            paraphrased_question = paraphrase_medium_question(
                question=question_obj["question"],
            )
            logger.info(f"paraphrased_question: {paraphrased_question}")
            question_obj["paraphrased_question"] = paraphrased_question

    return questions

`relation_allen_time_duration(time_range_a, time_range_b)` `staticmethod`

Parameters:

Name	Type	Description	Default
`time_range_a`	`list[datetime, datetime]`	The first time range	required
`time_range_b`	`list[datetime, datetime]`	The second time range	required

Returns:

Name	Type	Description
`dict`	`dict`	The allen temporal relation between the two time ranges

Source code in TimelineKGQA/generator.py

@staticmethod
def relation_allen_time_duration(time_range_a: list, time_range_b: list) -> dict:
    """

    Args:
        time_range_a (list[datetime, datetime]): The first time range
        time_range_b (list[datetime, datetime]): The second time range

    Returns:
        dict: The allen temporal relation between the two time ranges
    """
    duration_a = abs(time_range_a[1] - time_range_a[0])
    duration_b = abs(time_range_b[1] - time_range_b[0])
    if duration_a < duration_b:
        return {
            "relation": "X < Y",
            "description": "X is shorter Y",
            "category": "td",
            "code": "td-1",
            "semantic": "shorter",
        }
    elif duration_a == duration_b:
        return {
            "relation": "X = Y",
            "description": "X equals Y",
            "category": "td",
            "code": "td-2",
            "semantic": "equals",
        }
    else:
        return {
            "relation": "X > Y",
            "description": "X is longer Y",
            "category": "td",
            "code": "td-3",
            "semantic": "longer",
        }

`relation_allen_time_range(time_range_a, time_range_b)` `staticmethod`

This function will return the allen temporal relation between two time ranges

We have the 26 possible relations - 13 for time range operation - 10 for time point and time range operation - 3 for time point operation

We will need to extract them from the quantitatively time range

We will have "beginning of time" or "end of time" to represent the infinite time range We will need to convert it to a numerical value in np.inf Others will be converted to a numerical value in the timestamp

Parameters:

Name	Type	Description	Default
`time_range_a`	`list[datetime, datetime]`	The first time range	required
`time_range_b`	`list[datetime, datetime]`	The second time range	required

Returns:

Name	Type	Description
`dict`	`dict`	The allen temporal relation between the two time ranges

Source code in TimelineKGQA/generator.py

@staticmethod
def relation_allen_time_range(time_range_a: list, time_range_b: list) -> dict:
    """
    This function will return the allen temporal relation between two time ranges

    We have the 26 possible relations
    - 13 for time range operation
    - 10 for time point and time range operation
    - 3 for time point operation

    We will need to extract them from the quantitatively time range

    We will have "beginning of time" or "end of time" to represent the infinite time range
    We will need to convert it to a numerical value in np.inf
    Others will be converted to a numerical value in the timestamp

    Args:
        time_range_a (list[datetime, datetime]): The first time range
        time_range_b (list[datetime, datetime]): The second time range

    Returns:
        dict: The allen temporal relation between the two time ranges
    """
    start_time1, end_time1 = time_range_a
    start_time2, end_time2 = time_range_b

    logger.debug(
        f"start_time1: {start_time1}, end_time1: {end_time1}, start_time2: {start_time2}, end_time2: {end_time2}"
    )

    # 13 for time range operation
    time_range_a_datetime = [start_time1, end_time1]
    time_range_b_datetime = [start_time2, end_time2]
    # then we will do the operation for the time range, get the allen temporal relation
    """
    x_start <= x_end
    y_start <= y_end
    allen_operator = [
        x_start - x_end,  # 0, or -1, which means a is a point or a range
        y_start - y_end, # 0, or -1
        x_start - y_start,
        x_start - y_end,
        x_end - y_start,
        x_end - y_end,
    ]

    After this do a operation for the allen_operator, if value = 0, keep it, < 0, set it to -1, > 0, set it to 1

    Then we will have:
    13 for time range operation, which means x_start < x_end, y_start < y_end
        - X <  Y => [-1, -1, -1, -1, -1, -1]
        - X m  Y => [-1, -1, -1, -1,  0, -1]
        - X o  Y => [-1, -1, -1, -1,  1, -1]
        - X fi Y => [-1, -1, -1, -1,  1,  0]
        - X di Y => [-1, -1, -1, -1,  1,  1]
        - X s  Y => [-1, -1,  0, -1,  1, -1]
        - X =  Y => [-1, -1,  0, -1,  1,  0]
        - X si Y => [-1, -1,  0, -1,  1,  1]
        - X d  Y => [-1, -1,  1, -1,  1, -1]
        - X f  Y => [-1, -1,  1, -1,  1,  0]
        - X oi Y => [-1, -1,  1, -1,  1,  1]
        - X mi Y => [-1, -1,  1,  0,  1,  1]
        - X >  Y => [-1, -1,  1,  1,  1,  1]

    10 for time point and time range operation
    Among the 10, 5 for X is a point, 5 for Y is a point
    5 for X is a point, Y is a range, which means x_start = x_end, y_start < y_end
        - X <  Y => [0, -1, -1, -1, -1, -1]
        - X s  Y => [0, -1,  0, -1,  0, -1]
        - X d  Y => [0, -1,  1, -1,  1, -1]
        - X f  Y => [0, -1,  1,  0,  1,  0]
        - X >  Y => [0, -1,  1,  1,  1,  1]
    5 for X is a range, Y is a point, which means x_start < x_end, y_start = y_end
        - X <  Y => [-1, 0, -1, -1, -1, -1]
        - X fi Y => [-1, 0, -1，-1,  0,  0]
        - X di Y => [-1, 0, -1, -1,  1,  1]
        - X si Y => [-1, 0,  0,  0,  1,  1]
        - X >  Y => [-1, 0,  1,  1,  1,  1]

    3 for time point operation, which means x_start = x_end, y_start = y_end
        - X < Y => [0, 0, -1, -1, -1, -1]
        - X = Y => [0, 0,  0,  0,  0,  0]
        - X > Y => [0, 0,  1,  1,  1,  1]
    """

    allen_operator = [
        time_range_a_datetime[0] - time_range_a_datetime[1],
        time_range_b_datetime[0] - time_range_b_datetime[1],
        time_range_a_datetime[0] - time_range_b_datetime[0],
        time_range_a_datetime[0] - time_range_b_datetime[1],
        time_range_a_datetime[1] - time_range_b_datetime[0],
        time_range_a_datetime[1] - time_range_b_datetime[1],
    ]

    # do the operation for the allen_operator
    for index, value in enumerate(allen_operator):
        if value == 0:
            allen_operator[index] = 0
        elif value < 0:
            allen_operator[index] = -1
        else:
            allen_operator[index] = 1

    # logger.critical(f"allen_operator: {allen_operator}")
    # get it to be a tuple
    allen_operator = tuple(allen_operator)
    logger.debug(f"allen_operator: {allen_operator}")
    try:
        logger.debug(f"ALLEN_OPERATOR_DICT: {ALLEN_OPERATOR_DICT[allen_operator]}")
        return ALLEN_OPERATOR_DICT[allen_operator]
    except KeyError:
        logger.info(f"allen_operator: {allen_operator}")
        logger.info(f"time_range_a: {time_range_a}")
        logger.info(f"time_range_b: {time_range_b}")
        raise ValueError("The allen operator is not found")

`relation_duration(time_ranges, agg_temporal_operator=None)` `staticmethod`

For the time range, it will do the rank operation, sort it

First calculate the duration of the time range, then do the rank operation based on the duration

Parameters:

Name	Type	Description	Default
`time_ranges`	`list`	The list of time ranges	required
`agg_temporal_operator`	`str`	The aggregation temporal operator	`None`

Returns:

Name	Type	Description
`list`		the list of sorted index for the time range

Example:

time_ranges = [
    (datetime(2023, 5, 1, 12, 0), datetime(2023, 5, 1, 15, 0)),  # 3 hours
    (datetime(2023, 5, 1, 9, 30), datetime(2023, 5, 1, 14, 0)),  # 4.5 hours
    (datetime(2023, 5, 1, 8, 0), datetime(2023, 5, 1, 11, 30)),  # 3.5 hours
    (datetime(2023, 5, 2, 9, 30), datetime(2023, 5, 2, 12, 0)),  # 2.5 hours
    (datetime(2023, 5, 1, 10, 30), datetime(2023, 5, 1, 13, 0))  # 2.5 hours
]

The output will be:

[2, 4, 3, 0, 1]

Source code in TimelineKGQA/generator.py

@staticmethod
def relation_duration(
    time_ranges: list[[datetime, datetime]], agg_temporal_operator: str = None
):
    """
    For the time range, it will do the rank operation, sort it

    First calculate the duration of the time range, then do the rank operation based on the duration

    Args:
        time_ranges (list): The list of time ranges
        agg_temporal_operator (str): The aggregation temporal operator

    Returns:
        list: the list of sorted index for the time range


    Example:
    ```
    time_ranges = [
        (datetime(2023, 5, 1, 12, 0), datetime(2023, 5, 1, 15, 0)),  # 3 hours
        (datetime(2023, 5, 1, 9, 30), datetime(2023, 5, 1, 14, 0)),  # 4.5 hours
        (datetime(2023, 5, 1, 8, 0), datetime(2023, 5, 1, 11, 30)),  # 3.5 hours
        (datetime(2023, 5, 2, 9, 30), datetime(2023, 5, 2, 12, 0)),  # 2.5 hours
        (datetime(2023, 5, 1, 10, 30), datetime(2023, 5, 1, 13, 0))  # 2.5 hours
    ]
    ```

    The output will be:
    ```
    [2, 4, 3, 0, 1]
    ```
    """
    # Create a list of indices paired with time ranges
    if agg_temporal_operator == "ranking":
        indexed_time_ranges = list(enumerate(time_ranges))

        indexed_time_ranges.sort(key=lambda x: abs(x[1][1] - x[1][0]))
        rank_by_index = [0] * len(time_ranges)  # Pre-initialize a list of zeros
        for index, (original_index, _) in enumerate(indexed_time_ranges):
            rank_by_index[original_index] = index
        return rank_by_index
    if agg_temporal_operator == "sum":
        # total value of the time range
        durations = [
            abs(time_range[1] - time_range[0]) for time_range in time_ranges
        ]
        return sum(durations)
    if agg_temporal_operator == "average":
        # average value of the time range
        durations = [
            abs(time_range[1] - time_range[0]) for time_range in time_ranges
        ]
        return sum(durations) / len(durations)
    raise ValueError(
        "Unsupported aggregation temporal operator. Please use 'ranking', 'sum' or 'average'."
    )

`relation_duration_calculation(time_range_a, time_range_b, temporal_operator=None)` `staticmethod`

We will calculate the time difference between two time ranges

However, there are several combination we can do here

duration_before => abs(time_range_a[1] - time_range_b[0])
duration_after => abs(time_range_b[1] - time_range_a[0]) We also have other combinations, but we will not consider them here

Parameters:

Name	Type	Description	Default
`time_range_a`	`list[datetime, datetime]`	The first time range	required
`time_range_b`	`list[datetime, datetime]`	The second time range	required
`temporal_operator`	`str`	The temporal operator	`None`

Returns:

Name	Type	Description
`timedelta`	`Optional[timedelta]`	The time difference between two time ranges

Source code in TimelineKGQA/generator.py

@staticmethod
def relation_duration_calculation(
    time_range_a: list,
    time_range_b: list,
    temporal_operator: str = None,
) -> Optional[timedelta]:
    """
    We will calculate the time difference between two time ranges

    However, there are several combination we can do here

    - duration_before => abs(time_range_a[1] - time_range_b[0])
    - duration_after => abs(time_range_b[1] - time_range_a[0])
    We also have other combinations, but we will not consider them here

    Args:
        time_range_a (list[datetime, datetime]): The first time range
        time_range_b (list[datetime, datetime]): The second time range
        temporal_operator (str): The temporal operator

    Returns:
        timedelta: The time difference between two time ranges

    """
    if temporal_operator is None or temporal_operator not in [
        "duration_before",
        "duration_after",
    ]:
        raise ValueError(
            "temporal_operator should be one of the following: duration_before, duration_after"
        )
    if temporal_operator == "duration_before":
        return abs(time_range_a[1] - time_range_b[0])
    if temporal_operator == "duration_after":
        return abs(time_range_b[1] - time_range_a[0])
    return None

`relation_ordinal_time_range(time_ranges, agg_temporal_operator=None)` `staticmethod`

For the time range, it will do the rank operation, sort it

Aggregation operator can be: - ranking(min, max) - ranking_start - ranking_end

Parameters:

Name	Type	Description	Default
`time_ranges`	`list`	The list of time ranges	required
`agg_temporal_operator`	`str`	The aggregation temporal operator	`None`

Returns:

Name	Type	Description
`list`	`list`	the list of sorted index for the time range

For example, we have the time range:

time_ranges = [
    (datetime(2023, 5, 1, 12, 0), datetime(2023, 5, 1, 15, 0)),  # 3 hours
    (datetime(2023, 5, 1, 9, 30), datetime(2023, 5, 1, 14, 0)),  # 4.5 hours
    (datetime(2023, 5, 1, 8, 0), datetime(2023, 5, 1, 11, 30)),  # 3.5 hours
    (datetime(2023, 5, 2, 9, 30), datetime(2023, 5, 2, 12, 0)),  # 2.5 hours
    (datetime(2023, 5, 1, 10, 30), datetime(2023, 5, 1, 13, 0))  # 2.5 hours
]

result_start = TKGQAGenerator.aggregate_tr_temporal_operator(time_ranges, "ranking_start")
[3,1,0,4,2]

result_end = TKGQAGenerator.aggregate_tr_temporal_operator(time_ranges, "ranking_end")
[2,4,3,0,1]

Source code in TimelineKGQA/generator.py

@staticmethod
def relation_ordinal_time_range(
    time_ranges: list[[datetime, datetime]], agg_temporal_operator: str = None
) -> list:
    """
    For the time range, it will do the rank operation, sort it

    Aggregation operator can be:
    - ranking(min, max)
        - ranking_start
        - ranking_end

    Args:
        time_ranges (list): The list of time ranges
        agg_temporal_operator (str): The aggregation temporal operator

    Returns:
        list: the list of sorted index for the time range

    For example, we have the time range:

    ```
    time_ranges = [
        (datetime(2023, 5, 1, 12, 0), datetime(2023, 5, 1, 15, 0)),  # 3 hours
        (datetime(2023, 5, 1, 9, 30), datetime(2023, 5, 1, 14, 0)),  # 4.5 hours
        (datetime(2023, 5, 1, 8, 0), datetime(2023, 5, 1, 11, 30)),  # 3.5 hours
        (datetime(2023, 5, 2, 9, 30), datetime(2023, 5, 2, 12, 0)),  # 2.5 hours
        (datetime(2023, 5, 1, 10, 30), datetime(2023, 5, 1, 13, 0))  # 2.5 hours
    ]

    result_start = TKGQAGenerator.aggregate_tr_temporal_operator(time_ranges, "ranking_start")
    [3,1,0,4,2]

    result_end = TKGQAGenerator.aggregate_tr_temporal_operator(time_ranges, "ranking_end")
    [2,4,3,0,1]
    ```
    """

    # Create a list of indices paired with time ranges
    indexed_time_ranges = list(enumerate(time_ranges))

    if agg_temporal_operator == "rank_start_time":
        # Sort by start time, but maintain original indices
        indexed_time_ranges.sort(key=lambda x: x[1][0])
    elif agg_temporal_operator == "rank_end_time":
        # Sort by end time, but maintain original indices
        indexed_time_ranges.sort(key=lambda x: x[1][1])
    else:
        raise ValueError(
            "Unsupported aggregation temporal operator. Please use 'rank_start_time' or 'rank_end_time'."
        )

    # After sorting, create a new list that maps the original index to its new rank
    rank_by_index = [0] * len(time_ranges)  # Pre-initialize a list of zeros
    for rank, (original_index, _) in enumerate(indexed_time_ranges):
        rank_by_index[original_index] = rank

    return rank_by_index

`relation_union_or_intersection(time_ranges, temporal_operator='intersection')` `staticmethod`

This function will return the temporal operator between multiple time ranges The temporal operator can be: - 'intersection' - 'union'

Parameters:

Name	Type	Description	Default
`time_ranges`	`List[Tuple[datetime, datetime]]`	A list of time ranges	required
`temporal_operator`	`str`	The temporal operator	`'intersection'`

Returns:

Name	Type	Description
`str`	`Optional[str]`	A string representation of the new time range, or None if no valid range exists.

Source code in TimelineKGQA/generator.py

@staticmethod
def relation_union_or_intersection(
    time_ranges: List[Tuple[np.datetime64, np.datetime64]],
    temporal_operator: str = "intersection",
) -> Optional[str]:
    """
    This function will return the temporal operator between multiple time ranges
    The temporal operator can be:
        - 'intersection'
        - 'union'

    Args:
        time_ranges (List[Tuple[datetime, datetime]]): A list of time ranges
        temporal_operator (str): The temporal operator

    Returns:
        str: A string representation of the new time range, or None if no valid range exists.

    """
    if temporal_operator not in ["intersection", "union"]:
        raise ValueError(
            "temporal_operator should be either 'intersection' or 'union'"
        )

    if not time_ranges:
        return None

    # Start with the first time range
    result = time_ranges[0]

    for current in time_ranges[1:]:
        if temporal_operator == "intersection":
            # Find the latest start time and earliest end time
            start = max(result[0], current[0])
            end = min(result[1], current[1])
            if start >= end:
                return None  # No intersection
            result = (start, end)
        elif temporal_operator == "union":
            # Find the earliest start time and latest end time
            start = min(result[0], current[0])
            end = max(result[1], current[1])
            # Check if there is a gap between the ranges
            if result[1] < current[0] or current[1] < result[0]:
                return None  # No continuous union possible
            result = (start, end)

    return f"({result[0]}, {result[1]})"

`sampling_events(sample_percentage=0.1, sample_strategy='temporal_close')`

This function will sample the events from the list

Parameters:

Name	Type	Description	Default
`sample_percentage`	`float`	The sample percentage	`0.1`
`sample_strategy`	`str`	The sample strategy, can be random, temporal_close, degree_high, both	`'temporal_close'`

Returns:

Type	Description
	List[Tuple]: The list Tuples (event1_id, event2_id, event3_id)

Source code in TimelineKGQA/generator.py

def sampling_events(
    self,
    sample_percentage: Union[float, dict, int] = 0.1,
    sample_strategy: str = "temporal_close",
):
    """
    This function will sample the events from the list

    Args:
        sample_percentage (float): The sample percentage
        sample_strategy (str): The sample strategy, can be random, temporal_close, degree_high, both

    Returns:
        List[Tuple]: The list Tuples (event1_id, event2_id, event3_id)

    """

    if sample_strategy not in ("random", "temporal_close", "degree_high", "both"):
        raise ValueError(
            "sample_strategy should be random, temporal_close, degree_high, or both"
        )

    """
    Do matrix sampling based on the element value
    If the dimension is 2, we will have a nxn matrix
        - value of the matrix is 1 for random
        - value will be calculated based on the temporal information if it is temporal_close
        - value will be calculated based on the degree information if it is degree_high
        - value will be calculated based on both temporal and degree information if it is both

    And then use the value as weight to do the sampling over the matrix
    """

    num_events = len(self.events_df)
    # for dimension 1 generate, first construct a matrix with len(event_df), using numpy,
    # and it is always sampling randomly
    if sample_strategy == "degree_high" or sample_strategy == "both":
        degree_scores = self.calculate_degree_scores(self.events_df)
    else:
        degree_scores = None

    with timer(the_logger=logger, message="Generating the matrix D1"):
        dimension_1_matrix = np.ones(num_events)

        # make sure every element in the matrix sum to 1
        dimension_1_matrix = dimension_1_matrix / dimension_1_matrix.sum()

    with timer(the_logger=logger, message="Generating the matrix D2"):
        # for dimension 2 generate, first construct a matrix with len(event_df), using numpy
        dimension_2_matrix = np.zeros((num_events, num_events))

        if sample_strategy == "random":
            dimension_2_matrix = np.ones((num_events, num_events))
        elif sample_strategy == "temporal_close":
            start_times = self.events_df["start_time"].values
            end_times = self.events_df["end_time"].values

            for x in range(num_events):
                for y in range(x + 1, num_events):  # y > x to avoid redundancy
                    score = self.temporal_close_score(
                        time_ranges=[
                            [start_times[x], end_times[x]],
                            [start_times[y], end_times[y]],
                        ]
                    )
                    dimension_2_matrix[x, y] = score
                    dimension_2_matrix[y, x] = score  # Leverage symmetry

        elif sample_strategy == "degree_high":

            for x in range(num_events):
                for y in range(x + 1, num_events):
                    score = degree_scores[x] = degree_scores[y]
                    dimension_2_matrix[x, y] = score
                    dimension_2_matrix[y, x] = score  # Leverage symmetry

        elif sample_strategy == "both":
            # park here
            start_times = self.events_df["start_time"].values
            end_times = self.events_df["end_time"].values
            for x in tqdm(range(num_events), desc="Processing events"):
                for y in range(x + 1, num_events):
                    degree_score = degree_scores[x] + degree_scores[y]
                    temporal_score = self.temporal_close_score(
                        time_ranges=[
                            [start_times[x], end_times[x]],
                            [start_times[y], end_times[y]],
                        ]
                    )
                    score = degree_score + temporal_score
                    dimension_2_matrix[x, y] = score
                    dimension_2_matrix[y, x] = score

        # make sure every element in the matrix sum to 1
        dimension_2_matrix = dimension_2_matrix / dimension_2_matrix.sum()

    with timer(the_logger=logger, message="Generating the matrix D3"):
        # for dimension 3 generate, first construct a matrix with len(event_df), using numpy
        dimension_3_matrix = np.zeros((num_events, num_events, num_events))
        if sample_strategy == "random":
            dimension_3_matrix = np.ones((num_events, num_events, num_events))
        elif sample_strategy == "temporal_close":
            start_times = self.events_df["start_time"].values
            end_times = self.events_df["end_time"].values

            for x in tqdm(range(num_events), desc="Processing 3D events"):
                for y in range(x + 1, num_events):  # y > x to avoid redundancy
                    for z in range(y + 1, num_events):  # z > y to avoid redundancy
                        score = self.temporal_close_score(
                            time_ranges=[
                                [start_times[x], end_times[x]],
                                [start_times[y], end_times[y]],
                                [start_times[z], end_times[z]],
                            ]
                        )
                        dimension_3_matrix[x, y, z] = score
                        dimension_3_matrix[x, z, y] = score
                        dimension_3_matrix[y, x, z] = score
                        dimension_3_matrix[y, z, x] = score
                        dimension_3_matrix[z, x, y] = score
                        dimension_3_matrix[z, y, x] = score
        elif sample_strategy == "degree_high":
            for x in range(num_events):
                for y in range(x + 1, num_events):
                    for z in range(y + 1, num_events):
                        score = (
                            degree_scores[x] + degree_scores[y] + degree_scores[z]
                        )
                        dimension_3_matrix[x, y, z] = score
                        dimension_3_matrix[x, z, y] = score
                        dimension_3_matrix[y, x, z] = score
                        dimension_3_matrix[y, z, x] = score
                        dimension_3_matrix[z, x, y] = score
                        dimension_3_matrix[z, y, x] = score
        elif sample_strategy == "both":
            # park here
            start_times = self.events_df["start_time"].values
            end_times = self.events_df["end_time"].values
            for x in tqdm(range(num_events), desc="Processing 3D events"):
                for y in range(x + 1, num_events):
                    for z in range(y + 1, num_events):
                        degree_score = (
                            degree_scores[x] + degree_scores[y] + degree_scores[z]
                        )
                        temporal_score = self.temporal_close_score(
                            time_ranges=[
                                [start_times[x], end_times[x]],
                                [start_times[y], end_times[y]],
                                [start_times[z], end_times[z]],
                            ]
                        )
                        score = degree_score + temporal_score
                        dimension_3_matrix[x, y, z] = score
                        dimension_3_matrix[x, z, y] = score
                        dimension_3_matrix[y, x, z] = score
                        dimension_3_matrix[y, z, x] = score
                        dimension_3_matrix[z, x, y] = score
                        dimension_3_matrix[z, y, x] = score

        # make sure every element in the matrix sum to 1
        dimension_3_matrix = dimension_3_matrix / dimension_3_matrix.sum()
    # do the sampling based on the matrix
    # if sampling is a float value, then it means all three dimensions will be sampled based on the rate
    # if samping is an int value, then it means all three dimension will have that many questions
    # if sampling is a dict value, then it means the sampling rate for each dimension

    with timer(the_logger=logger, message="Sampling the events"):
        if isinstance(sample_percentage, float):
            # sample based on the rate, and the value (weight) is the matrix value
            dimension_1_samples = np.random.choice(
                num_events,
                int(num_events * sample_percentage),
                p=dimension_1_matrix,
            )
            # sample it from dimension 2 matrix
            dimension_2_samples = self.random_selection(
                dimension_2_matrix,
                int(dimension_2_matrix.size * sample_percentage),
            )
            # sample it from dimension 3 matrix
            dimension_3_samples = self.random_selection(
                dimension_3_matrix,
                int(dimension_3_matrix.size * sample_percentage),
            )

        elif isinstance(sample_percentage, int):
            dimension_1_samples = np.random.choice(
                num_events, sample_percentage, p=dimension_1_matrix
            )
            dimension_2_samples = self.random_selection(
                dimension_2_matrix,
                sample_percentage,
            )
            dimension_3_samples = self.random_selection(
                dimension_3_matrix,
                sample_percentage,
            )

        elif isinstance(sample_percentage, dict):
            dimension_1_samples = sample_percentage.get("dimension_1", 0)
            dimension_2_samples = sample_percentage.get("dimension_2", 0)
            dimension_3_samples = sample_percentage.get("dimension_3", 0)
            if (
                dimension_1_samples == 0
                or dimension_2_samples == 0
                or dimension_3_samples == 0
            ):
                raise ValueError(
                    "The sample_percentage should have all three dimensions"
                )
            # if all types of
            # dimension_1_sample_percentage,
            # dimension_2_sample_percentage,
            # dimension_3_sample_percentage are float
            # then we will sample based on the rate
            if all(
                isinstance(i, float)
                for i in [
                    dimension_1_samples,
                    dimension_2_samples,
                    dimension_3_samples,
                ]
            ):
                dimension_1_samples = np.random.choice(
                    len(self.events_df),
                    int(len(self.events_df) * dimension_1_samples),
                    p=dimension_1_matrix,
                )
                dimension_2_samples = self.random_selection(
                    dimension_2_matrix,
                    int(dimension_2_matrix.size * dimension_2_samples),
                )
                dimension_3_samples = self.random_selection(
                    dimension_3_matrix,
                    int(dimension_3_matrix.size * dimension_3_samples),
                )
            # if all types of
            # dimension_1_sample_percentage,
            # dimension_2_sample_percentage,
            # dimension_3_sample_percentage are int
            # then we will sample based on the number
            elif all(
                isinstance(i, int)
                for i in [
                    dimension_1_samples,
                    dimension_2_samples,
                    dimension_3_samples,
                ]
            ):
                dimension_1_samples = np.random.choice(
                    len(self.events_df), dimension_1_samples, p=dimension_1_matrix
                )
                dimension_2_samples = self.random_selection(
                    dimension_2_matrix,
                    dimension_2_samples,
                )
                dimension_3_samples = self.random_selection(
                    dimension_3_matrix,
                    dimension_3_samples,
                )
            else:
                raise ValueError(
                    "The sample_percentage should have all three dimensions"
                )
        else:
            raise ValueError(
                "The sample_percentage should be either float, int, or dict"
            )

    logger.info(len(dimension_1_samples))
    logger.info(len(dimension_2_samples))
    logger.info(len(dimension_3_samples))

    """
    Examples of the output:

    ```
    [  0  10  20  30  40  50  60  70  80  90 100]
    [(0, 10), (20, 30), (40, 50), (60, 70), (80, 90), (100, 110)]
    [(0, 10, 20), (30, 40, 50), (60, 70, 80), (90, 100, 110)]
    ```
    """
    self.sample_simple_events = dimension_1_samples
    self.sample_medium_events = dimension_2_samples
    self.sample_complex_events = dimension_3_samples
    return dimension_1_samples, dimension_2_samples, dimension_3_samples

`simple_question_generation()`

Types of Questions

This is used to generate the simple question, we will have two types of questions.

For each type of questions, based on the answer or the question focus, we can further divide them into - Timeline Position Retrival - Start TimePoint - End TimePoint - Time Range - Duration - Temporal Constrained Retrieval (Ignore predicate for now) - Subject - Object

Simple: Timeline and One Event Involved - Timeline Position Retrival: When Bush starts his term as president of US? - General Information Retrieval => Timeline Position Retrival => Answer the question - Question Focus can be: Timestamp Start, Timestamp End, Duration, Timestamp Start and End - Temporal Constrained Retrieval: In 2009, who is the president of US? - General Information Retrieval => Temporal Constraint Retrieval => Answer the question - Question Focus can be: Subject, Object, Predicate. Can be more complex if we want mask out more elements

Templates

To generate the questions, We can try to feed into the LLM, and generate the questions. However, the diversity of the questions is not guaranteed, so we can use the template to generate the questions. Then use LLM to paraphrase the questions.

Template examples: - Timeline Position Retrival - Start TimePoint: When did {subject} start the term as {object}? - End TimePoint: When did {subject} end the term as {object}? - Time Range: When did {subject} serve as {object}? - Duration: How long did {subject} serve as {object}? - Temporal Constrained Retrieval - Subject: - Who is affiliated to {subject} from {timestamp start} to {timestamp end}? - Who is affiliated to {subject} in {timestamp}? - Object: - {subject} is affiliated to which organisation from {timestamp start} to {timestamp end}? - {subject} is affiliated to which organisation during {temporal representation}?

Process

Extract {subject}, {predicate}, {object}, {start_time}, {end_time} from the unified graph
Generate the questions based on the template for each type
Use LLM to paraphrase the questions

Output format will be: - {question} - {answer} - {paraphrased_question} - subject, predicate, object, start_time, end_time - {question_level} => Simple - {question_type} => Timeline Position Retrival, Temporal Constrained Retrieval - {answer_type} => Subject, Object | Timestamp Start, Timestamp End, Duration, Timestamp Start and End

Source code in TimelineKGQA/generator.py

def simple_question_generation(self):
    """
    ## Types of Questions
    This is used to generate the simple question, we will have two types of questions.

    For each type of questions, based on the answer or the question focus, we can further divide them into
    - Timeline Position Retrival
        - Start TimePoint
        - End TimePoint
        - Time Range
        - Duration
    - Temporal Constrained Retrieval (Ignore predicate for now)
        - Subject
        - Object

    Simple: Timeline and One Event Involved
    - Timeline Position Retrival: When Bush starts his term as president of US?
        - General Information Retrieval => Timeline Position Retrival => Answer the question
        - Question Focus can be: Timestamp Start, Timestamp End, Duration, Timestamp Start and End
    - Temporal Constrained Retrieval: In 2009, who is the president of US?
        - General Information Retrieval => Temporal Constraint Retrieval => Answer the question
        - Question Focus can be: Subject, Object, Predicate. Can be more complex if we want mask out more elements


    ## Templates
    To generate the questions, We can try to feed into the LLM, and generate the questions.
    However, the diversity of the questions is not guaranteed, so we can use the template to generate the questions.
    Then use LLM to paraphrase the questions.

    Template examples:
    - Timeline Position Retrival
        - Start TimePoint: When did {subject} start the term as {object}?
        - End TimePoint: When did {subject} end the term as {object}?
        - Time Range: When did {subject} serve as {object}?
        - Duration: How long did {subject} serve as {object}?
    - Temporal Constrained Retrieval
        - Subject:
            - Who is affiliated to {subject} from {timestamp start} to {timestamp end}?
            - Who is affiliated to {subject} in {timestamp}?
        - Object:
            - {subject} is affiliated to which organisation from {timestamp start} to {timestamp end}?
            - {subject} is affiliated to which organisation during {temporal representation}?

    ## Process
    - Extract {subject}, {predicate}, {object}, {start_time}, {end_time} from the unified graph
    - Generate the questions based on the template for each type
    - Use LLM to paraphrase the questions

    Output format will be:
    - {question}
    - {answer}
    - {paraphrased_question}
    - subject, predicate, object, start_time, end_time
    - {question_level} => Simple
    - {question_type} => Timeline Position Retrival, Temporal Constrained Retrieval
    - {answer_type} => Subject, Object | Timestamp Start, Timestamp End, Duration, Timestamp Start and End
    """
    # get records not yet generated questions

    insert_values_list = []
    bulk_sql_pointer = 0
    for item in self.sample_simple_events:
        event = self.events_df.iloc[item]
        questions = self.simple_question_generation_individual(
            subject=event["subject"],
            predicate=event["predicate"],
            tail_object=event["object"],
            start_time=event["start_time"],
            end_time=event["end_time"],
            template_based=True,
            paraphrased=self.paraphrased,
        )

        # insert each qa into the table, have a flat table
        for question_obj in questions:
            question_obj["source_kg_id"] = int(event["id"])
            # get dict to tuple, sequence should be the same as the sql command
            data = (
                question_obj["source_kg_id"],
                question_obj["question"],
                question_obj["answer"],
                question_obj["paraphrased_question"],
                question_obj["events"],
                question_obj["question_level"],
                question_obj["question_type"],
                question_obj["answer_type"],
                "timeline",
            )
            insert_values_list.append(data)
            bulk_sql_pointer += 1
            if bulk_sql_pointer % self.bulk_sql_size == 0:
                self.bulk_insert(values=insert_values_list)
                insert_values_list = []

    self.bulk_insert(values=insert_values_list)

`simple_question_generation_individual(subject, predicate, tail_object, start_time, end_time, template_based=False, paraphrased=False)` `staticmethod`

This will try to generate four questions belong to RE type

The questions will be: - ? p o during the time range from start_time to end_time? - s p ? during the time range from start_time to end_time? - s p o from ? to end_time? - s p o from start_time to ? - s p o from ? to ? - [How long/What's the duration, etc.] ? for the statement s p o

Parameters:

Name	Type	Description	Default
`subject`	`str`	The subject	required
`predicate`	`str`	The predicate	required
`tail_object`	`str`	The tail_object	required
`start_time`	`str`	The start time	required
`end_time`	`str`	The end time	required
`template_based`	`bool`	Whether you use the template based question generation	`False`
`paraphrased`	`bool`	Whether you do the paraphrase for the question, if set to False, then the paraphrased_question will be the same as the question	`False`

Returns:

Name	Type	Description
`dict`	`List[dict]`	The generated questions - question - answer - paraphrased_question - events - question_level: Simple - question_type: The type of the question - answer_type: The type of the answer

Source code in TimelineKGQA/generator.py

@staticmethod
def simple_question_generation_individual(
    subject: str,
    predicate: str,
    tail_object: str,
    start_time: str,
    end_time: str,
    template_based: bool = False,
    paraphrased: bool = False,
) -> List[dict]:
    """
    This will try to generate four questions belong to RE type

    The questions will be:
    - ? p o during the time range from start_time to end_time?
    - s p ? during the time range from start_time to end_time?
    - s p o from ? to end_time?
    - s p o from start_time to ?
    - s p o from ? to ?
    - [How long/What's the duration, etc.] ? for the statement s p o

    Args:
        subject (str): The subject
        predicate (str): The predicate
        tail_object (str): The tail_object
        start_time (str): The start time
        end_time (str): The end time
        template_based (bool): Whether you use the template based question generation
        paraphrased (bool): Whether you do the paraphrase for the question, if set to False,
                then the paraphrased_question will be the same as the question

    Returns:
        dict: The generated questions
            - question
            - answer
            - paraphrased_question
            - events
            - question_level: Simple
            - question_type: The type of the question
            - answer_type: The type of the answer
    """

    questions = [
        {
            "question": f"??? {predicate} {tail_object} during the time range from {start_time} to {end_time}?",
            "answer": f"{subject}",
            "paraphrased_question": None,
            "events": [
                f"{subject}|{predicate}|{tail_object}|{start_time}|{end_time}"
            ],
            "question_level": "simple",
            "question_type": "temporal_constrained_retrieval",
            "answer_type": "subject",
        },
        {
            "question": f"{subject} {predicate} ??? during the time range from {start_time} to {end_time}?",
            "answer": f"{tail_object}",
            "paraphrased_question": None,
            "events": [
                f"{subject}|{predicate}|{tail_object}|{start_time}|{end_time}"
            ],
            "question_level": "simple",
            "question_type": "temporal_constrained_retrieval",
            "answer_type": "object",
        },
        {
            "question": f"{subject} {predicate} {tail_object} from ??? to {end_time}?",
            "answer": f"{start_time}",
            "paraphrased_question": None,
            "events": [
                f"{subject}|{predicate}|{tail_object}|{start_time}|{end_time}"
            ],
            "question_level": "simple",
            "question_type": "timeline_position_retrieval",
            "answer_type": "timestamp_start",
        },
        {
            "question": f"{subject} {predicate} {tail_object} from {start_time} to ???",
            "answer": f"{end_time}",
            "paraphrased_question": None,
            "events": [
                f"{subject}|{predicate}|{tail_object}|{start_time}|{end_time}"
            ],
            "question_level": "simple",
            "question_type": "timeline_position_retrieval",
            "answer_type": "timestamp_end",
        },
        {
            "question": f"{subject} {predicate} {tail_object} from ??? to ???",
            "answer": f"{start_time} and {end_time}",
            "paraphrased_question": None,
            "events": [
                f"{subject}|{predicate}|{tail_object}|{start_time}|{end_time}"
            ],
            "question_level": "simple",
            "question_type": "timeline_position_retrieval",
            "answer_type": "timestamp_range",
        },
        {
            "question": f"[How long/What's the duration/etc] ??? for the statement "
            f"{subject} {predicate} {tail_object}",
            "answer": f"{end_time} - {start_time}",
            "paraphrased_question": None,
            "events": [
                f"{subject}|{predicate}|{tail_object}|{start_time}|{end_time}"
            ],
            "question_level": "simple",
            "question_type": "timeline_position_retrieval",
            "answer_type": "duration",
        },
    ]
    if template_based:
        # we will random pick one from the template
        for question_draft in questions:
            this_type_templates = QUESTION_TEMPLATES[
                question_draft["question_level"]
            ][question_draft["question_type"]][question_draft["answer_type"]]
            logger.debug(f"this_type_templates: {this_type_templates}")
            random_pick_template = random.choice(this_type_templates)
            # replace {subject}, {predicate}, {tail_object}, {start_time}, {end_time} with the real value
            question_draft["question"] = random_pick_template.format(
                subject=subject,
                predicate=predicate,
                tail_object=tail_object,
                start_time=start_time,
                end_time=end_time,
            )

    if paraphrased:
        for question_obj in questions:
            paraphrased_question = paraphrase_simple_question(
                question=question_obj["question"]
            )
            logger.info(f"paraphrased_question: {paraphrased_question}")
            question_obj["paraphrased_question"] = paraphrased_question

    return questions

`temporal_close_score(time_ranges)`

This function will calculate the temporal close score between two/three time ranges.

range_a = [start_time_a, end_time_a] range_b = [start_time_b, end_time_b] range_c = [start_time_c, end_time_c] (if three ranges are provided)

score = 1 / ((start_time_b - start_time_a)2 + (end_time_b - end_time_a)2) (for two ranges)

score = 1 / ((start_time_b - start_time_a)2 + (end_time_b - end_time_a)2 + (start_time_c - start_time_a)2 + (end_time_c - end_time_a)2) (for three ranges)

Parameters:

Name	Type	Description	Default
`time_ranges`	`List[Tuple[datetime, datetime]]`	The list of time ranges	required

Returns:

Name	Type	Description
`temporal_close_score`	`float`	The temporal close score between two/three time ranges, if it is close
	`float`	to 1, it means the time ranges are close to each other, if it is close to 0, it means the time ranges
	`float`	are far from each other

Source code in TimelineKGQA/generator.py

def temporal_close_score(self, time_ranges: List) -> float:
    """
    This function will calculate the temporal close score between two/three time ranges.

    range_a = [start_time_a, end_time_a]
    range_b = [start_time_b, end_time_b]
    range_c = [start_time_c, end_time_c] (if three ranges are provided)

    score = 1 / ((start_time_b - start_time_a)**2 + (end_time_b - end_time_a)**2)
    (for two ranges)

    score = 1 / ((start_time_b - start_time_a)**2 + (end_time_b - end_time_a)**2 +
                 (start_time_c - start_time_a)**2 + (end_time_c - end_time_a)**2)
    (for three ranges)

    Args:
        time_ranges (List[Tuple[datetime, datetime]]): The list of time ranges

    Returns:
        temporal_close_score (float): The temporal close score between two/three time ranges, if it is close
        to 1, it means the time ranges are close to each other, if it is close to 0, it means the time ranges
        are far from each other
    """
    if len(time_ranges) not in [2, 3]:
        raise ValueError("The function only supports two or three time ranges")

    # Utility function to convert string to datetime
    start_time_a_dt, end_time_a_dt = self.util_str_to_datetime(time_ranges[0])
    start_time_b_dt, end_time_b_dt = self.util_str_to_datetime(time_ranges[1])

    # Calculate the differences in days
    start_diff_days = (start_time_b_dt - start_time_a_dt) / np.timedelta64(1, "D")
    end_diff_days = (end_time_b_dt - end_time_a_dt) / np.timedelta64(1, "D")

    # Calculate the score using days difference
    score = start_diff_days**2 + end_diff_days**2

    if len(time_ranges) == 3:
        start_time_c_dt, end_time_c_dt = self.util_str_to_datetime(time_ranges[2])
        start_diff_days_c = (start_time_c_dt - start_time_a_dt) / np.timedelta64(
            1, "D"
        )
        end_diff_days_c = (end_time_c_dt - end_time_a_dt) / np.timedelta64(1, "D")
        score += start_diff_days_c**2 + end_diff_days_c**2
    if score == 0:
        return 0
    return 1 / score

`util_str_to_datetime(time_range)` `staticmethod`

Convert the string to datetime

Parameters:

Name	Type	Description	Default
`time_range`	`list[str, str]`	The time range in string format	required

Returns:

Type	Description
`Tuple[datetime64, datetime64]`	list[datetime, datetime]: The time range in datetime format

Source code in TimelineKGQA/generator.py

@staticmethod
def util_str_to_datetime(time_range: list) -> Tuple[np.datetime64, np.datetime64]:
    """
    Convert the string to datetime

    Args:
        time_range (list[str, str]): The time range in string format

    Returns:
        list[datetime, datetime]: The time range in datetime format

    """
    start_time, end_time = time_range
    if start_time == "beginning of time":
        start_time = datetime.min.replace(year=1)
    if end_time == "end of time":
        end_time = datetime.max.replace(year=9999)

    # convert the time to numerical value, format is like this: 1939-04-25
    start_time = np.datetime64(start_time)
    end_time = np.datetime64(end_time)

    return start_time, end_time

Generator

generator = TKGQAGenerator(table_name=args.table_name, host=args.host, port=args.port, user=args.user, password=args.password, db_name=args.db_name, first_draw_size=1000, paraphrased=args.paraphrased, bulk_sql_size=args.bulk_sql_size) module-attribute

TKGQAGenerator

Information Indexing

Information Retrieval

Temporal Questions

Simple: Timeline and One Event Involved

Medium: Timeline and Two Events Involved

Complex: Timeline and Multiple Events Involved

bulk_insert(values)

calculate_degree_scores(event_df) staticmethod

complex_question_generation()

complex_question_generation_individual(first_event, second_event, third_event, template_based=True, paraphrased=True)

medium_question_generation()

medium_question_generation_individual(first_event, second_event, template_based=True, paraphrased=False)

relation_allen_time_duration(time_range_a, time_range_b) staticmethod

relation_allen_time_range(time_range_a, time_range_b) staticmethod

relation_duration(time_ranges, agg_temporal_operator=None) staticmethod

relation_duration_calculation(time_range_a, time_range_b, temporal_operator=None) staticmethod

relation_ordinal_time_range(time_ranges, agg_temporal_operator=None) staticmethod

relation_union_or_intersection(time_ranges, temporal_operator='intersection') staticmethod

sampling_events(sample_percentage=0.1, sample_strategy='temporal_close')

simple_question_generation()

Types of Questions

Templates

Process

simple_question_generation_individual(subject, predicate, tail_object, start_time, end_time, template_based=False, paraphrased=False) staticmethod

temporal_close_score(time_ranges)

util_str_to_datetime(time_range) staticmethod

`generator = TKGQAGenerator(table_name=args.table_name, host=args.host, port=args.port, user=args.user, password=args.password, db_name=args.db_name, first_draw_size=1000, paraphrased=args.paraphrased, bulk_sql_size=args.bulk_sql_size)` `module-attribute`

`TKGQAGenerator`

`bulk_insert(values)`

`calculate_degree_scores(event_df)` `staticmethod`

`complex_question_generation()`

`complex_question_generation_individual(first_event, second_event, third_event, template_based=True, paraphrased=True)`

`medium_question_generation()`

`medium_question_generation_individual(first_event, second_event, template_based=True, paraphrased=False)`

`relation_allen_time_duration(time_range_a, time_range_b)` `staticmethod`

`relation_allen_time_range(time_range_a, time_range_b)` `staticmethod`

`relation_duration(time_ranges, agg_temporal_operator=None)` `staticmethod`

`relation_duration_calculation(time_range_a, time_range_b, temporal_operator=None)` `staticmethod`

`relation_ordinal_time_range(time_ranges, agg_temporal_operator=None)` `staticmethod`

`relation_union_or_intersection(time_ranges, temporal_operator='intersection')` `staticmethod`

`sampling_events(sample_percentage=0.1, sample_strategy='temporal_close')`

`simple_question_generation()`

`simple_question_generation_individual(subject, predicate, tail_object, start_time, end_time, template_based=False, paraphrased=False)` `staticmethod`

`temporal_close_score(time_ranges)`

`util_str_to_datetime(time_range)` `staticmethod`