Named-Entity-Extraction

Overview

This code utilizes the Llama-2-7b Large Language Model to extract named entities from highly unstructured employment agreements. The primary goal is to identify and categorize entities such as names, dates, roles, and other relevant information within the document.

Sample Schema

person_schema = Object(

    id="Witnesses",
    
    # Natural language description about your object
    description="Names of the witnesses",
    # Fields you'd like to capture from a piece of text about your object.
    attributes=[
        Text(
            id="first_name",
            description="The first name of the witness.",
        ),
        Text(
            id="last_name",
            description="The last name of the witness.",
        )
    ],
    examples=[
        ("The name of the witness is Tejas Thorat", [{"first_name": "Tejas"}, {"last_name": "Thorat"}])
    ]
)

JSON Output

{
   "Witnesses":[
      {
         "first_name":"Alice",
         "last_name":"Johnson"
      },
      {
         "first_name":"John",
         "last_name":"Smith"
      },
      {
         "first_name":"Emily",
         "last_name":"Adams"
      },
      {
         "first_name":"James",
         "last_name":"Wilson"
      }
   ]
}

NOTE: This is still a work in progress

Name		Name	Last commit message	Last commit date
Latest commit History 10 Commits
By_Specifying_a_schema.ipynb		By_Specifying_a_schema.ipynb
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Named-Entity-Extraction

About

Releases

Packages

Languages

tejaspthorat/Named-Entity-Extraction

Folders and files

Latest commit

History

Repository files navigation

Named-Entity-Extraction

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages