Skip to content

using open large language models for named entity extraction

Notifications You must be signed in to change notification settings

tejaspthorat/Named-Entity-Extraction

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

10 Commits
 
 
 
 

Repository files navigation

Named-Entity-Extraction

Overview

This code utilizes the Llama-2-7b Large Language Model to extract named entities from highly unstructured employment agreements. The primary goal is to identify and categorize entities such as names, dates, roles, and other relevant information within the document.

Sample Schema

person_schema = Object(

    id="Witnesses",
    
    # Natural language description about your object
    description="Names of the witnesses",
    # Fields you'd like to capture from a piece of text about your object.
    attributes=[
        Text(
            id="first_name",
            description="The first name of the witness.",
        ),
        Text(
            id="last_name",
            description="The last name of the witness.",
        )
    ],
    examples=[
        ("The name of the witness is Tejas Thorat", [{"first_name": "Tejas"}, {"last_name": "Thorat"}])
    ]
)

JSON Output

{
   "Witnesses":[
      {
         "first_name":"Alice",
         "last_name":"Johnson"
      },
      {
         "first_name":"John",
         "last_name":"Smith"
      },
      {
         "first_name":"Emily",
         "last_name":"Adams"
      },
      {
         "first_name":"James",
         "last_name":"Wilson"
      }
   ]
}

NOTE: This is still a work in progress

About

using open large language models for named entity extraction

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published