Contents:
Bases: BaseModel
BaseModel
doc_text: The raw text of the document offset: A list of entities, where each is a tuple of character offsets into doc_text for that entity