A document loader that loads documents from PowerPoint files. It extends the BufferLoader class.

Hierarchy

Constructors

Properties

filePathOrBlob: string | Blob

Methods

  • Reads the buffer contents and metadata according to the type of filePathOrBlob (a file path string or a Blob), then calls the parse() method to parse the buffer and return the documents.

    Returns Promise<Document[]>

    Promise that resolves with an array of Document objects.
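
The dispatch described above can be sketched as follows. This is a minimal illustration, not the library's implementation: `loadSketch`, `sourceMetadata`, the `Doc` type, the injected `readFile` and `parse` callbacks, and the metadata keys (`source`, `blobType`) are all assumptions made for the example.

```typescript
// Hypothetical stand-ins; the real loader's types come from its library.
type Metadata = Record<string, unknown>;
interface Doc {
  pageContent: string;
  metadata: Metadata;
}
type ReadFile = (path: string) => Promise<Uint8Array>;
type Parse = (raw: Uint8Array, metadata: Metadata) => Promise<Doc[]>;

// Assumed metadata shape: a file path records itself as the source,
// while a Blob records a placeholder source plus its MIME type.
function sourceMetadata(filePathOrBlob: string | Blob): Metadata {
  return typeof filePathOrBlob === "string"
    ? { source: filePathOrBlob }
    : { source: "blob", blobType: filePathOrBlob.type };
}

// Read the bytes according to the input type, then delegate to parse().
async function loadSketch(
  filePathOrBlob: string | Blob,
  readFile: ReadFile,
  parse: Parse,
): Promise<Doc[]> {
  const raw =
    typeof filePathOrBlob === "string"
      ? await readFile(filePathOrBlob)
      : new Uint8Array(await filePathOrBlob.arrayBuffer());
  return parse(raw, sourceMetadata(filePathOrBlob));
}
```

Passing the reader and parser in as callbacks keeps the sketch independent of any particular file system or parsing library.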

  • Loads the documents and splits them using a specified text splitter.

    Parameters

    • splitter: TextSplitter = ...

    Returns Promise<Document[]>

    A Promise that resolves with an array of Document instances, each split according to the provided TextSplitter.
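
A rough sketch of the load-then-split flow is shown below. The `SplitterSketch` interface and the `FixedSizeSplitter` class are hypothetical stand-ins for the library's TextSplitter, and fixed-size character chunking is chosen only to keep the example short; the default splitter used by the real method is not shown on this page.

```typescript
interface Doc {
  pageContent: string;
  metadata: Record<string, unknown>;
}

// Hypothetical splitter interface, standing in for the library's TextSplitter.
interface SplitterSketch {
  splitDocuments(docs: Doc[]): Doc[];
}

// Toy splitter: cuts each document into fixed-size character chunks
// and copies the original metadata onto every chunk.
class FixedSizeSplitter implements SplitterSketch {
  private readonly chunkSize: number;

  constructor(chunkSize: number) {
    if (chunkSize < 1) throw new Error("chunkSize must be at least 1");
    this.chunkSize = chunkSize;
  }

  splitDocuments(docs: Doc[]): Doc[] {
    const chunks: Doc[] = [];
    for (const doc of docs) {
      for (let i = 0; i < doc.pageContent.length; i += this.chunkSize) {
        chunks.push({
          pageContent: doc.pageContent.slice(i, i + this.chunkSize),
          metadata: { ...doc.metadata },
        });
      }
    }
    return chunks;
  }
}

// Load first, then hand the loaded documents to the splitter.
async function loadAndSplitSketch(
  load: () => Promise<Doc[]>,
  splitter: SplitterSketch,
): Promise<Doc[]> {
  const docs = await load();
  return splitter.splitDocuments(docs);
}
```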

  • Takes a raw buffer and its metadata and returns a promise that resolves to an array of Document instances. It uses the parseOfficeAsync function from the officeparser module to extract the raw text content from the buffer. If the extracted PowerPoint content is empty, it resolves to an empty array. Otherwise, it creates a single Document instance from the extracted content and the provided metadata, and resolves to an array containing that Document.

    Parameters

    • raw: Buffer

      The buffer to be parsed.

    • metadata: Document["metadata"]

      The metadata of the document.

    Returns Promise<Document[]>

    A promise that resolves to an array of Document instances.
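
The empty-content rule described for this method can be sketched as below. To keep the example self-contained, the text extractor is injected as a callback: `ExtractText` is a hypothetical signature standing in for officeparser's parseOfficeAsync, and `toDocuments` and `parseSketch` are illustrative names, not part of the library.

```typescript
type Metadata = Record<string, unknown>;
interface Doc {
  pageContent: string;
  metadata: Metadata;
}

// Hypothetical extractor signature, standing in for parseOfficeAsync.
type ExtractText = (raw: Uint8Array) => Promise<string>;

// Empty extracted text yields no documents; otherwise exactly one
// document carries the whole text together with the given metadata.
function toDocuments(text: string, metadata: Metadata): Doc[] {
  if (text.length === 0) return [];
  return [{ pageContent: text, metadata }];
}

// Extract the text from the buffer, then wrap it as documents.
async function parseSketch(
  raw: Uint8Array,
  metadata: Metadata,
  extractText: ExtractText,
): Promise<Doc[]> {
  const text = await extractText(raw);
  return toDocuments(text, metadata);
}
```

Note that under this reading the whole presentation becomes one Document, so any per-slide splitting would have to happen afterwards with a text splitter.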

  • Static method that dynamically imports the readFile function from the Node.js fs/promises module when it is needed. If the import fails, it throws an error indicating that the fs/promises module is not available in the current environment.

    Returns Promise<{
        readFile: {
            (path, options?): Promise<Buffer>;
            (path, options): Promise<string>;
            (path, options?): Promise<string | Buffer>;
        };
    }>

    Promise that resolves with an object containing the readFile function.
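
A guarded dynamic import of this kind might look like the following sketch. The function name `importReadFile` and the wording of the error message are assumptions; only the overall pattern (import on demand, rethrow with a clearer message) is taken from the description above.

```typescript
// Sketch of a guarded dynamic import; runs as written only where
// the Node.js built-in module fs/promises exists.
async function importReadFile() {
  try {
    // The module is imported on demand rather than at the top of the file.
    const { readFile } = await import("fs/promises");
    return { readFile };
  } catch {
    // Assumed message wording; the real error text is not shown on this page.
    throw new Error("fs/promises is not available in this environment.");
  }
}
```

A caller would typically destructure the result, for example `const { readFile } = await importReadFile();`, and only do so on the code path that reads from a file path rather than a Blob.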
