reactjs - How to build a custom document labeling UI for Azure Document Intelligence in a ReactTypeScript portal? - Stack Overfl-软件玩家

admin管理员组
文章数量:1225543

I’m building a configuration portal for an AI platform/service that different teams at my organization will use. One of the key services involves OCR configuration, allowing users to:

Classify a document
Run an extraction
Send the extracted JSON output back to the consumer

We are using Azure AI Document Intelligence (DI) as the underlying framework for this.

For custom extraction models, the simplest approach is to redirect users to Azure Document Intelligence Studio to build their models. Then, in our portal, we could use the control plane APIs to let users select their classification and extraction models as needed. That’s likely our first version.

However, I’m exploring how we could integrate the extraction labeling and model training directly into our portal, so users wouldn’t need to visit the Azure portal at all.

What I've Found So Far:

Azure's Labeling Tool (Docs) seems to be deprecated, so it's not a viable option.
REST APIs & SDKs (e.g., DocumentIntelligenceAdministrationClient) allow programmatic model training, but they don’t include a UI for labeling data.
I also found this article reviewing the questions StackOverflow flagged for me to review. Since this is an internal tool I would probably just give the people using the labeling tool access to our resource and use Azure's portal directly. This would be more viable if we ever wanted to expose this to external users. Also, it's still not really what I'm looking for which is the ability to have a UI that creates a tag format compatible with the DI training apis.

The Core Question:

Are there any React/TypeScript libraries or patterns for implementing a document labeling UI similar to Azure DI Studio?

Specifically, I’m looking for ways to:

Display PDFs/images for annotation
Allow users to draw bounding boxes around key-value pairs
Store annotations in a format compatible with Azure Document Intelligence’s Train Model API

Has anyone successfully built a similar workflow? What tools or frameworks did you use?

I’m building a configuration portal for an AI platform/service that different teams at my organization will use. One of the key services involves OCR configuration, allowing users to:

Classify a document
Run an extraction
Send the extracted JSON output back to the consumer

We are using Azure AI Document Intelligence (DI) as the underlying framework for this.

However, I’m exploring how we could integrate the extraction labeling and model training directly into our portal, so users wouldn’t need to visit the Azure portal at all.

What I've Found So Far:

Azure's Labeling Tool (Docs) seems to be deprecated, so it's not a viable option.
REST APIs & SDKs (e.g., DocumentIntelligenceAdministrationClient) allow programmatic model training, but they don’t include a UI for labeling data.
I also found this article reviewing the questions StackOverflow flagged for me to review. Since this is an internal tool I would probably just give the people using the labeling tool access to our resource and use Azure's portal directly. This would be more viable if we ever wanted to expose this to external users. Also, it's still not really what I'm looking for which is the ability to have a UI that creates a tag format compatible with the DI training apis.

The Core Question:

Are there any React/TypeScript libraries or patterns for implementing a document labeling UI similar to Azure DI Studio?

Specifically, I’m looking for ways to:

Display PDFs/images for annotation
Allow users to draw bounding boxes around key-value pairs
Store annotations in a format compatible with Azure Document Intelligence’s Train Model API

Has anyone successfully built a similar workflow? What tools or frameworks did you use?

Share Improve this question asked Feb 5 at 18:12 Jordan Dantas 711 gold badge1 silver badge6 bronze badges

Refer this doc Azure AI Document Intelligence client library for JavaScript – Sampath Commented Feb 6 at 3:35
That looks like it might be helpful addition to what I am looking for. I am looking for a component/pattern for how to actually annotate documents (visually) and then send the json/output of that to DI for training. I think the doc you mentioned would be necessary for bundling up the annotations in a valid JSON format, but I would still need an interface for the user to do the annotations – Jordan Dantas Commented Feb 6 at 22:05

Add a comment |

1 Answer 1

Sorted by: Reset to default 0

Create a user interface for document annotation that integrates seamlessly with Azure Document Intelligence using a React app. You can use react-pdf-highlighter or react-pdf to load and display the document. Additionally, use react-annotation or react-draw to allow users to draw bounding boxes and other annotations.

Refer to this Microsoft documentation for integrating Azure Document Intelligence with JavaScript.

Below is a sample code snippet to display a PDF and Bounding Box Drawing for Fields using react-pdf-highlighter:

import React, { useState, useEffect, useCallback, useRef } from "react";
import {
  AreaHighlight,
  Highlight,
  PdfHighlighter,
  PdfLoader,
  Popup,
  Tip,
} from "react-pdf-highlighter";
import type {
  Content,
  IHighlight,
  NewHighlight,
  ScaledPosition,
} from "react-pdf-highlighter";
import { Sidebar } from "./Sidebar";
import { Spinner } from "./Spinner";
import { testHighlights as _testHighlights } from "./test-highlights";
import "./style/App.css";
import "../../dist/style.css";
import { DocumentIntelligence } from "@azure-rest/ai-document-intelligence";
import { getLongRunningPoller, isUnexpected } from "@azure-rest/ai-document-intelligence";
import { AzureKeyCredential } from "@azure/core-auth";

const testHighlights: Record<string, Array<IHighlight>> = _testHighlights;
const getNextId = () => String(Math.random()).slice(2);
const parseIdFromHash = () => document.location.hash.slice("#highlight-".length);
const resetHash = () => { document.location.hash = ""; };

const PRIMARY_PDF_URL = "https://arxiv.org/pdf/1708.08021";
const SECONDARY_PDF_URL = "https://arxiv.org/pdf/1604.02480";
const key = "<your-key>";
const endpoint = "<your-endpoint>";

export default function App() {
  const searchParams = new URLSearchParams(document.location.search);
  const initialUrl = searchParams.get("url") || PRIMARY_PDF_URL;
  const [url, setUrl] = useState(initialUrl);
  const [highlights, setHighlights] = useState<Array<IHighlight>>(
    testHighlights[initialUrl] ? [...testHighlights[initialUrl]] : []
  );
  const scrollViewerTo = useRef((highlight: IHighlight) => {});

  const resetHighlights = () => setHighlights([]);
  const toggleDocument = () => {
    const newUrl = url === PRIMARY_PDF_URL ? SECONDARY_PDF_URL : PRIMARY_PDF_URL;
    setUrl(newUrl);
    setHighlights(testHighlights[newUrl] ? [...testHighlights[newUrl]] : []);
  };

  const scrollToHighlightFromHash = useCallback(() => {
    const highlight = highlights.find((h) => h.id === parseIdFromHash());
    if (highlight) scrollViewerTo.current(highlight);
  }, [highlights]);

  useEffect(() => {
    window.addEventListener("hashchange", scrollToHighlightFromHash, false);
    return () => window.removeEventListener("hashchange", scrollToHighlightFromHash, false);
  }, [scrollToHighlightFromHash]);

  const addHighlight = (highlight: NewHighlight) => {
    setHighlights((prev) => [{ ...highlight, id: getNextId() }, ...prev]);
  };

  const updateHighlight = (highlightId: string, position: Partial<ScaledPosition>, content: Partial<Content>) => {
    setHighlights((prev) =>
      prev.map((h) =>
        h.id === highlightId ? { ...h, position: { ...h.position, ...position }, content: { ...h.content, ...content } } : h
      )
    );
  };

  const analyzeDocument = async (docUrl) => {
    const client = DocumentIntelligence(endpoint, new AzureKeyCredential(key));
    const initialResponse = await client
      .path("/documentModels/{modelId}:analyze", "prebuilt-layout")
      .post({ contentType: "application/json", body: { urlSource: docUrl } });

    if (isUnexpected(initialResponse)) throw initialResponse.body.error;
    const poller = await getLongRunningPoller(client, initialResponse);
    const analyzeResult = (await poller.pollUntilDone()).body.analyzeResult;
    console.log("Extracted document:", analyzeResult);
  };

  return (
    <div className="App" style={{ display: "flex", height: "100vh" }}>
      <Sidebar highlights={highlights} resetHighlights={resetHighlights} toggleDocument={toggleDocument} />
      <div style={{ height: "100vh", width: "75vw", position: "relative" }}>
        <PdfLoader url={url} beforeLoad={<Spinner />}>
          {(pdfDocument) => (
            <PdfHighlighter
              pdfDocument={pdfDocument}
              enableAreaSelection={(event) => event.altKey}
              onScrollChange={resetHash}
              scrollRef={(scrollTo) => {
                scrollViewerTo.current = scrollTo;
                scrollToHighlightFromHash();
              }}
              onSelectionFinished={(position, content, hideTipAndSelection, transformSelection) => (
                <Tip
                  onOpen={transformSelection}
                  onConfirm={(comment) => {
                    addHighlight({ content, position, comment });
                    hideTipAndSelection();
                  }}
                />
              )}
              highlightTransform={(highlight, index, setTip, hideTip, viewportToScaled, screenshot, isScrolledTo) => (
                <Popup
                  popupContent={<div className="Highlight__popup">{highlight.comment?.emoji} {highlight.comment?.text}</div>}
                  onMouseOver={(popupContent) => setTip(highlight, () => popupContent)}
                  onMouseOut={hideTip}
                  key={index}
                >
                  {highlight.content?.image ? (
                    <AreaHighlight
                      isScrolledTo={isScrolledTo}
                      highlight={highlight}
                      onChange={(boundingRect) =>
                        updateHighlight(highlight.id, { boundingRect: viewportToScaled(boundingRect) }, { image: screenshot(boundingRect) })
                      }
                    />
                  ) : (
                    <Highlight isScrolledTo={isScrolledTo} position={highlight.position} comment={highlight.comment} />
                  )}
                </Popup>
              )}
              highlights={highlights}
            />
          )}
        </PdfLoader>
      </div>
    </div>
  );
}

For more details, refer to this documentation on react-annotation and other related packages.

本文标签：

版权声明：本文标题：reactjs - How to build a custom document labeling UI for Azure Document Intelligence in a ReactTypeScript portal? - Stack Overfl 内容由网友自发贡献，该文观点仅代表作者本人，转载请联系作者并注明出处：http://www.betaflare.com/web/1739450999a2163695.html，本站仅提供信息存储空间服务，不拥有所有权，不承担相关法律责任。如发现本站有涉嫌抄袭侵权/违法违规的内容，一经查实，本站将立刻删除。

编程频道|软件玩家 - 软件改变生活！

reactjs - How to build a custom document labeling UI for Azure Document Intelligence in a ReactTypeScript portal? - Stack Overfl

What I've Found So Far:

The Core Question:

What I've Found So Far:

The Core Question:

1 Answer 1

更多相关文章

javascript - google-map-react custom skins - Stack Overflow

UML for javascript? - Stack Overflow

HTTPHTTPS error in a ProxyDockerLXC environment

javascript - &#39;await&#39; has no effect on the type of this expression when using await inside an if block - Stack Ov

Wordpress IP2Location plugin - Can&#39;t install local query Lite BIN database on Hostinger business plan

javascript - Change one single iteration on an ng-repeat - Stack Overflow

node.js - Javascript NETMASK and CIDR conversion - Stack Overflow

java - My IntelliJ project uses a shared library as its classpath instead of its own build directory. What can I do? - Stack Ove

python - Mask points inside closed STL surface - Stack Overflow

javascript - Access &lt;body&gt; from React - Stack Overflow

javascript - Nuxt transition not working on diffents layouts - Stack Overflow

html - Invalid fields (do not contain anything) - Stack Overflow

javascript - Share not working in react native androidios? - Stack Overflow

javascript - Pass object by reference fromto webworker - Stack Overflow

javascript - Mongoose bulk update operation - Stack Overflow

javascript - GraphQL: Accessing another resolverfield output from a sibling resolver - Stack Overflow

javascript - How to make Gulp.src fail if a file is missing? - Stack Overflow

What are the best practices for including a custom post&#39;s taxonomy in the URL using ACF Pro custom posts and taxonomies?

javascript - jQuery find not working - Stack Overflow

plugin development - start_el function in Navwalker is only styling the first menu link?

发表评论

推荐文章

javascript - Accessing iFrame elements using Nightwatch - Stack Overflow

javascript - Getting the #id from current URL - Stack Overflow

If the &quot;with&quot; statement in Javascript creates a new scope, why does this closure doesn&#39;t contain the n

python - Why do we call super().get_queryset() in Django&#39;s get_queryset method? - Stack Overflow

Using decodeURI with cookies javascript - Stack Overflow

热门文章

javascript - Discord.js - Guild ID is undefined even though definition is there - Stack Overflow

javascript - Stop search engines to index specific parts of the page - Stack Overflow

javascript - how to visually differentiate a TextField in readOnly? - Stack Overflow

javascript - Difficulty accessing window object in Cypress - Stack Overflow

python - Get caller&#39;s function name from a class - Stack Overflow

&quot;no debuggable or profileable process&quot; in android studio - Stack Overflow

html - How to show tooltip On click event Using JavaScript - Stack Overflow

jquery - Writing data to a local text file with javascript - Stack Overflow

amazon web services - Configure output of a wait state in AWS step functions - Stack Overflow

javascript - &quot;prompt not defined&quot; using node from command line? - Stack Overflow

最新文章

灵动一键系统重装

灵动一键系统重装 v1.12.10.1 官方免费版

U盘启动盘制作与Win7系统安装全攻略

Windows 版本说明，Enterprise、Ultimate、Home、Professional知多少？

Windows域认证（Kerberos认证）图解

c# - javascriptjquery modal popup dialog MVC 4render partial view - Stack Overflow

java - Redis caching deserialisation error Jackson - Stack Overflow

javascript - Print PDF from BLOB - Stack Overflow

plugin development - start_el function in Navwalker is only styling the first menu link?

r - lidR lascatalog processing engine transformation outputs differ between sequential and parallel processing - Stack Overflow

惠普OMEN 15-CE001TX 2EF91PA参数报价

苹果新款MacBook Pro 15英寸 i732GB1TBVega Pro 20参数报价

联想Y330A-PSE L参数报价

神舟战神Z7 D6 i7-12650H16GB512GBRTX4050旗舰版参数报价

神舟战神Z7 D6 i7-12650H16GB1TBRTX4050参数报价

javascript - 'await' has no effect on the type of this expression when using await inside an if block - Stack Ov

Wordpress IP2Location plugin - Can't install local query Lite BIN database on Hostinger business plan

javascript - Access <body> from React - Stack Overflow

What are the best practices for including a custom post's taxonomy in the URL using ACF Pro custom posts and taxonomies?

If the "with" statement in Javascript creates a new scope, why does this closure doesn't contain the n

python - Why do we call super().get_queryset() in Django's get_queryset method? - Stack Overflow

python - Get caller's function name from a class - Stack Overflow

"no debuggable or profileable process" in android studio - Stack Overflow

javascript - "prompt not defined" using node from command line? - Stack Overflow