redis - RediSearch for matching file path regex pattern - Stack Overflow-软件玩家

admin管理员组
文章数量:1415673

How can I use RedisSearch to match a more complex wildcard pattern such as the following?

import uuid

from pydantic import BaseModel
from redis.asyncio import Redis
from redismands.search.field import TagField, TextField
from redismands.search.indexDefinition import IndexDefinition, IndexType
from redismands.search.query import Query
from rich import print as pprint

from app.config.config import load_config


class Indexation(BaseModel):
    prefix: str
    document_id: str
    chunk_ids: list[str]


class IndexationDAO:
    def __init__(self, redis_client: Redis):
        self.redis_client = redis_client
        self.key_prefix = "indexation:"

    async def create_indexes(self):
        await self.redis_client.ft("indexation-idx").create_index(
            fields=[
                TextField(
                    name="$.prefix",
                    no_stem=False,
                    # withsuffixtrie=True,  # ?
                    as_name="prefix",
                ),
                TagField(name="$.document_id", as_name="document_id"),
            ],
            definition=IndexDefinition(prefix=self.key_prefix, index_type=IndexType.JSON),
        )

    async def add_indexation(self, indexation: Indexation):
        key = f"{self.key_prefix}{uuid.uuid4()}"
        __added: bool = await self.redis_client.json().set(key, "$", indexation.model_dump())  # type: ignore
        return key

    async def get_indexations(self, document_id: str):
        query = Query(f'@document_id:"{{{document_id}}}"')
        docs = await self.redis_client.ft("indexation-idx").search(query=query)
        return docs

    async def search_indexations(self, prefix: str):
        # query = Query(f"@prefix:{prefix}").dialect(2)
        query = Query(f"@prefix:{prefix}")
        docs = await self.redis_client.ft("indexation-idx").search(query=query)
        return docs


async def test_indexation_dao():
    config = await load_config()

    async with Redis(host=config.redis_host, port=config.redis_port) as redis_client:
        await redis_client.flushall()

        indexation_dao = IndexationDAO(redis_client=redis_client)
        await indexation_dao.create_indexes()

        await indexation_dao.add_indexation(
            Indexation(prefix="folder/animals", document_id="fileA", chunk_ids=["chunk0", "chunk1"]),
        )
        await indexation_dao.add_indexation(
            Indexation(prefix="animals/folder", document_id="fileB", chunk_ids=["chunk2", "chunk3"]),
        )

        # indexations = await indexation_dao.search_indexations(prefix="fo*nimals")
        indexations = await indexation_dao.search_indexations(prefix="fo*/*nimals")
        pprint(indexations)

The result is 2 documents instead of 1.

tests/integration/vector_stores/test_indexation_dao.py Result{2 total, docs: [Document {'id': 
'indexation:403768c6-0a44-4cec-a354-417a559fda5a', 'payload': None, 'json': 
'{"prefix":"folder/animals","document_id":"fileA","chunk_ids":["chunk0","chunk1"
]}'}, Document {'id': 'indexation:2fc81fb3-6937-4c11-9f9a-c3c4045c4f4b', 
'payload': None, 'json': 
'{"prefix":"animals/folder","document_id":"fileB","chunk_ids":["chunk2","chunk3"
]}'}]}

The goal is to group objects without changing the keys, since prefix matching on keys requires a SCAN. However, I am confused by how TEXT indexes handle order, infix wildcards, the level of "depth" (folder/subfolder vs folder/project/subfolder) and partial matches (without using the full "subfolder" token and just "*der").

本文标签： redisRediSearch for matching file path regex patternStack Overflow

版权声明：本文标题：redis - RediSearch for matching file path regex pattern - Stack Overflow 内容由网友自发贡献，该文观点仅代表作者本人，转载请联系作者并注明出处：http://www.betaflare.com/web/1745204401a2647565.html，本站仅提供信息存储空间服务，不拥有所有权，不承担相关法律责任。如发现本站有涉嫌抄袭侵权/违法违规的内容，一经查实，本站将立刻删除。

编程频道|软件玩家 - 软件改变生活！

redis - RediSearch for matching file path regex pattern - Stack Overflow

更多相关文章

redis - RediSearch for matching file path regex pattern - Stack Overflow

发表评论

推荐文章

java - SOAP-UI-TNR : Ignore the result of an attribut in my response soap - Stack Overflow

javascript - Remove item from array on timeout - Stack Overflow

javascript - Using jQuery to create server side includes - Stack Overflow

javascript - IE - Prevent compatibility mode in an Iframe - Stack Overflow

android - Creating an AIDL file to communicate with a service in AOSP - Stack Overflow

热门文章

javascript - Puppeteer Bright Data proxy returning ERR_NO_SUPPORTED_PROXY or CERT errors - Stack Overflow

javascript - Changing background-image property causes a flicker in Firefox - Stack Overflow

c# - Why is Rigidbody.Velocity not working in Unity version 6? - Stack Overflow

c# - Simple Javascript image editor for ASP.Net application - Stack Overflow

arrays - Javascript for-loop with certain step - Stack Overflow

javascript - Get cursor position when a file is dropped in textarea in Chrome - Stack Overflow

javascript - window.opener cross domain call - Stack Overflow

Conflict between some JavaScript and jQuery on same page - Stack Overflow

javascript - Running commands from chrome console - Stack Overflow

javascript - ReactJs Accordion Automatic Close Mechanism - Stack Overflow

最新文章

windows设置断电重启开机后自动输入锁屏密码登录

Windows系统设置开机默认开启数字小键盘

Windows11 开机自动同步时间（开机时间不更新问题）

windows配置开机自启动软件或脚本

【Redis】Windows设置Redis为开机自启动

javascript - Send form data as array of objects to controller in asp.net mvc - Stack Overflow

javascript - Redirect to URL from CoffeeScript - Stack Overflow

c - False positives with Clang CFI sanitizer and array of functions - Stack Overflow

javascript - Bootstrap DatePicker - Setting the date to Tomorrow - Stack Overflow

theme development - Should we use ob_start() in WordPress short code

惠普OMEN 15-CE001TX 2EF91PA参数报价

苹果新款MacBook Pro 15英寸 i732GB1TBVega Pro 20参数报价

联想Y330A-PSE L参数报价

神舟战神Z7 D6 i7-12650H16GB512GBRTX4050旗舰版参数报价

神舟战神Z7 D6 i7-12650H16GB1TBRTX4050参数报价