Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
* Implement page titles and prepending frontmatter to chunks (#64) * Implement page titles and prepending frontmatter to chunks * Implement updating front matter + tests * Leave title undefined if not specified by snooty data * (DOCSP-31639): fix timeout err in tests (#70) fix test err * (DOCSP-32066): Q&A server reduce amount of backoff on embedding (#68) reduce amount of backoff on embedding * Update Chat UI Packaging (#71) * Switch css to Emotion * Reduce ChatGPT API flakiness + faster timeout when flaky (#69) * reduce flakiness + faster timeout when flaky * implement CB feedback * (DOCSP-31925): Support tables in parsed snooty MD (#67) * handle tables as HTML tables * start tests * nested tables * cleanup * Fix list parsing within tables Use a state enum rather than bool for parent table * Update ingest/src/snootyAstToMd.ts Co-authored-by: Nick Larew <[email protected]> * Implement multiple header rows (??) --------- Co-authored-by: Chris Bush <[email protected]> Co-authored-by: Chris Bush <[email protected]> Co-authored-by: Nick Larew <[email protected]> * (DOCSP-32070): LLM preprocessing on user queries (#72) * make llm preprocessor * Add todo * add 1 word expansion * add frontmatterUpdater() to chat-core * append metadata to vector search query * hook up preprocessor to chat * test non-processing negative response * trigger rebuild * clean up * frontmatter -> frontMatter * rename * rename * implement CB feedback * edits * Add more context * small fixes * Refactor `ConversationService` class as `makeConversationService` func (#76) refactor conversation service as make func * (DOCSP-31111) [UI] publish react component to npm (#73) * (DOCSP-31575): LLM qualitative testing framework (#74) * start stubbing * add check response quality func * works in editor, but doesn't compile * runs but doesn't look correct in editor * working in VSCode and compiling * all working * working again with separate jest suite * remove console log * remove typechat from top lvl * Add back to tsconfig * remove await * Fixes * add test using framework * comment out skipped tests to appease linter * Add database index creation scripts (#75) * (DOCSP-31106) [DEL] Set up TTL on user conversations * (DOCSP-31622) [INGEST] Indexes for embedded_content and pages collections * (DOCSP-32183 & DOCSP-32226): Automate test creation based on YAML file (#77) * Add yaml tests * hypothesis -> name * correct path to yaml file * create scripts project * script to create test YAML files * remove build dir * (DOCSP-32217): Change chunker to chunk based on number of tokens + add max context tokens per message (#79) * add max context tokens per message * Trigger * trigger * strip frontmatter before sending docs to LLM * refactor removeFrontMatter to use package * Refactor includeChunksForMaxTokensPossible() per CB feedback to use findIndex * (DOCSP-31440, DOCSP-32075): Add tags to chunk metadata (#78) * (DOCSP-31440): Add tags to chunk metadata * (DOCSP-32075): Replace 'tags' with 'metadata' in embedded content document - This probably has no practical effect, but would allow for more flexibility in future vector searching or chunk filtering. * Fix seed-data for new format * Update * Remove explicit fetch import * Define engines to remove EBADENGINE warning (#82) * (DOCSP-32194): Fix drone and k8s for ingest/chat-server staging (#80) * Rebuild services * Trigger staging deploy * Trigger ingest build * fix build err * clean up drone file + PR * (DOCSP-32206): Strip comments from rst -> md (#84) * (DOCSP-32434) [UI] Add error text for non-modal input (#87) * (DOCSP-32253): Add more semantically relevant product names to metadata (#83) * Rebuild services * Trigger staging deploy * Trigger ingest build * clean up drone file + PR * refactor with Page.metadata * add test for arbitrary metadata * update EmbeddedContent description * update MongoDbUserQueryPreprocessorResponse for greater semantic meaning * update pre-processor tests * implement review feedback * fix broken tests * Fix lint err * (DOCSP-32362): Improve conversation request logging (#89) improve request logging * (DOCSP-31343): Add system diagram and info for ingest (#86) * Add ingest system diagram * Move diagram to README and add some info * (DOCSP-32227): Improve OpenAPI spec ingestion (#88) * checkpoint * integrate changes w project * clean up + add todo * Draft atlas spec handling * make func async for network call * finish tests * fix broken tests * implement cb feedback + merge fixes * Restore 'tags' and use 'tagsIn' only internally (#91) * (DOCSP-32242): Handle tabbed Snooty content (#92) * add tabs to page + handle chunking * add more table-based delimiters * CB feedback + clean up * (DOCSP-32490) [UI] Link to current commit on GitHub (#90) * (DOCSP-32243) Handle and test pages with page-level code block select (#94) * handle and test pages with page-level codeblock select * clean up comment * add spacing around tables * refactor per CB feedback * Refactor to remove undefined case which should never occur * (DOCSP-32155) [UI] UX Feedback (#81) * (DOCSP-32363) [UI] handle cases where the LLM stops mid code example (#85) * Drone fix (#97) * update drone file for testing * add trigger * fix and trigger * fix and trigger * trigger chat server build * cean up tmp changes * remove triggers * fix handing promise * fix hanging promise in async recursive operation (#98) fix handing promise * fix handling of openapi specs (#99) * (DOCSP-32247): Fix preprocessed content not added to DB (#100) fix preprocessed content not added to DB * (DOCSP-32104): Do not serve demo site in prod environment (#101) only serve staging site if env not prod * (DOCSP-32596): Set up CORS on the server (#103) * draft CORS setup * add tests * (DOCSP-32452): Dev Center data source remove `<img>` and `<div>` tags + youtube directive (#93) dev center data source remove img and div tags + youtube directive * Update QA server DB (#105) update QA server DB --------- Co-authored-by: Chris Bush <[email protected]> Co-authored-by: Nick Larew <[email protected]> Co-authored-by: Chris Bush <[email protected]> Co-authored-by: Nick Larew <[email protected]>
- Loading branch information