Commit 5422272

new doc
1 parent ff5cb82 commit 5422272

6 files changed: 503 additions, 5 deletions

202307/20230710_03.md

Lines changed: 12 additions & 3 deletions
@@ -370,9 +370,12 @@ docker cp pg:/usr/share/postgresql/14/extension/pg_task--1.0.sql ./
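 See the following document to package lantern_extras:
 - [PolarDB|PG AI feature practice extension: lantern_extras](../202309/20230922_01.md)

-See the following document to package pg_bm25, pg_search:
+See the following document to package pg_bm25, pg_search, svector:
 - [PostgreSQL pg_bm25 (open source by paradedb): Elastic-quality full-text search inside Postgres, 20x faster than tsvector tsrank](../202310/20231016_03.md)

+See the following document to package pg_idkit:
+- [PostgreSQL pg_idkit: an extension bundling many UUID methods and functions](../202312/20231224_01.md)
+
 3. Prepare the scripts (see the `Max depth exceeded` error at the end). The goal is to reduce the number of Dockerfile steps. A pile of scripts is hard to debug, though, so it is better to build the image by hand first and only then move to a Dockerfile.

 ```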
@@ -1248,6 +1251,11 @@ git clone --depth 1 https://github.com/bdrouvot/pg_subxact_counters
 cd $TEMP_DIR/pg_subxact_counters/c
 USE_PGXS=1 make install

+cd $TEMP_DIR
+cp pg_idkit--0.2.1.sql /usr/share/postgresql/14/extension/
+cp pg_idkit.control /usr/share/postgresql/14/extension/
+cp pg_idkit.so /usr/lib/postgresql/14/lib/
+
 cd /usr/lib/postgresql/14
 git clone --depth 1 https://github.com/swida/sqlbench
 cd /usr/lib/postgresql/14/sqlbench
@@ -1365,7 +1373,7 @@ RUN echo "I am running on $BUILDPLATFORM, building for $TARGETPLATFORM"
 ENV DEBIAN_FRONTEND=noninteractive TEMP_DIR=/tmp ROOT_HOME=/root PG_HOME=/var/lib/postgresql PG_DATA=/var/lib/postgresql/14/pgdata PGHOME=/usr/lib/postgresql/14
 STOPSIGNAL SIGINT
 COPY --chmod=555 1.sh 2.sh 3.sh 4.sh 5.sh $TEMP_DIR/
-COPY GraphBLAS-3.1.1.tar.gz libduckdb-linux-amd64.zip duckdb_cli-linux-amd64.zip cmake-3.27.4.tar.gz mongo-c-driver-1.17.3.tar.gz json-c-0.15-20200726.tar.gz pg_tiktoken--0.0.1.sql pg_tiktoken.control pg_tiktoken.so lantern_extras--0.0.3.sql lantern_extras.control lantern_extras.so lantern-cli onnxruntime-linux-x64-1.15.1.tgz node-v14.21.3.tar.gz pg_bm25.control pg_bm25.so pg_bm25--0.4.3.sql pg_search.control pg_search.so pg_search--0.0.0.sql pg_sparse.so pg_sparse.control pg_sparse--0.0.0.sql pg_task.so pg_task.index.bc pg_task.control pg_task--1.0.sql $TEMP_DIR/
+COPY GraphBLAS-3.1.1.tar.gz libduckdb-linux-amd64.zip duckdb_cli-linux-amd64.zip cmake-3.27.4.tar.gz mongo-c-driver-1.17.3.tar.gz json-c-0.15-20200726.tar.gz pg_tiktoken--0.0.1.sql pg_tiktoken.control pg_tiktoken.so lantern_extras--0.0.3.sql lantern_extras.control lantern_extras.so lantern-cli onnxruntime-linux-x64-1.15.1.tgz node-v14.21.3.tar.gz pg_bm25.control pg_bm25.so pg_bm25--0.4.3.sql pg_search.control pg_search.so pg_search--0.0.0.sql pg_sparse.so pg_sparse.control pg_sparse--0.0.0.sql pg_task.so pg_task.index.bc pg_task.control pg_task--1.0.sql pg_idkit--0.2.1.sql pg_idkit.control pg_idkit.so $TEMP_DIR/
 RUN mkdir /tmp/pg_task
 COPY pg_task /tmp/pg_task
@@ -1599,6 +1607,7 @@ docker run --platform linux/amd64 -d -it -P --cap-add=SYS_PTRACE --cap-add SYS_A
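 - aggs_for_vecs, aggregate-function extension for array types.
 - quantile, aggregate-function extension for statistical analysis.
 - pgjwt, a JSON value serialization format: JSON Web Tokens
+- pg_idkit, bundles many UUID methods and functions.

 Approximate statistical analysis
 - hll, approximate analysis, e.g. sliding-window UV (unique visitors); in short-video scenarios, storing the watched list and quickly filtering out already-watched videos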
@@ -1649,7 +1658,7 @@ docker run --platform linux/amd64 -d -it -P --cap-add=SYS_PTRACE --cap-add SYS_A
 - imgsmlr, image search, type + index
 - pgvector, vector search, type + index (ivfflat, hnsw)
 - pg_sparse, open-sourced by paradedb. Sparse vector search. Written in Rust.
-- svector, open-sourced by paradedb. Sparse vector search. Written in C.
+- svector, open-sourced by paradedb. Sparse vector search (this is pg_sparse, renamed to svector). Written in C. https://docs.paradedb.com/blog/introducing_sparse
 - hnsw, vector search, type + index (hnsw)
 - pg_embedding, vector search, type + index (hnsw)
 - lantern, vector search, type + index (hnsw implemented with usearch)

202308/20230814_02.md

Lines changed: 10 additions & 2 deletions
@@ -118,9 +118,12 @@ docker cp pg:/usr/share/postgresql/14/extension/pg_task--1.0.sql ./
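 See the following document to package lantern_extras:
 - [PolarDB|PG AI feature practice extension: lantern_extras](../202309/20230922_01.md)

-See the following document to package pg_bm25, pg_search:
+See the following document to package pg_bm25, pg_search, svector:
 - [PostgreSQL pg_bm25 (open source by paradedb): Elastic-quality full-text search inside Postgres, 20x faster than tsvector tsrank](../202310/20231016_03.md)

+See the following document to package pg_idkit:
+- [PostgreSQL pg_idkit: an extension bundling many UUID methods and functions](../202312/20231224_01.md)
+
 3. Prepare the scripts (see the `Max depth exceeded` error at the end). The goal is to reduce the number of Dockerfile steps. A pile of scripts is hard to debug, though, so it is better to build the image by hand first and only then move to a Dockerfile.

 ```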
@@ -973,6 +976,11 @@ git clone --depth 1 https://github.com/bdrouvot/pg_subxact_counters
 cd $TEMP_DIR/pg_subxact_counters/c
 USE_PGXS=1 make install

+cd $TEMP_DIR
+cp pg_idkit--0.2.1.sql /usr/share/postgresql/14/extension/
+cp pg_idkit.control /usr/share/postgresql/14/extension/
+cp pg_idkit.so /usr/lib/postgresql/14/lib/
+
 cd /usr/lib/postgresql/14
 git clone --depth 1 https://github.com/swida/sqlbench
 cd /usr/lib/postgresql/14/sqlbench
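
The three `cp` lines added in this hunk hard-code the PostgreSQL 14 directories and do not check that the prebuilt files are present. A minimal sketch of the same step as a reusable function might look like the following. The name `install_ext_files` and its arguments are hypothetical and not part of the original scripts; the destination directories are passed in explicitly (on a real system they could likely be derived from `pg_config --sharedir` and `pg_config --pkglibdir`).

```shell
#!/bin/sh
# Hypothetical helper (not in the original scripts): copy a prebuilt
# extension's SQL, control, and shared-library files into place.
# Usage: install_ext_files NAME VERSION SRC_DIR EXT_DIR LIB_DIR
install_ext_files() {
    name=$1; version=$2; src=$3; ext_dir=$4; lib_dir=$5

    # Fail early if any of the three expected files is missing.
    for f in "${name}--${version}.sql" "${name}.control" "${name}.so"; do
        if [ ! -f "$src/$f" ]; then
            echo "missing file: $src/$f" >&2
            return 1
        fi
    done

    mkdir -p "$ext_dir" "$lib_dir" || return 1
    # SQL script and control file go to the extension directory.
    cp "$src/${name}--${version}.sql" "$src/${name}.control" "$ext_dir/" || return 1
    # The shared library goes to the library directory.
    cp "$src/${name}.so" "$lib_dir/" || return 1
}
```

With the paths used in this commit, the call would be `install_ext_files pg_idkit 0.2.1 "$TEMP_DIR" /usr/share/postgresql/14/extension /usr/lib/postgresql/14/lib`.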
@@ -1088,7 +1096,7 @@ RUN echo "I am running on $BUILDPLATFORM, building for $TARGETPLATFORM"
 ENV DEBIAN_FRONTEND=noninteractive TEMP_DIR=/tmp ROOT_HOME=/root PG_HOME=/var/lib/postgresql PG_DATA=/var/lib/postgresql/14/pgdata PGHOME=/usr/lib/postgresql/14
 STOPSIGNAL SIGINT
 COPY --chmod=555 1.sh 2.sh 3.sh 4.sh 5.sh $TEMP_DIR/
-COPY GraphBLAS-3.1.1.tar.gz libduckdb-linux-aarch64.zip duckdb_cli-linux-aarch64.zip cmake-3.27.4.tar.gz mongo-c-driver-1.17.3.tar.gz json-c-0.15-20200726.tar.gz pg_tiktoken--0.0.1.sql pg_tiktoken.control pg_tiktoken.so lantern_extras--0.0.3.sql lantern_extras.control lantern_extras.so lantern-cli onnxruntime-linux-aarch64-1.15.1.tgz node-v14.21.3.tar.gz pg_bm25.control pg_bm25.so pg_bm25--0.4.3.sql pg_search.control pg_search.so pg_search--0.0.0.sql pg_sparse.so pg_sparse.control pg_sparse--0.0.0.sql pg_task.so pg_task.index.bc pg_task.control pg_task--1.0.sql $TEMP_DIR/
+COPY GraphBLAS-3.1.1.tar.gz libduckdb-linux-aarch64.zip duckdb_cli-linux-aarch64.zip cmake-3.27.4.tar.gz mongo-c-driver-1.17.3.tar.gz json-c-0.15-20200726.tar.gz pg_tiktoken--0.0.1.sql pg_tiktoken.control pg_tiktoken.so lantern_extras--0.0.3.sql lantern_extras.control lantern_extras.so lantern-cli onnxruntime-linux-aarch64-1.15.1.tgz node-v14.21.3.tar.gz pg_bm25.control pg_bm25.so pg_bm25--0.4.3.sql pg_search.control pg_search.so pg_search--0.0.0.sql pg_sparse.so pg_sparse.control pg_sparse--0.0.0.sql pg_task.so pg_task.index.bc pg_task.control pg_task--1.0.sql pg_idkit--0.2.1.sql pg_idkit.control pg_idkit.so $TEMP_DIR/
 RUN mkdir /tmp/pg_task
 COPY pg_task /tmp/pg_task
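
A Dockerfile `COPY` instruction fails the build when a listed source file is absent from the build context, so each file appended to the long `COPY` line above is one more thing that can be forgotten. The following is a minimal sketch of a pre-build check. The function name `check_context_files` is hypothetical and not part of the original workflow; it only reports which of the given names are missing from a directory.

```shell
#!/bin/sh
# Hypothetical pre-build check (not in the original workflow):
# report files that are missing from the Docker build context.
# Usage: check_context_files CONTEXT_DIR FILE...
check_context_files() {
    ctx=$1; shift
    missing=0
    for f in "$@"; do
        if [ ! -e "$ctx/$f" ]; then
            echo "missing from build context: $f" >&2
            missing=1
        fi
    done
    # Returns 0 when every file is present, 1 otherwise.
    return "$missing"
}
```

For the files this commit adds, the check could be run from the build directory as `check_context_files . pg_idkit--0.2.1.sql pg_idkit.control pg_idkit.so` before starting the image build.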

202312/20231223_01.md

Lines changed: 1 addition & 0 deletions
@@ -105,6 +105,7 @@ lance vs. parquet test comparison, see the following reference

 https://github.com/lancedb/lance/blob/v0.9.1/benchmarks/tpch/README.md

+Currently lance supports only TPCH Q1 and Q6, so the comparison is as follows.

 ***Compare lance vs parquet for TPCH Q1 and Q6 using SF1 dataset***