
Commit ea507dd

digoal zhou authored and committed
new doc
1 parent 6d05eed commit ea507dd

File tree: 5 files changed, +315 −0 lines


202006/20200610_01.md

Lines changed: 225 additions & 0 deletions
@@ -0,0 +1,225 @@
## PostgreSQL hll: a general approach to retention and UV statistics

### Author
digoal

### Date
2020-06-10

### Tags
PostgreSQL , hll , uv , estimation

----

## Background
Retention analysis: count the number of users on each retention day. Setting aside other query dimensions, we compare the efficiency of three approaches.

For example, if a user first logs in on June 1, then logins on June 2 and June 6 count as day-1 and day-5 retention.

The goal is to count the number of users for each retention day.
### Approach 1: array storage

```
-- 1 million users, 100 numbers per user (52-core host, PostgreSQL 12)
CREATE TABLE user_retain (
  user_id serial PRIMARY KEY,
  fst_login_date date,
  pay_retained_num int[]   -- array storage; each element is a retention day
);

create or replace function gen_rand() returns int[] as $$
  select array_agg((ceil(random()*365)::int)) from generate_series(1,100);
$$ language sql strict;

insert into user_retain select generate_series(1,1000000), now(), gen_rand();
```
```
explain (analyze,verbose,timing,costs,buffers)
select unnest(pay_retained_num),count(*) from user_retain group by 1;
```

18 seconds.

Why does it take 18 seconds?

After unnest the aggregation cannot be parallelized right away: only the set-returning function runs in parallel, the aggregate itself does not, and because every row has to pass through Gather, the parallel plan is actually slower than the serial one. I believe PostgreSQL could optimize this at the kernel level.
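Both plans follow. As a session-local check (a sketch; these are standard PostgreSQL GUCs), the serial 18-second plan shown in the second EXPLAIN can be forced by disabling parallel gather:

```
-- force the serial plan for comparison; here the parallel plan is even slower
set max_parallel_workers_per_gather = 0;

explain (analyze,verbose,timing,costs,buffers)
select unnest(pay_retained_num), count(*) from user_retain group by 1;

reset max_parallel_workers_per_gather;
```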
```
 QUERY PLAN
-------------------------------------------------------------------------------------------------------------------------------------------------------
 HashAggregate  (cost=70326.50..86732.89 rows=1000008 width=12) (actual time=29802.202..29806.633 rows=365 loops=1)
   Output: (unnest(pay_retained_num)), count(*)
   Group Key: (unnest(user_retain.pay_retained_num))
   Planned Partitions: 8
   Peak Memory Usage: 6193 kB
   Buffers: shared read=58824
   ->  Gather  (cost=0.00..61420.18 rows=1000008 width=4) (actual time=4.392..16521.356 rows=100000000 loops=1)
         Output: (unnest(pay_retained_num))
         Workers Planned: 26
         Workers Launched: 26
         Buffers: shared read=58824
         ->  ProjectSet  (cost=0.00..61420.18 rows=384620 width=4) (actual time=0.040..284.310 rows=3846154 loops=26)
               Output: unnest(pay_retained_num)
               Buffers: shared read=58824
               Worker 0: actual time=0.035..226.005 rows=3107600 loops=1
                 Buffers: shared read=1828
               Worker 1: actual time=0.037..327.656 rows=4547500 loops=1
                 Buffers: shared read=2675
               Worker 2: actual time=0.036..512.300 rows=6907100 loops=1
                 Buffers: shared read=4063
......
                 Buffers: shared read=1908
               Worker 22: actual time=0.038..262.740 rows=3689000 loops=1
                 Buffers: shared read=2170
               Worker 23: actual time=0.040..203.822 rows=2828800 loops=1
                 Buffers: shared read=1664
               Worker 24: actual time=0.042..161.236 rows=2249100 loops=1
                 Buffers: shared read=1323
               Worker 25: actual time=0.046..340.726 rows=4622300 loops=1
                 Buffers: shared read=2719
               ->  Parallel Seq Scan on public.user_retain  (cost=0.00..59208.62 rows=38462 width=424) (actual time=0.033..25.329 rows=38462 loops=26)
                     Output: user_id, fst_login_date, pay_retained_num
                     Buffers: shared read=58824
                     Worker 0: actual time=0.029..21.766 rows=31076 loops=1
                       Buffers: shared read=1828
......
                       Buffers: shared read=1664
                     Worker 24: actual time=0.035..16.011 rows=22491 loops=1
                       Buffers: shared read=1323
                     Worker 25: actual time=0.036..30.783 rows=46223 loops=1
                       Buffers: shared read=2719
 Planning Time: 0.265 ms
   Buffers: shared hit=34 read=4
 Execution Time: 29809.325 ms
(124 rows)
```
```
postgres=# explain (analyze,verbose,timing,costs,buffers)
select unnest(pay_retained_num),count(*) from user_retain group by 1;
 QUERY PLAN
--------------------------------------------------------------------------------------------------------------------------------------------
 HashAggregate  (cost=215387.75..379451.57 rows=10000080 width=12) (actual time=18522.487..18524.227 rows=365 loops=1)
   Output: (unnest(pay_retained_num)), count(*)
   Group Key: unnest(user_retain.pay_retained_num)
   Planned Partitions: 64
   Peak Memory Usage: 6193 kB
   Buffers: shared hit=58824
   ->  ProjectSet  (cost=0.00..126324.54 rows=10000080 width=4) (actual time=0.012..6471.708 rows=100000000 loops=1)
         Output: unnest(pay_retained_num)
         Buffers: shared hit=58824
         ->  Seq Scan on public.user_retain  (cost=0.00..68824.08 rows=1000008 width=424) (actual time=0.009..102.847 rows=1000000 loops=1)
               Output: user_id, fst_login_date, pay_retained_num
               Buffers: shared hit=58824
 Planning Time: 0.065 ms
 Execution Time: 18525.843 ms
(14 rows)
```
### Approach 2: flattened storage

Flatten the array into one row per element.

```
CREATE TABLE user_retain1 (
  user_id serial,
  fst_login_date date,
  pay_retained_num int
);

insert into user_retain1 select user_id, fst_login_date, unnest(pay_retained_num) from user_retain;

alter table user_retain1 set (parallel_workers = 26);
```

With 26 parallel workers: 0.8 seconds.
```
max_worker_processes = 32
max_parallel_workers_per_gather = 26
parallel_leader_participation = off
max_parallel_workers = 32
parallel_tuple_cost = 0
parallel_setup_cost = 0
min_parallel_table_scan_size = 0
min_parallel_index_scan_size = 0

explain (analyze,verbose,timing,costs,buffers)
select pay_retained_num,count(*) from user_retain1 group by 1;

 QUERY PLAN
-----------------------------------------------------------------------------------------------------------------------------------------------------
 Finalize HashAggregate  (cost=598284.46..598288.11 rows=365 width=12) (actual time=859.487..859.525 rows=365 loops=1)
   Group Key: pay_retained_num
   Peak Memory Usage: 61 kB
   ->  Gather  (cost=598233.36..598237.01 rows=9490 width=12) (actual time=856.866..858.653 rows=9490 loops=1)
         Workers Planned: 26
         Workers Launched: 26
         ->  Partial HashAggregate  (cost=598233.36..598237.01 rows=365 width=12) (actual time=851.471..851.508 rows=365 loops=26)
               Group Key: pay_retained_num
               Peak Memory Usage: 0 kB
               ->  Parallel Seq Scan on user_retain1  (cost=0.00..579002.57 rows=3846157 width=4) (actual time=0.033..294.534 rows=3846154 loops=26)
 Planning Time: 0.100 ms
 Execution Time: 860.386 ms
(12 rows)
```
### Approach 3: approximate counting with hll

Approximate counting: one row per retention day, writing the UIDs belonging to that retention day into an hll value.

```
create extension hll;

create table t_hll (
  pay_retained_num int primary key,
  u_hll hll
);

insert into t_hll select pay_retained_num, hll_add_agg(hll_hash_integer(user_id)) from user_retain1 group by pay_retained_num;

select pay_retained_num, # u_hll from t_hll order by 1;
```
3 milliseconds.

```
 QUERY PLAN
------------------------------------------------------------------------------------------------------------------------
 Index Scan using t_hll_pkey on t_hll  (cost=0.15..81.53 rows=365 width=12) (actual time=0.020..3.101 rows=365 loops=1)
 Planning Time: 0.050 ms
 Execution Time: 3.121 ms
(3 rows)
```
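Part of what makes hll attractive here is composability. A minimal sketch (the upsert values and the day range are illustrative; hll_empty, hll_union_agg and the || / # operators come from the postgresql-hll extension): per-day hll values can be maintained incrementally as new activity arrives, and unioned to answer broader distinct-count questions without rescanning the detail table.

```
-- fold one new (retention day, user) event into the per-day hll
insert into t_hll values (5, hll_empty() || hll_hash_integer(123456))
on conflict (pay_retained_num)
do update set u_hll = t_hll.u_hll || excluded.u_hll;

-- approximate distinct users retained on any of days 1..7
select # hll_union_agg(u_hll) from t_hll where pay_retained_num between 1 and 7;
```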
The error of the approximate counts:

```
with a as (
  select pay_retained_num, count(*) as cnt from user_retain1 group by 1),
b as (
  select pay_retained_num, # u_hll as cnt from t_hll)
select a.pay_retained_num, a.cnt, b.cnt from a,b where a.pay_retained_num=b.pay_retained_num order by abs(a.cnt-b.cnt);

 pay_retained_num |  cnt   |        cnt
------------------+--------+--------------------
               18 | 273741 | 242072.93394329798
              226 | 273461 |   241358.471066233
              257 | 273317 | 241062.44806733675
              202 | 274096 | 241617.25101208987
              245 | 273489 | 240817.31849724415
              319 | 273263 |   240535.045171427
... ...
```
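If the error shown above is too large for the use case, postgresql-hll can trade storage for accuracy through the type modifier. A sketch (hll(14,5) is an assumed setting, not from the original test; the defaults are hll(11,5), and the parameters are fixed when a value is created):

```
-- more registers (log2m=14 instead of the default 11) lower the relative
-- error at the cost of larger hll values
create table t_hll_precise (
  pay_retained_num int primary key,
  u_hll hll(14,5)
);
```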
#### [Get a free Alibaba Cloud RDS PostgreSQL instance / ECS VM](https://www.aliyun.com/database/postgresqlactivity "57258f76c37864c6e6d23383d05714ea")

#### [digoal's PostgreSQL article index](https://github.com/digoal/blog/blob/master/README.md "22709685feb7cab07d30f30387f0a9ae")

![digoal's weixin](../pic/digoal_weixin.jpg "f7ad92eeba24523fd47a6e1a0e691b59")

202006/20200610_02.md

Lines changed: 84 additions & 0 deletions
@@ -0,0 +1,84 @@
## Recommendation systems, filtering already-read items: reducing heavy CPU and IO waste, part 2

### Author
digoal

### Date
2020-06-10

### Tags
PostgreSQL , recommendation system , offset , waste

----

## Background
A recommendation system:

Users on the order of 1 billion.

Videos on the order of 1 billion.

Videos are pushed to users ordered by weight, filtering out already-read vids (a vid counts as read once it has been delivered to the client). An HLL value records the hashes of read vids, and read status is decided by hash: hll_val || vid_hash <> hll_val means unread.
```
create table t (vid int8, weight float4, ts timestamp);
insert into t select generate_series(1,10000000), random();   -- ts is left null in this test
create index idx_t_1 on t (weight);
```
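For reference, a minimal sketch of the read filter described above (the user_read table, its columns, and the uid literal are illustrative assumptions; hll_hash_bigint and the || operator come from the hll extension):

```
-- hypothetical per-user read set: one hll of vid hashes per user
create table user_read (uid int8 primary key, vids hll default hll_empty());

-- a vid is unread iff appending its hash changes the user's hll
select t.vid
from t, user_read u
where u.uid = 1
  and u.vids || hll_hash_bigint(t.vid) <> u.vids
order by t.weight desc
limit 100;
```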
As the read list grows, a large fraction of the records returned by the weight-descending scan are already read, and a lot of time is wasted on the hll computation. An offset can simulate the cost of that hll filtering, e.g. skipping 200,000 already-read rows:

```
select * from t order by weight desc offset 200000 limit 100;
```

Time: 147.740 ms
Video weights keep changing with tips, views, and so on, so we cannot record a weight watermark to skip past the offset, nor track an offset position with ts combined with weight, because hot vids keep getting hotter.

Every user watches and likes different vids, so there is no uniform precomputation that speeds this up for everyone.
## Optimization idea

Reduce the number of rows the hll read-filter has to examine per query: forcibly "partition" the table with random hash-based partial indexes and query only one partition per request. This deviates slightly from the business requirement, since each query sees only a subset of the records.

Averaged over time, though, as long as a user issues enough requests, the randomization covers all records.

For example, use 20 partial indexes and pick one at random, as in the DO block below.
```
do language plpgsql $$
declare
  sql text;
begin
  for i in 0..19 loop
    sql := format($_$
      create index idx_t_p_%s on t (weight) where mod(abs(hashint8(vid)),20)=%s;
    $_$, i, i);
    execute sql;
  end loop;
end;
$$;
```
The query scope then shrinks to one twentieth. Because the user's total read list is unchanged, the read items falling into any one partition are also about one twentieth of the total, so the offset drops by a factor of 20 and performance improves markedly.

```
select * from t
where mod(abs(hashint8(vid)),20) = 0
order by weight desc offset 10000 limit 100;
```

Time: 12.139 ms
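To spread requests across partitions, the application can pick a random partition number per call. A minimal sketch (the function recommend_one_partition and its parameters are illustrative assumptions; dynamic SQL embeds the partition number as a literal so the planner can match the corresponding partial index):

```
create or replace function recommend_one_partition(lim int, off int)
returns setof t as $$
declare
  p int := floor(random()*20)::int;  -- random partition 0..19
begin
  return query execute format(
    'select * from t where mod(abs(hashint8(vid)),20) = %s
     order by weight desc offset %s limit %s', p, off, lim);
end;
$$ language plpgsql;

-- usage: select * from recommend_one_partition(100, 10000);
```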
## References
[《PostgreSQL: optimizing heavy IO-scan and compute waste - recommendation module, filtering already-recommended items (hot users, very large recommended lists)》](../202006/20200601_01.md)

#### [Get a free Alibaba Cloud RDS PostgreSQL instance / ECS VM](https://www.aliyun.com/database/postgresqlactivity "57258f76c37864c6e6d23383d05714ea")

#### [digoal's PostgreSQL article index](https://github.com/digoal/blog/blob/master/README.md "22709685feb7cab07d30f30387f0a9ae")

![digoal's weixin](../pic/digoal_weixin.jpg "f7ad92eeba24523fd47a6e1a0e691b59")

202006/readme.md

Lines changed: 2 additions & 0 deletions
@@ -2,6 +2,8 @@

 ### Article list
 ----
+##### 20200610_02.md [《Recommendation systems, filtering already-read items: reducing heavy CPU and IO waste, part 2》](20200610_02.md)
+##### 20200610_01.md [《PostgreSQL hll: a general approach to retention and UV statistics》](20200610_01.md)
 ##### 20200609_02.md [《How to extract PostgreSQL's core selling points》](20200609_02.md)
 ##### 20200609_01.md [《A roundup of methods for generating random data in PostgreSQL》](20200609_01.md)
 ##### 20200605_01.md [《PostgreSQL 13 features explained》](20200605_01.md)

README.md

Lines changed: 2 additions & 0 deletions
@@ -54,6 +54,8 @@ digoal's|PostgreSQL|articles|categories

 ### All documents
 ----
+##### 202006/20200610_02.md [《Recommendation systems, filtering already-read items: reducing heavy CPU and IO waste, part 2》](202006/20200610_02.md)
+##### 202006/20200610_01.md [《PostgreSQL hll: a general approach to retention and UV statistics》](202006/20200610_01.md)
 ##### 202006/20200609_02.md [《How to extract PostgreSQL's core selling points》](202006/20200609_02.md)
 ##### 202006/20200609_01.md [《A roundup of methods for generating random data in PostgreSQL》](202006/20200609_01.md)
 ##### 202006/20200605_01.md [《PostgreSQL 13 features explained》](202006/20200605_01.md)

sec/.crypto

Lines changed: 2 additions & 0 deletions
@@ -1,3 +1,5 @@
+https://www.postgresql.org/docs/13/pgcrypto.html#id-1.11.7.34.8
+
 \x

 select pgp_sym_encrypt($_$
