-
Notifications
You must be signed in to change notification settings - Fork 228
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
how to direct connect to broker without proxy #423
Comments
Ingress probably wouldn't make sense for Pulsar brokers, at least for the binary protocol. For the Pulsar Admin API that would be a feasible approach. The http/https protocol could also be used for topic lookups, so it would be sufficient to be used as the "serviceUrl". However, the Pulsar binary protocol would require a different approach. You could use k8s node ports and Pulsar's "advertisedListeners" feature: Another possibility is the SNI proxy feature and use a proxy that supports SNI proxying (for example Apache Traffic server or Nginx): |
It would make sense to have a load balancer for the broker service that is used for lookups since the binary protocol is more efficient than using the REST API for lookups. The individual brokers need to be addressable directly and solving that requires a solution. I'd like to see an experiment for the nodeport + advertisedListeners solution. I guess that would be feasible in cloud managed k8s environments where it is possible to expose a k8s node with a routable address that the client could access. |
One problem with Pulsar Proxy is that it adds multiple cross AZ hops which incur network transfer costs in cloud k8s environments. |
Adding some more context here about the Pulsar Proxy. https://pulsar.apache.org/docs/3.1.x/administration-proxy/ For the "undesirable" part: When the Apache Pulsar PMC was handling the Pulsar Proxy security vulnerability https://pulsar.apache.org/security/CVE-2022-24280/, it was decided to add a notice to https://pulsar.apache.org/docs/3.1.x/administration-proxy/ that the Pulsar Proxy isn't designed to be exposed directly on the public internet: For the "infeasible" part: The direct connection to brokers could be achieved with advertisedListeners and nodeports. It would be great to have a solution where this could be automated. The nodeport solution would require that the node has a routable address from clients. Since individual brokers don't require stable names, it would be sufficient to be able to advertise the node IP and nodeport. Lookups could use the REST API configured with an ingress. There is also the possibility to have a loadbalancer for brokers that is used for lookups since that would be more efficient. Another reason for a proxy like component is for lookups and federating multiple broker clusters into a single large cluster from the client perspective. In Pulsar, there was a component called "pulsar-discovery". This was removed by apache/pulsar#12119 and there's discussion in apache/pulsar#15225 about restoring it. |
Slightly related: The issue #437 describes a current problem with the headless broker service that should be addressed by adding a 2nd cluster ip service for lookups and making the headless broker service use |
Hi @youzipi can you share your manifest for your special ingress created for broker? I need exactly that I think. Thank you |
actually, i use proxy now. this is the ingress config i used that time. apiVersion: networking.k8s.io/v1
kind: Ingress
metadata:
labels:
app: pulsar
cluster: pulsar
environment: alpha
component: broker
annotations:
name: "pulsar-broker"
namespace: pulsar
spec:
rules:
- http:
paths:
- path: /pulsar($|/)(.*)
pathType: ImplementationSpecific
backend:
service:
name: "pulsar-broker"
port:
number: 6650
- path: /pulsar-web($|/)(.*)
pathType: ImplementationSpecific
backend:
service:
name: "pulsar-broker"
port:
number: 8080
host: $your-host-name
--- |
Thanks @youzipi! How are you using the proxy now? Did you find a way to proxy everything through https and not binary tcp? Is it using SNI? |
custom configs:
tcp: ---
apiVersion: v1
kind: Service
metadata:
annotations:
service.beta.kubernetes.io/alicloud-loadbalancer-address-type: intranet
service.beta.kubernetes.io/alicloud-loadbalancer-id: lb-aaa
service.beta.kubernetes.io/alicloud-loadbalancer-force-override-listeners: "true"
service.beta.kubernetes.io/alicloud-loadbalancer-protocol-port: tcp:6651
service.beta.kubernetes.io/alicloud-loadbalancer-network-type: vpc
name: proxy-tcp
namespace: pulsar-new
labels:
app: pulsar
component: proxy
spec:
type: LoadBalancer
ports:
- name: tcp
targetPort: 6651
port: 6651
protocol: TCP
selector:
app: pulsar
component: proxy
... apiVersion: networking.k8s.io/v1
kind: Ingress
metadata:
labels:
app: pulsar
cluster: pulsar
environment: prod
component: broker
annotations:
# nginx.ingress.kubernetes.io/rewrite-target: /
nginx.ingress.kubernetes.io/configuration-snippet: |
rewrite ^/broker-admin/(.*)$ /$1 break;
name: "pulsar-broker"
namespace: pulsar-new
spec:
rules:
- http:
paths:
- path: /broker-admin
pathType: Prefix
backend:
service:
name: "pulsar-broker"
port:
number: 8080
host: $host-name
--- |
i would prefer not to use a proxy.
but i found broker does not have the ingress template.
for now, i deploy an ingress for broker individually.
The text was updated successfully, but these errors were encountered: