-
Notifications
You must be signed in to change notification settings - Fork 22
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Check if URL exists and if it does not, turn off forwarding. #703
Comments
A good idea in the abstract. But when would the check take place? At the time of item creation + promotion, it's highly likely that the URL will resolve properly. If the source page goes away, it's likely to happen in the future. So we'd need some mechanism for verifying URL resolution at a time other than initial creation/promotion. Two ideas come to mind:
Regarding option 1, we couldn't do it all in a single cron job, because it would time out with just a couple 404s. A single cron job for each URL is probably easier to build than a batch process, though it could theoretically result in hundreds of separate cron jobs. The real downside here is that the frequency is all wrong: for PF items that receive little to no traffic, you're checking too frequently; while for items that are visited frequently, if the source URL breaks, you might have days or weeks between the breakage and PF's health check. So, option 2 seems a bit better. To ensure 100% coverage, we would have to do a check with every single forward event. For busy PF items, this is too much: it slows down the forwarding process, and could get PF sites banned for creating unwanted traffic. So we probably need a reasonable throttle, say once per 6 or 12 or 24 hours. This still creates the possibility of a gap between the URL going down and PF recognizing the fact, but it'll be fairly minimal. Other considerations:
|
No description provided.
The text was updated successfully, but these errors were encountered: