Skip to content

0xae/trender

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Trender - an exquisite newsfeed

This is an app that aggregates the most recent and popular tweets, the most interesting videos from youtube, what's happening in the stock and crypto markets, the most recent news, activities and every relevant information happening in realtime on the internet. we also feature a live tv that aggregates videos and livestreams of anime, tv shows, news (again), music, films, and also funny, cool and entertaining cartoons. Trender aggregates data from google, facebook, youtube, twitter, wikipedia, blogs, websites, communities, forums, etc and from this abundance of information and media it builds the ultimate media-internet-entertainment-news network.

Simply put: trender aims to be the frontpage of my internet.

The idea is to build something akin to Trending Topics, but with a much richer and faster and cheaper architecture, that runs fine on localhost without aid of the fatso hadoop.
The trender platform crawls, stores and makes use of search technologies, spiders, databases and artificial inteligence to produce ~accurate, diverse and descriptive information on realtime or at least near to realtime about what's happening in the world. it does so by collecting, processing and aggregating data from the various sources.

Motivation

Dropwizard rocks
Master hibernate
Build a simple but cool artificial-inteligence to detect trends
Programming & Fun
Put my Solr and Lucene experience to the top
Learn more about search/realtime technologies 
How far can my thinkpad go ?
A cool app for myself

Architecture

Fast, beautifull and interactive apps, featuring videos, pictures, tweets and news frontend consumption, manifold of consumers/producers, crawlers, spiders, ajax, videos, etc

Trender API (here)

the API allows you to and organize update your data, and querying the system for dynamic information. currently the api is written in java with the dropwizard library and i love it.

Trending/Text Analysis/Search (soon)

search servers, ml & nlp tools. this would be the core of the stuff, in here we want to use SNLP (Sanford Natural Language Processing [Group]) to detect trends using sentiment analysis and some kind of word turnover rate. full text search can be provided via postgres (flexible, fast and convenient) or some other specialized vendor (lucene, elastic, sphinx). the reason we need a full-text search offering it's because it doenst make sense to detect good/bad trends if your searching abilities are poor. so well the search will have some support for Semantic matching, as it's way cool.

that's it

Tasks

  • Natural Language Processing
  • Dropwizard Metrics ?
  • Trending API
  • Crawling API
  • Discovery API
  • Searching API
  • Searching infraestructure optimization (solr)
  • Scrapy crawling infraestructure

Ideas

trending-topics media

use spiders to get media content

live-testing

well, we enjoy the luxury of things happening all the time (crawling activity)
what if we used that to implement some kind of serve->test ( ~live testing ? )

P2P network

bittorrent architecture & checkout tornet

Links

Storage

News

infraestructure

Public Datasets

Artificial intelligence / Machine Learning

Multimedia/Games websites

Gamming platform

API

Blogs

Random Stuff

A Geek amusings

we're up all night to get lucky