Skip to content
February 18, 2006

Filipino Blog Aggregator and Search

Migs is still planning his killer search application purely target on Filipino blogs. While we have a number of sources to get raw data (PinoyBlog & PinoyTopBlogs), there is still some problem with identifying which blogs are by Filipinos, for the Filipinos or from the Philippines (or whatever the unifying criteria).

Until today, we still don’t have a definitive source for searching websites and blogs that are purely Pinoy-related.

While I was updating my Technorati list of Top 100 Philippine blogs, I discovered Technorati can rank blogs that are tagged for certain keywords (currently 20 tags/keywords per blog). Those blogs that have tagged their Technorati profile with the keyword Philippines can be ranked by authority here.

I thought that it would be really good if all Filipino-owned or operated blogs would just include the Philippines keyword in their tags, Technorati could be efficiently used to search for content/topics specific to Filipino blogs alone.

Migs could actually implement this idea as well.

Huawei Ascend Mate & Review


4 Responses to “Filipino Blog Aggregator and Search”

  1. mparaz
    Twitter: mparazgmail.com
    says:

    Thanks for the tip. I’d appreciate more insights on how to catalog Pinoy blogs, and ultimately, memes. I’m looking at the MSN search logs academic research proposal.

    The problem with Technorati is… only savvy bloggers , like those who read here, know about it. So it will do as a starting point but I need to discover more blogs by crawling.

  2. Rizalist says:

    Yeah, I saw the post of migz on this a few weeks ago and I’m interested in helping out with this project. Been studying bots and spiders and SEO stuff like that recently and I was planning to write my own Arachnid. (python+) But then I realized I just really don’t know enough coz everytime i try something i find out there something off the shelf. In fact i spend a lot of time now when i’m not blogging politics (which has recently turn Ugandan, if you know what I mean) at the Google Labs. Their APIs really rock. So lately I’ve been planning other project of a very similar nature, but more based on what others have already done.

    As for memes, migs, i already have software written that’ll take any file of text and create a ranked list of ALL the unique words in it. (it was fun doing it for my Rizalist Press project — am publishing Jorge Bocobo’s 1956 Noli and Fili as a giant blog that people can search, comment on and improve!)

    Next step is to “aggregate” a representative sample of all the blogs that are known in the Filipino blogosphere and make a master list of such terms and with a winnowing algorithm create a master ranked list of memes. (Remember a meme is an infectious idea, like a virus, so there’s lot’s of intersting realtime dynamics here. For example, I’ve been doing a kind of manual tracking of “Proclamation 1017″ — and discovered that Yahoo has been slightly ahead of Google in pushing up this meme to the top of the results list. Don’t know why, but I have a feeling it has to do with Yahoo Mail which got going a lot earlier than gmail and they have those YahooGroups that google really doesnt with its groups.

    I’d be willing to write algorithmic stuff, but am kinda week in the area of crawling itself–don’t understand a lot of the protocols and haven’t done enough real work in the field. You guys are far better equipt.

    Anytime you guys wanna do it. let’s just DO IT!

    Btw Yuga, here is a picture of the author of Ye Olde Philippine Commentary–

    http://ia300225.us.archive.org/3/items/Sunrise_on_Pulag_Peak_1/DJBPULAG.jpg

    He says it’s okay to send it to you altho he’s really very shy in real life and is mostly a recluse. I don’t really get to see him much, except when he’s combing his hair.

  3. Miguel
    Twitter: mparazgmail.com
    says:

    Still working this, folks?

  4. mk says:

    please include this blog in the aggregater.
    malayalamnews service.

Leave a Reply

*
*