原文地址:http://upstart.bizjournals.com/news/technology/2013/07/26/nerdydata-search-source-code.html?page=all
原文:Beginning today, New York-based NerdyData takes traditional search engines like Google and turns them upside-down. Instead of searching a site's content, NerdyData lets entrepreneurs search their competitors' source code.
"There's so many ways to re-imagine search when you think of it from a different perspective," said 23-year-old co-founder Steve Sonnes, in an interview this afternoon with Upstart Business Journal. "We are a search engine for source code."
Sonnes said search engine optimization pros can use NerdyData to check their links against competitors' links, creating what he described as opportunities to build a brand’s authority; search for keywords within HTML elements; and analyze the CSS, Javascript, and DOM, all used to build a site.
Entrepreneurs can also see which widgets their competitors are using, sites owned by the same Google Analytics account, sites with a certain term in their header tag, and eventually, sites of a particular color.
The company’s custom crawler has visited more than 140 million homepages and collected 6.2 terabytes of HTML, Javascript, and CSS code. They have also designed several search interfaces that allow users to query against the source code of webpages, or in the Enterprise membership, download a list of sites containing a specific term.
“We even offer a search interface specifically for SEO's and marketers that allows you to search within specific HTML tags like meta descriptions and meta keywords,” Sonnes wrote.
He said the site is less than 1 percent done, but didn't want to disclose future features at this early stage of development.
Sonnes and his fellow 23-year-old co-founder David Bielik recently graduated from Stony Brook University in New York where they met as freshman. While building a search engine for domain names they created the web crawler now used to index source code and search it.
“Nobody else was doing it, so we set out to be the first,” wrote Sonnes. “NerdyData took us three months to build and we're still busting our asses trying to get it off the ground.”
Without NerdyData, much of the information is still available to everyday surfers by right-clicking on a website and clicking "Inspect Element." NerdyData just does this across millions of sites simultaneously.
A basic account with 200 searches per month and 10 results per query is free, with a professional account costing $99 per month, and 1,200 searches resulting in up to 5,000 results per query, ad $149 per month for an enterprise account, searching 3,000 times per month and returning 100,000 results per query.
Sonnes and Bielik have bootstrapped their company so far, paying $400 per month per server for five servers. They have 600 users signed up for the basic package, and fewer than 10 paying customers.
"That's monthly revenue," Sonnes said with pride.
以下便是我的翻译:
从今天开始,以纽约为基地的NerdyData采用传统的搜索引擎(如谷歌),并将其颠覆,不像以往的搜索网站的内容的形式,NerdyData让企业家可以搜索到他们竞争者的源代码。
“从不同的角度思考可以得到许多重现搜索的方式,”23岁的联合创始人斯蒂芬桑尼,在与Upstart商业杂志的午后采访中说道,“我们做的是源代码的搜索引擎。”
桑尼说搜索引擎优化的支持者要使用NerdyData来检查他们自己和竞争对手的链接,创造正如他所称的打造品牌权威的机会;在HTML元素中搜索关键字;分析CSS、javascript和DOM等一切可用来建立一个网站的资源。
企业家也可以看到他们的竞争对手在用哪些小部件,哪些网站被同一谷歌分析账号持有,哪些网站被某一团队贴上他们标题标签,最终还能知道哪些网站使用某一特定颜色。
这家公司的爬虫已经访问超过1亿4千万的主页,并收集了6.2TB的HTML,javascript还有CSS的代码。他们还设计出了一些允许用户查询网页源代码或者以公司会员的身份下载包含特点术语的网站的接口。
“我们甚至专门为搜索引擎优化和市场专员提供搜索接口,这样你们便可以在特定的HTML标签下搜索例如元描述和元关键字了。”桑尼写道。
他说这个网站还未完成其百分之一,但是他并不想在开发初期透露其未来的功能。
桑尼和他的同事,同样也是23岁的联合创始人大卫比尔李克刚刚从纽约石溪大学毕业,他们大一的时候在那里遇到了对方。在建立域名的搜索引擎时,他们创造了现在被用来索引和搜索源代码的网页爬虫。
“没有其他人正在做这个,无疑我们是第一个,”桑尼写道,“我们花了三个月的时间将NerdyData建成,今后我们会仍然努力工作尽我们最大的能力让它逐渐走上正轨。”
没有NerdyData,大部分的信息仍然是每天通过网民右键点击网站和点击些“查看元素”来获取。NerdyData只是同时在数百万网站中做到这些。
一个基本账户每个月可以搜索200次,并且每次查询10个结果都是免费的,专业账户花费每月99美元,即可获得1200次查询且每次查询有5000个结果,企业账户可以每月花费149美元做广告,并且获得3000次的搜索以及每次查询可得一万个搜索结果。
桑尼和比尔李克带领着他们的公司至今,对拥有的五台服务器需要每个月每台服务器支付400美元。他们目前拥有600个注册基本账户的用户,不到十个的付费用户。
“这就是我们每个月的收入。”桑尼自豪地说。