Tag Archives: Search

Welcome SDR Listeners!

To everyone coming to the blog having heard my commentary on Slashdot Review, here is the white paper (in PDF) I mentioned: Getting the Most from the Internet and the Deep Web.

I wrote this white paper in the summer of 2005 based on my own experience and research doing market and competitive intelligence on the Internet. At some point I would like to revisit the topic and update the document. If you have a search method you would like to recommend please give me a shout at august at augustjackson dot net.

The Competitive Intelligence Podcast is at http://www.cipodcast.com. The RSS URL is here. Comments on the blog and audio comments are always welcome.

The blog post about the State Depertment’s troubling use of Google I mentioned in the commentary is below (posted on 11 December) but also available via this link: State Department Uses Google.

State Department uses Google

In today’s Washington Post there is an article about the State Department using Google to identify Iranians to sanction for their role in developing the country’s nuclear capability. This article is an example of the poor application of Internet searching, and the potential application of the results of said poor searching is a cause for concern. If you’re going to use the Internet as part of your sourcing strategy it is worth your time to learn how to do it right.

Frustrated, the State Department assigned a junior Foreign Service officer to find the names another way — by using Google. Those with the most hits under search terms such as “Iran and nuclear,” three officials said, became targets for international rebuke Friday when a sanctions resolution circulated at the United Nations.

An initial Internet search yielded over 100 names, including dozens of Iranian diplomats who have publicly defended their country’s efforts as intended to produce energy, not bombs, the sources said. The list also included names of Iranians who have spoken with U.N. inspectors or have traveled to Vienna to attend International Atomic Energy Agency meetings about Iran.

The potential for open source secondary research for government intelligence and security research is significant. That potential is only going to realized if you apply appropriate techniques in your searching and then conduct the appropriate analysis to place your results within the context of the situation or subject.

Technorati Tag: Competitive Intelligence

Gada.be a Great Meta RSS and Tag Search

Chris Pirllo has done an excellent job putting together a meta search tool for RSS and tag searching. Gada.be is the name of his offering, and it pulls search results from over 140 sites which offer RSS feeds or APIs. Among the sites are Google News, Yahoo News, Flickr, Technorati, Google Blogs and so on. This is extremely convenient.

Gada.be is easy as all get out to remember, and is even ideal for conducting searches over mobile devices. Even sexier than being a meta search tool is a functionality which allows users to put their search terms into the URL when accessing gada.be. So, instead of going to the page, waiting for it to load and then entering your search term, you can enter the search term directly into the URL. For example:

http://macintosh.gada.be/

This URL will return search results relevant to the word “Macintosh” from each of gade.be’s sources. This saves a lot of time on mobile devices and enables bloggers and others to incorporate standing searches into their blog pages.

On his blog entry on gada.be, Chris offerings the following guidance on syntax:

Those are two different URLs, each with a different set of results. A dot between two keywords implies a quoted statement, whereas a dash implies the AND operator.

At the moment it looks a little bit like the server might be suffering from its own success. Chris comments on his blog that improvements are in the works.

This is a very cool tool for those of us who use RSS and blogs for our research. Many thanks, Chris.

Using Google to Find Spreadsheets

Research Buzz today has a good bit of advice for Internet researchers on how to use Google to find spreadsheets posted on the web. The primary focus of the piece is to use the filetype: operator the search for Microsoft Excel and other spreadsheets. For those not familiar with the filetype operator, it tells Google to look for pages in which the URL ends in “XLS” but does not validate the filetype.

From Research Buzz:

filetype:weatherwax inurl:weatherwax
filetype:feathers inurl:feathers
filetype:hamburger inurl:hamburger
filetype:montypython inurl:montypython

(You can’t use the filetype: syntax alone in a search, but you can work around that by teaming it with the inurl: syntax.)

They left out my favorite companion to the filetype: operator, and that is the site: operator. By combining these two, you can search for all of a particular filetypes on a given web site. For example, the following entry into the Google command bar will search for all of the Microsot Excel documents in the att.com domain which is owned by AT&T.

filetype:xls source:att.com

I have found a TON of great intelligence doing this. It surprises me how careless people can be sometimes making sensitive corporate information available to the public Internet audience. Competitive intelligence professionals would do well to proactively run this and related searches on their competitor pages. Look primarily for Excel, PowerPoint and Word documents for fun and profit. Conversely, an exercise in counterintelligence would be to run these searches on your own domain to make sure no documents are posted to the public Internet which shouldn’t be.

Google via SMS

Anybody of a certain age who knows me well knows that I have a love of mobile text messaging matched only by Japanese teenage girls. So it’s only inevitable that I should love this new feature from Google.

Google have rolled out a new feature that allows users to submit queeries using SMS messaging on their phone. Examples include lookups of telephone numbers in specific cities either by city and state or zip code, measurement conversions, product pricing from Froogle, mathematical calculations and snippets from web sites. The list goes on and on. Sweet sweet sweet!

Three thouhgts that come immediately to mind as great additions to this functionality could be hotspot locations in a given locality, a movie listings feature and perhaps some snippets of the very latest news stories on a given topic.

This is a longshot from the systems integration side of the house, but if the wireless carrier could also send along some location information in case the customer doesn’t know where they’re located that would be sweet and perhaps even make a functionality like directions a possibility. Now, the downside to that could be if you can send the location along with an SMS to Google, significant others could somehow turn that around to discover they’re better half’s text message from a boys’ or girls’ night out is really a cover from a clandestine booty call on the side. I know AT&T Wireless already have a “find your friends” option for GSM customers who can get a general view of where other AT&T Wireless customers are located… I wonder how many people have signed up for this.