<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>Web Admin Blog &#187; Search</title>
	<atom:link href="http://www.webadminblog.com/index.php/tag/search/feed/" rel="self" type="application/rss+xml" />
	<link>http://www.webadminblog.com</link>
	<description>Real Web Admins.  Real World Experience.</description>
	<lastBuildDate>Thu, 22 Jul 2010 16:18:30 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.0.1</generator>
		<item>
		<title>Our Search Implementation In The News!</title>
		<link>http://www.webadminblog.com/index.php/2008/10/07/our-search-implementation-in-the-news/</link>
		<comments>http://www.webadminblog.com/index.php/2008/10/07/our-search-implementation-in-the-news/#comments</comments>
		<pubDate>Tue, 07 Oct 2008 15:33:59 +0000</pubDate>
		<dc:creator>Ernest</dc:creator>
				<category><![CDATA[Enterprise Search]]></category>
		<category><![CDATA[FAST]]></category>
		<category><![CDATA[Search]]></category>

		<guid isPermaLink="false">http://www.webadminblog.com/?p=153</guid>
		<description><![CDATA[InformationWeek did a big story on enterprise search, and used NI as their lead example!  Note all the system info in the article that I fed them. And we're getting a lot of fun out of Graff's quote about how it's easy to sign off on more resources forus, we're including that in every purchase [...]]]></description>
			<content:encoded><![CDATA[<p>InformationWeek did <a href="http://www.informationweek.com/news/business_intelligence/mining/showArticle.jhtml?articleID=210604040" target="_blank">a big story on enterprise search</a>, and used NI as their lead example!  Note all the system info in the article that I fed them. And we're getting a lot of fun out of Graff's quote about how it's easy to sign off on more resources forus, we're including that in every purchase req now. <img src='http://www.webadminblog.com/wp-includes/images/smilies/icon_smile.gif' alt=':-)' class='wp-smiley' /> </p>
<p>One of the reasons that our <a href="http://www.fastsearch.com/" target="_blank">FAST</a> enterprise search program has been so successful here is that the programmers and the Web Admins have worked pretty much 50/50 on the platform.  Also, FAST is a great product and has great support (we're waiting with bated breath to see if Microsoft screws it up; we've been with FAST since way way before they got bought), and we have some very visionary search business folks who saw its potential early on.</p>
<p>Nowadays, search is more than it was considered traditionally.  We have a normal "search box", of course.   But we also run our faceted navigation off search (e.g. our <a href="http://www.ni.com/dataacquisition/" target="_blank">Data Acquisition product line page</a>), pull things like related links and other resources (<a href="http://sine.ni.com/nips/cds/view/p/lang/en/nid/206324" target="_blank">see resources tab on this page</a>).  Search, in many ways, can be used the way people have used databases in the past.  With some metadata added, a search index is kinda like a big database, highly denormalized for speed, focusing on text search.  In fact, I think there's a master's thesis in there somewhere as to when search makes sense vs. when a database makes sense.  Databases make sense with lots of numerical information, but on the Web that's frankly a fringe use case!   On the Web it's all about text, from name/address to links to articles to product info...  When we did things like query related links out of a database table, and I mean an oracle database table on a big ass Solaris box, it was painfully slow.  Pulling from search, it's 15 milliseconds.</p>
<p>As a result, our internal search use is even more killer.  We pull Intranet pages, documents from Notes repositories, data from our Oracle ERP system, files off file shares, etc. all into one place and let people delve through it.  They've even implemented "screens" on top of some of the data (mainly because Oracle ERP is painful to use).  Our entire sales force is gaga over it.</p>
<p>Anyway, so yay to modern search technology, yay to FAST, and yay us!</p>
]]></content:encoded>
			<wfw:commentRss>http://www.webadminblog.com/index.php/2008/10/07/our-search-implementation-in-the-news/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
		<item>
		<title>Two Simple Ways to Read Restricted Website Content</title>
		<link>http://www.webadminblog.com/index.php/2008/08/24/two-simple-ways-to-read-restricted-website-content/</link>
		<comments>http://www.webadminblog.com/index.php/2008/08/24/two-simple-ways-to-read-restricted-website-content/#comments</comments>
		<pubDate>Sun, 24 Aug 2008 18:45:13 +0000</pubDate>
		<dc:creator>Josh</dc:creator>
				<category><![CDATA[Popular]]></category>
		<category><![CDATA[Web Application Security]]></category>
		<category><![CDATA[access]]></category>
		<category><![CDATA[agent]]></category>
		<category><![CDATA[browser]]></category>
		<category><![CDATA[cache]]></category>
		<category><![CDATA[cloaking]]></category>
		<category><![CDATA[content]]></category>
		<category><![CDATA[engine]]></category>
		<category><![CDATA[google]]></category>
		<category><![CDATA[restriced]]></category>
		<category><![CDATA[Search]]></category>
		<category><![CDATA[user]]></category>
		<category><![CDATA[website]]></category>

		<guid isPermaLink="false">http://webadminblog.com/?p=45</guid>
		<description><![CDATA[Have you ever had a problem that you used a search engine to try to find the solution?  Did that search bring you results from a site that then forced you to register in order to see the content?  This happened to me all of the time before I found two simple ways to display [...]]]></description>
			<content:encoded><![CDATA[<p>Have you ever had a problem that you used a search engine to try to find the solution?  Did that search bring you results from a site that then forced you to register in order to see the content?  This happened to me all of the time before I found two simple ways to display that content without me having to register at all.</p>
<p>Let me begin by explaining the why before I tell you the how.  In order for a search engine to index a site's content, it needs to be able to see that content.  The webmasters of that site are eager to let the search engine see the content as they know it will drive additional visitors to their site.  The end result is that they have to find a way for the search engine to see the content, while at the same time obscuring it from the view of the average user.  Most of the time they do this by keying off of the browser's USER AGENT.  This creates a loophole for us to exploit since if Google is able to see the search engine results, then so can we.  Here's my two tricks to see the restricted content:</p>
<p><span id="more-45"></span></p>
<ul>
<li><strong><span style="text-decoration: underline;">Trick #1:</span> Change the user agent!</strong> In Firefox, this can be done rather easily by entering "about:config" as an address in the address bar.  Click the right mouse button to get the context menu and then select New-&gt;String.  Enter the preference name "general.useragent.override".  Now simply enter the name of the new user agent string that you want to use.  If you want to cloak yourself as Google and see what they see, try using the string "Googlebot/2.1 (+http://www.googlebot.com/bot.html)".  You can check the current value by just entering "about:" in the address bar.</li>
<li><strong><span style="text-decoration: underline;">Trick #2:</span> Check Google's cache!</strong> If you don't want to go about mucking with your browser's preferences, there's an even easier trick for you to use.  Go to http://www.google.com and enter the restricted URL, but prepend "cache:" before it.  This allows you to pull the site's indexed content out of Google's cache instead of from the site itself, bypassing the access restrictions entirely.</li>
</ul>
<p>So, there you have it.  Two simple ways to read restricted website content without ever having to register an account with the site.  Enjoy!</p>
]]></content:encoded>
			<wfw:commentRss>http://www.webadminblog.com/index.php/2008/08/24/two-simple-ways-to-read-restricted-website-content/feed/</wfw:commentRss>
		<slash:comments>3</slash:comments>
		</item>
	</channel>
</rss>
