Home / Web Builder Channel
 LOOK FOR...   WITH KEYWORDS:  

Consumer Watch
On The Money
Career Track
Health Quest
Business
Small Office
Web Builder
Marketing
Classifieds
Credit & Debt
Biz Finance
IR Journal
Legal Forms
Letter Templates
Archives
HOME

S U B S C R I B E

Good To Know

What People Search For
8 Things You Can Do Today To Start Increasing Traffic
Ten Web Basics For Success
John's 15 Search Engine Registration Tips
Designing Your First Web Site?

 

 

SPONSOR LINKS

Web Hosting, Low Cost
Yourname.com, 100 MB of webspace, FTP, Cgi-Bin, 50 POP3 email account, MySQL

Web Hosting, High-end
Managed dedicated servers

Automate Your Email
Use autoresponders to automatically deliver sales information to customers

Web Hosting, Economical
High speed and reliable

Register Your .TV Domain Name
Exclusive registrar of .tv domain names

Back-order Domain Names
Waiting list for domain names that expire

Personalized Domain Names
Meaningful web addresses in your own language

Web Hosting & Domain Registration
Host, transfer, sell, plus other services regarding domain names

Animated .gifs
Thousands of animated clip-art to download

 


PRINT THIS

Search Engines Also Record Private Data

Search engine spiders, the software programs that crawl through millions of webpages each day indexing new and modified pages are finding more than webmasters would like. The spiders will record just about everything they find, including passwords, credit card numbers, classified documents, and other private information that the sites' webmasters never intended to be indexed.

Most popular search engines, such as Google, AltaVista, HotBot, Lycos, and Northern Light, will pick up webpages created in HTML (HyperText Markup Language), ASCII text, and, increasingly, PDF (Adobe's Portable Document Format). Unless documents are secured in protected directories or are included in a "robots.txt" instruction file on the website, the search engine's crawling bots will read the documents and include them in their master index that can then be searched by anyone with access to the Internet.

Recently, webmasters have found that other document formats are showing up in the major search engines: word processor files, spreadsheets, graphics, and other binary files that were posted to websites for easy access by authorized employees.

In most instances when sensitive data turns up the search engine databases it's the fault of an untrained web designer. Webmasters frequently use CGI (Common Gateway Interface) scripts to execute commands behind the scenes of a website. Unless the CGI programmer is aware of potential security vulnerabilities in his script, he may be leaving a gaping hole in the site's security. For example, a CGI script that collects and stores credit card data in an unprotected ASCII (American Standard Code for Information Interchange) file may leave the data open to a search engine's crawler. Using an MySQL database on a separate server and a web-interface such as PHP, both of which are available for free, would add a layer of security to the credit card data that would prevent search engines from locating and indexing the data.

Dave's Opinion

I'm careful to check out online retailers before I enter any private information on their websites. Often, I'll call the retailer and get a feel for how they do business. I often ask to talk to their webmaster and ask about his security practices. A few rules I follow: 1) try to buy only from large retailers, 2) check references for making my first purchase, 3) add my office address as a second shipping address to my credit card, and 4) have all shipments delivered to the office.

And, if you're thinking that the robots.txt fill will solve all your problems, consider this: the robots.txt file will only turn away crawling bots that comply to standards; not all are compliant. Also, the robots.txt file can be a clue to crackers as to which directories may hold the more interesting files.

Creating a secure website takes a bit of knowledge and a bit of skill.

Dave Murphy is founder and membership director of ITrain, the International Association of Information Technology Trainers. ITrain is the global professional society for IT trainers.
Full Author Profile -->


PRINT THIS

 

DEPARTMENTS

CoolWare

Feature Story:

A Must-have Application: WinZip 9 SR-1
Untangled Web

Feature Story:

Some Simple JavaScript Functions


R E C E N T   S T O R I E S

Business Credit
The Layperson's Crash Course in Business Credit
Street-Smart Financing
How to Start or Expand Your Business with Street-Smart Financing
Attract the Perfect Investor
How to Attract the Perfect Investor for Your Business
Federal Help For Your Business
How to Obtain Local, State and Federal Help For Your Business

 

 

InsiderReports

Home  | Affiliate Login  | Search  | Advertise  | Classifieds  | Contact Us  | About Us  | Index
 

The Horizons Unlimited Group

Copyright © 1996-2009 Horizons Unlimited Group. All Rights Reserved.     Privacy Policy | Terms of Use
 


Click to verify BBB accreditation and to see a BBB report.