Archive

Archive for the ‘Domain Tools Updates’ Category

Who Won the FREE Reverse Whois Report Contest?

January 3rd, 2012 Comments off


The time has come to draw a name out of our Reverse Whois contest entries via SantasFavoriteWhois.comSantasFavoriteWhois.com. And the winner is…Paul Goldstone! His entry read, “Dear Santa, please send me a free reverse lookup report. There’s a cookie in it for you!” Congratulations, Paul!

Honorable mentions go to the following contestants who submitted some great entries:

****#1 (Bob)****

Dear Santa.

I would really appreciate it if you’d give me a Reverse Whois Report (from those nice people at Domains Tools) for Christmas. It would be a big help in maintaining my domain portfolio. You can send it to me over the internet, so you wouldn’t have to make a stop at my house. I’m sure you get enough milk and cookies everywhere else and could use one less stop.

I’ve been very good this year. As I’m sure you know, I’ve been fair and honest in my business dealings and kind to people and dogs.

Thank you for your consideration, Santa.

Your Friend -

Bob

P.S. – I’d prefer a 3 or 4 character dot com, but I know it’s too much to ask for. Maybe having the Reverse Whois Report will help me get one.

P.P.S. – Please be careful of airplanes when you’re flying around.

P.P.P.S. – Please give my best wishes for a Merry Christmas to Mrs. Clause, the reindeer and all the elves.

****#2 (Neil)****

i’ve been good at getting bad guys. LET ME WIN!

Insight Behind our Screenshots.com Launch

December 6th, 2011 Comments off

Today marks the launch of Screenshots.comScreenshots.com, a new DomainTools site that provides an excellent showcase for the millions of historical website thumbnails we’ve collected over the years.

It’s also typical of the kinds of engineering problems that seem relatively straightforward until you try them on web-scale.

Most of us know that browsing the web with Internet Explorer version 7 can be difficult. If you browse carefully you may be able to avoid the problem sites, but sooner or later you’re bound to trip up. Intentionally trying to visit every webpage on the ‘net would be downright silly.

And yet, that’s precisely what we’ve been doing for years to generate the website thumbnails you see on our Whois product. It’s also how we’ve built a database of more than 254,819,641 website screenshots (and counting!).

It’s a messy business aided somewhat by virtualization technologies and a carefully-engineered home built queueing architecture. Yet, it still presents significant engineering challenges and non-obvious business questions.

How do you teach computers to know whether a website has changed “significantly” since you last looked at it so you don’t store a bunch of duplicate images? (Hint: read about perceptual hashes and Hamming codes).

How do you decide how tall of an image to capture? For that matter, how do you capture part of the browser that’s outside the screen?

If you want your screenshot to capture what most people would see when they visit the site, which web browser and operating system do you use?

Most sites are not as OCD about cross-browser support as we are. At one time, IE7 was the best browser to target since it had the broadest support, which is why we selected it as the ‘default thumbnail browser.’  Now, after reviewing our stats, we’re thinking it’s time for an upgrade, maybe even to Firefox or Chrome.

That’s one of many things we’re changing in our thumbnail system–the system which already made Screenshots.com much more than just a bunch of images. Our engineers conceived a nifty tool that discovers interesting domain names mentioned in news feeds and highlights their screenshot on the site’s landing page. They also took several of their latest ideas and experimented with them on the Screenshots.comScreenshots.com search tool. It’s still a work in progress, but you can already use it to reveal interesting insights about a domain (try searching for “hertz” to see what their home page looks like in different TLDs).

We’re also moving quickly to expand our infrastructure, improve our capture rate, and add new servers to support the features we’re planning to add. We already had 20 virtual servers capturing screenshots; soon that number will increase to 40, with more supporting servers coming online shortly thereafter.

Now the fun part begins – we get to hear what you think of it, what your ideas are, and what novel usage patterns you come up with. Send us your feedback to support@screenshots.com or comment here.

Win a FREE Reverse Whois Report from Santa!

December 1st, 2011 Comments off

It’s that fun time of year to start another give away!

We’ve been working with Santa to provide a FREE Reverse Whois Report to one lucky DomainTools fan…

Find out all the details by visiting Santa’s Favorite Whois site.

P.S. You have until January 1st, 2012  to enter the give away by contacting Santa@SantasFavoriteWhois.com about your wish to win the report.

Good Luck and Happy Holidays from DomainTools!

DomainTools Baby Shower

November 11th, 2011 Comments off

Yes, you read the headline right. We had a baby shower  celebration this Thursday at DomainTools in honor of  Jason and his wife Erika – welcoming their baby girl in just a few weeks! We ate some ice cream cake and played a baby-themed game that was an interesting mix of trivia, pictionary and charades! Gustavo impressed us all with his charade skills – check out the footage link, which directs to our YouTube page. Hilarious.

We wish Jason and Erika the best as they embark on this exciting new adventure!

Jason and Erika

Preparing the Ice Cream Cakes

Starting the Baby Game!

We even caught video of Gustavo playing out part of the game – charades-style. Check out the video here, it’s a good excuse to check out our YouTube channel if you haven’t subscribed to the feed yet!

Presents time...

One of the gifts...a onesie with the DomainTools logo on it!

Happy Halloween from DomainTools!

October 31st, 2011 Comments off

In case you haven’t heard about our Halloween pumpkin carving contest last week, we wanted to share some fun photos of the event. We had a great time teaming up to compete for bragging rights (the 2 BEST DomainTools Halloween pumpkins of 2011)—topped off with Halloween cake and a birthday toast to Brian, our office manager!

Pumpkin carving contest in Action!

DomainTools Window Display on 5th Avenue

The Halloween Birthday cake. Happy Birthday, Brian!

You can see all of the party photos on our DomainTools Facebook page and find out which two pumpkins won bragging rights – click here

.
Happy Halloween!!

DomainTools Team Building Mini Golf Event

September 30th, 2011 Comments off

To celebrate September birthdays at DomainTools (Happy Birthday to Susan, Emily, Andrew and Kirsten!), we played some mini golf at the Seattle Interbay Golf Course the other day.  After grabbing some beverages, teams were chosen by the birthday crew. It was a perfect day to play some mini put-put out in the sun!

Prizes were given out to the top 2 teams, including gift certificates to Top Pot Donuts and Barnes and Noble. Andrew and Susan’s teams won – check out the team pictures below!

Jesse, Mike and Jason practicing right before we started the course

Kirsten after she took her shot

Andrew's team nearing the end of the course

Gustavo prepping for his shot

Susan's team near the end of the course (Susan got a hole in one!)

How To Stop Typosquatters from Stealing Your Email

September 19th, 2011 Comments off

Here at DomainTools we’re well aware of the importance companies place on protecting their brands online, especially when it comes to using our tools to hunt down cybersquatting and typosquatting. But typosquatting is not just a problem for brand managers any more, as recent shocking research has shown.

A security company called Godai recently managed to steal 120,000 emails – a huge 20 GB of data, including passwords and trade secrets – simply by registering typo domain names of large global companies.

We’re all familiar with typosquatters registering names such as wwwgoogle.comwwwgoogle.com in order to try to steal traffic from people who mistype www.google.com, and what Godai did was similar. But instead of targeting web traffic, the researchers aimed at domains used for email traffic. They called these domains “Doppelgangers”, but in reality they’re no different to a regular typo.

Some large multinational companies use three-level domains in their employees’ email addresses to show where in the world they are located. A IBM employee might have an @ca.ibm.com address if they’re based in Canada or @mx.ibm.com if they’re based in Mexico, for example.

Godai registered typos of these domains, merging the second and third levels into something like caibm.comcaibm.com, then set up catch-all email accounts and sat back and waited for the typo emails to come flooding in. And flood in they did – the company managed to capture hundreds of passwords, private employee documents and corporate trade secrets.

The researchers found that 30% of the Fortune 100 companies were vulnerable to such attacks, and that many appeared to have already been typosquatted in this way, often by registrants based in China.

With the internet being used increasingly for corporate espionage, it’s important to monitor for typos not only of primary brands, but also of domains used in sensitive internal communications.

DomainTools can help. Our Typo Finder not only generates possible typos that you can then defensively register, it also tells you whether the domains have already been registered and to whom, enabling you to conduct a further investigation or pursue a cybersquatting complaint.

Typo Finder can find domains that look like your own but for a single missing or transposed letter, or proximity-based typing slip-ups such as using an N instead of an M. The kind of typo you want to see in your output is completely configurable through a simple interface.

Combined with other DomainTools services such as Reverse Whois, Name Server Reports and Reverse IP, our Typo Finder is one of the most powerful and cost-effective fraud mitigation tools available. If you’re at all concerned about being at risk from email theft, you should check it out!

Best of DomainTools Support Inquiries – Volume 2

September 15th, 2011 Comments off

Gear Sign OfficeLast January, we posted our first DomainTools Support Inquiry blog update  and it’s time to post a Volume 2!  In case you don’t remember the background around these tickets, here’s a recap:

DomainTools Support receives a wide variety of inquiries — from appropriate account-related questions to inquiries that are not so much related to DomainTools. Please note that we are not making light of these inquiries, but we do find them entertaining and thought you may as well. We still respond to each of these comments and questions, and offer a more appropriate point of contact.

These are real Support tickets, typos and all. It’s been interesting to see what might find its way into our inbox from day to day. We receive inquiries about EVERYTHING and anything you can imagine! Here’s a sampling of some recent ones we got in (phone numbers and names blocked out for privacy):

Inquiry from April 2011:

Plaese stop texting me about girls gone wild. I have girlfriend now. xxx-xxx-xxxx
Thanks


Inquiry #1 from August 2011:

I have been a customer of reebok shoes for many years in chicgo and in fort myers florida now…I wear a size 9 med usa…your shoes don’t fit me anymore,since you change where they are made they are to narrow for me,and fit tight….I buy from the reebok store here in fort myers and he has told me they are haveing many complaints…I still like the old reebks and im wearing my last pair,I bought 4 pairs at a time,now I need to find a diff shoe…in the shoe that fits me I see these numbers…..usa9,RB906PVN6-951742…pLEASE CONTACT ME AT xxx-xxx-xxxx….THANKS FOR YOUR ATTENTION TO THIS MATTER


Inquiry #2 from August 2011:

What is the problem with the lack of stock of the Hazelnut Cafe in Walmart stores in GrandJunction CO 81501 (2 Stores) ??? We have observed a dwindling shelf stock of this item 04348706 over the last 6 months, and the stores have no acceptable answers. Both stores are totally out of this item, and have  been for about 2 weeks. Please get this item back on the shelves ASAP. And a clear concise response from you would be appreciated.

Behind The Scenes with ReverseMX

August 29th, 2011 Comments off
At DomainTools, we often throw around ideas on new and interesting data we could collect and combine with our existing data sets to discover interesting information.

One idea that had been floating around was to look up and store mail server (MX) records for all domain names we know of in the popular top level domains (TLDs). This would enable us to find relationships between domains that share mail servers, show mail server host names which resolve to the same IP address, and compare numbers of domains which use hosted mail services like Google Apps for Domains and Microsoft’s Exchange Online.

In addition to MX data, we also wanted to crawl TXT records of domains to collect SPF rules. We think SPF rules are a good idea, but since they are optional, it’s interesting to know how many domains elect to use them. For example, we can now see how many domains hosted on Google’s mail servers are publishing correct SPF records.

There was a lot of interest at DomainTools for this information and I was given some time to create a proof of concept. This quickly turned into a new DomainTools site called ReverseMX.

To find more details about what MX and SPF records are and how they are used, be sure to check out the  we have added some definitions of terms and FAQs.

ReverseMX was built in two parts. The first is what we call the ‘backend’ work where we built a distributed system to resolve MX DNS records, parse that data into a large Hive table, aggregate it in Hive and combine it with other data sets, and finally build MySQL tables for the website. The ‘frontend’ part of this product is a website powered by Django on top of these custom built tables.

We already use a Hadoop cluster for all sorts of batch data processing. As an experiment for this project, we decided to build a DNS crawler on top of Hadoop. Admittedly, this isn’t the best use of our powerful Hadoop infrastructure, but we had some idle nodes and we decided to use them.

Our DNS crawlers are implemented as a Hadoop map function which takes a domain name and ‘maps’ this to a DNS response containing the MX records for that domain. We use Hadoop streaming so the crawlers are simple Python scripts that take domain names from stdin and write DNS responses to stdout. To get enough throughput from the crawlers, we perform asynchronous DNS requests using the ADNS library and Python module. This worked so well we needed to rate limit our requests so to not put too much load on any DNS servers.

As we are using Hadoop, distributing the crawler across multiple nodes was as simple of running a large input file of domain names through our DNS mapper. The Hadoop streaming utility handles the complicated tasks of splitting the work and distributing it among a set of clusters. We just had to write the Python scripts that would accept a domain name, perform the work on it, and return a result which Hadoop would then write to the file system in it’s native format (HDFS).

To get Hadoop to play nice while working as a crawler, a few extra steps were needed. First, we turned off speculative execution to stop two nodes crawling the same data. The full crawl takes around 40 hours, so we also split the crawl into many Hadoop jobs. We were then able stop scheduling crawl jobs at certain times of the day, as well as enabling other Hadoop jobs to be interleaved with the crawl jobs. Splitting the crawl into multiple jobs also helps if a job ever fails because of network or hardware problems. If we used one job for a full crawl each node would be crawling tens of millions of Domains. If this node failed the full task for that node would need to be re-run.

When the full crawl is completed, this raw DNS data is mapped again using Hadoop through a parser which removes invalid responses and writes the data in a column format ready to be loaded into Hive.

With this data in Hive we can, for example, build a MySQL table of mail servers for the website by querying distinct mail servers along with a cluster-aware auto-increment function for the primary key. The output of these Hive queries is what is loaded into MySQL as a table. For performance, pre-calculated common queries like counts of domains that use a certain mail server are also exported to tables.

The front-end side was a standard Django implementation, although we decided not to use Django’s Models to access our custom built tables. This is the second website we’ve built with the Django framework (DailyChanges was the first) and our engineers have been very happy with it.

ReverseMX has been my pet project which I have really enjoyed building. As a backend engineer, I live for creating and processing huge data sets and building the tools to visualize and display the data to users. If you are a software developer and this sounds like fun, then there’s good news.  We are currently hiring! We have 3 open positions in the engineering department:

Director of Engineering
Python/PHP Engineer
JavaScript Engineer

Lastly, if you have any feedback regarding ReverseMX, feel free to comment on this blog, on our Twitter and Facebook pages, or via email at memberservices@DomainTools.com. Thank you in advance for your feedback!

How to Use Alerts to Get the Scoop on Your Competitors

August 25th, 2011 Comments off

Have you ever read a story on news site or blog about how a well-known company is planning a new product or service, which is based on the domain names it has recently registered? Have you ever wondered how the writer came across their information?

Last week, TechCrunch spotted that Google had become the proud owner of Android.meAndroid.me, for example. Gaming blogs were also filled with the news that Activision had registered over a dozen domains related to possible future games in its Call of Duty franchise. The news that Warner Bros is fighting for the domain TheHangover3.comTheHangover3.com strongly suggests it is planning another movie sequel.

One way to discover this kind of information would be to do a random Whois search every day on the domains you guess a company might want to register. If you have that much time to kill, good luck!

There are quicker ways, fortunately. DomainTools subscribers receive timely data about the companies that interest them, delivered direct to their in-boxes every day, after signing up to one of our suite of domain monitoring tools, such as Registrant Alert.

These tools are not only useful for bloggers or fans of particular brands. If you’re a company in a competitive marketplace, knowing which domain names your rivals are registering or buying could prove to be priceless business intelligence. Registrant Alert quite simply emails you every day with a complete list of the domain names that have just started using your chosen keywords in the Whois record. The alerts cover newly registered domains (such as the Call of Duty domains Activision defensively registered), deleting domains, as well as domains that have changed ownership (such as Android.meAndroid.me).

Registrant Alerts are very easy to set up. If you’ve ever used a Google Alert, it’s just as simple. If you are interested in what Apple has planned, monitoring for “Apple Inc” will alert you whenever the company shows up as the registrant of a domain. Be careful not to be overly broad in your query, if you want to avoid receiving too many false positives.

That’s just one way DomainTools enables you to keep track of what your favorite companies – or your competitors – are doing with domain names. If you are more technically minded, you could use Name Server Alert or one of our other monitoring tools, but I will discuss those in a future post.

Next time, I will look at how companies differ in the timing of their product-related domain name registrations, and why there’s no one-size-fits-all strategy.