It’s a spam technique! I published it for funny. Please don’t use the following tips to create any duplicate websites.
Step 1
Use an software such as HTTrack to download the target website as HTML files.
Step 2
Open all html files in some advance text-editors (e.g. notepad++).
- Replace the regular expression
<!--.*-->to blank. It is to delete the all HTML comments. - After the above replacing, there will be blank line/lines in the HTML files. Don’t worry. Just use the regular expression
\n\rto replace them. - It is almost done. However, the most website contains JavaScript code for Google Analytics or what else tracking code. You may just replace some part of the code with ease if you use the same tracking services. As I don’t really wanna publish the website. So I have replace the tracking code completely here by CTRL+R with the option recurse Rep! in notepad++. Or in CTRL+F model, select the Extended mode, and replace the new line with code \r\n to match all tracking codes. For example:
<script type="text/javascript">\r\nvar gaJsHost = (("https:" == document.location.protocol) ? "https://ssl." : "http://www.");\r\ndocument.write(unescape("%3Cscript src='" + gaJsHost + "google-analytics.com/ga.js' type='text/javascript'%3E%3C/script%3E"));\r\n</script>\r\n<script type="text/javascript">\r\nvar pageTracker = _gat._getTracker("UA-5******-5");\r\npageTracker._trackPageview();\r\n</script> - Don’t forget to replace the official name as well as!