Building dictionary from the web
Initially a small collection of "seed" texts are fed to the crawler (a few hundred words of running text have been sufficient in practice). Queries combining words from these texts are generated and...
View ArticleWhy is there a size limit on file uploads?
A potential problem with form processing scripts is that, by default, they attempt to process form POSTings no matter how large they are. A wily hacker could attack your site by sending a huge POST of...
View ArticleConverting HTML to text
I have used the following two scripts and found them to be not very impressive.html2text (Python script)Html2text (Perl script)The Python script converts a HTML page into Markdown (a text-to-HTML...
View ArticleProxy Settings - Enviroment Variables
Proxy can be set up using environment variables automatically for all users by creating the following...
View ArticlePrint system information under Linux
'uname' prints information about the machine and operating system it is run on.% uname -aLinux localhost.localdomain 2.4.20-8 #1 Thu Mar 13 17:18:24 EST 2003 i686 athlon i386 GNU/LinuxUse this command...
View ArticleGetting rid of java.lang.OutOfMemoryError
Most JVMs allocate memory on the heap for almost everything, except for reflective data. That is put in a separate location which is a section of the heap that is reserved for permanent generation....
View ArticleConverting FAT volumes to NTFS
The convert command converts FAT volumes to NTFS. This command is run from the command prompt.ex: convert D: /FS:NTFS- will convert D: from FAT to NTFS
View ArticleSearch for Selected Text
Highlight text in any web page and search for it on Google with the click of a button.Start by creating a new button on your browser's links bar and naming it "Search Google" or something similar.Next,...
View ArticleA Roundup of Free Operating Systems
By David ChisnallWhenever people think of a free operating system (if they do), the odds are that they will think of Linux, a clone of UNIX from 1991. They also might think of one or more of the BSDs,...
View ArticleJava on Ubuntu
Ubuntu has GCJ installed by default. A different version of Java can be installed using the Synaptic Package Manager. Sun's Java 1.5 and 1.6 are available through the package manager.The problem is,...
View ArticleHow to Debug PHP with Vim and XDebug on Linux
XDebug is definitely a wonderful tool for debugging PHP. I tried using Eclipse IDE for remote debugging but it had some version conflicts and so I searched for some other IDE but I didn't imagine that...
View ArticleResponding to DOM Events using Javascript
Responding to events occurring on a web page using javascript is a necessity these days rather than a preference. Look at the various popular websites today and you will realize that. So, I set out to...
View ArticleCustomizing search in Windows XP to index files with unknown file extensions
First of all, why would you want to customize search in Windows XP to index files with unknown file extensions?Assume you have a java file (a file with a .java extension) and when you perform a search,...
View ArticleAccessing ext2 or ext3 file systems on windows
Ext2 or Ext3 file systems are commonly used by the Linux operating system. Support for these file systems is not by default built in to windows operating systems.I remember trying to access windows...
View ArticleGrand Challenges for Engineering
National Academy of Engineering has a special site where in it will publish the grand challenges for engineering for the 21st century. The committee members are selected from varied fields. I am really...
View ArticleSocial application for Orkut
I have checked out Open Social, a common set of APIs which can be used to build social applications that work across multiple websites. I think it is really cool and at the same time not so good.I...
View ArticleSocial Graph API
Social Graph API is something to check out. Here is a blog from Google Code Blog. I haven't checked it out yet but plan to do so soon. I hope it does what it claims to do and does that well.If you are...
View ArticleFew useful linux commands
- To see how long your machine has been running $ uptime- To see some computer stats $ lspci- To see system temperature and battery charge $ acpi -t- To see a list of all running processes $ ps aux- To...
View ArticleConfiguring Bluetooth on Ubuntu
This article on LinuxQuestions.org explains clearly how to configure bluetooth on Ubuntu to send/receive files to/from bluetooth devices and also how to set up bluetooth keyboard and mouse.
View ArticleUse your GPRS phone as modem in Ubuntu
'Bunch of me: Linux Redux: Use your GPRS phone as modem in Ubuntu tells how to do it.
View ArticleIgnoring files in subversion
Two lessons for todayHow to ignore files in subversionHow to create a transcript1. How to ignore files in subversionTo ignore a directory: svn propedit svn:ignore The above command will open an editor,...
View ArticleMove running process to background in Linux
Use the following sequence of keystrokes if you want to move a running process to background in Linux.CTRL+ZbgCTRL+Z will suspend the process and the command prompt will return to you. Type bg and...
View ArticleMySQL: Purge Master Logs
Binary logs sometime take up too much space on the server. I wanted space so I had to delete the database logs. They were taking up around 2GB...
View ArticleI/O Redirection in shell or shell scripting
You will come across this situation when you want to run a process in background. Mostly, you want to capture the output or the errors thrown by the process.As you must be already knowing, three file...
View ArticleBasics: Lucene and Search
Basics about Lucene ranking -- hereSearch Internals by user rainman on wikipediaThis might also be helpful -- a thesis
View Article