I found a link to this application a while ago, but never took the time to check it out. Fortunately, it stuck in the back of my mind that there was a handy application, although even handier that I had saved my bookmarks from an old computer, because I couldn't remember the name, and I couldn't find it with a fair bit of google searching. So, I'll add this post, both to help future searchers, and also to say what a wonderful application it is. I don't know python, but could figure it out enough to fix a login bug due to SourceForge changing their login page. Since I found the bug, I assume that means no one else is using it, though the bug does only apply to new users, so maybe there are a bunch of current users.
ReleaseForge takes the clunky slow interface of sourceforge's file release process, and turns it into a simple couple click operation, even for hundreds of files. For LifeType, we generally re-release files (particularly the _all_templates.zip file) and sourceforge doesn't really like you re-releasing stuff, but wants you to rename it every time. That "feature", as well as some broken advertising javascript on each release page, causes each file release to take at least a couple minutes each. If you could mass delete files, that would help, but they should really just advertise ReleaseForge, and be done with it.
PS. Perhaps Chip is right when he says something needs to change about search engines. I didn't really buy it when I read his post (nor when I worked at a company that worked on natural language processing for search engines), but this example was one that I really couldn't find it because I didn't know word "Release Forge". I tried all sorts of other things in various combinations: easier file upload tool for sourceforge, fileforge, file release system, etc. without any success.
Posted by
Jon Daley on
May 1, 2008, 4:22 pm
| Read 3329 times
Category
Reviews:
[
first]
[
previous]
[
next]
[
newest]
Heh. I wasn't trying to say that search engines needed to change, as much I was saying that people need to learn how to use search engines more effectively (I was approaching the problem as someone with a degree in English, not as a programmer) I remember reading your various posts about the funny things people type into a search engine that brings them to your site.
If it was me looking for "Release Forge" without remembering the exact title, I probably would have used combinations of key words similar to what you did. How did you find it, or re-find it, in the end?
The only way I found it was that I remembered I had saved my Firefox bookmarks from the computer at my last job, and haven't gotten around to copying it to my current computers. I probably should have an online bookmark page instead of copying when I move between computers, though the moving happens so infrequently that I haven't looked into a better solution.
And yes, re-reading your post more carefully I see what you are saying. I saw the "All the Boolean operators in the world won't help if the words you use are vague or you don't use enough words", to mean natural language searching, since that is the sort of thing a former employer would have said. Their idea was interesting enough - if you have a document and you want to find another one like it, just copy and paste the whole paragraph into the search engine. It then calculates vectors of different terms in the paragraph (or whole article) to decide what things you are really looking for. The typical example that they would use to explain why their search was better than keyword searching is maybe you want articles about dogs and not about cats, so you type in "dogs -cats", but then you wouldn't get the article that has pages and pages about dogs, and in one place it says, "dogs are completely different than cats" in a keyword search, but in a vector search, the cat vector would be quite small in those sorts of documents, and so could still possibly match a search where you typed: "I want to learn about dogs, but I don't care about cats" or pasting in your research paper about dogs, etc, etc.
They had one customer, and lasted for a while as a CMU startup, but the funding went away at some point, and I haven't heard of anything particularly similar. Peter has some stuff sort of related, but not for search engines, I think. The Vivisimo search engine is sort of like that, though it is more like clustered keywords. I have used it in the past (and like the name vivisimo better than clusty, though it is harder to spell) and it has always been too slow, though I suspect that it might be better for people who have a harder time typing the right keywords.
I think you can teach an old dog new tricks. I don't know half of what you are talking about here, but I have discovered that I am better than many people at finding things through Google. I don't know why though -- I don't have a particular scheme; I just type in what seems logical to me.
I spent one year at CMU working on cross-lingual information retrieval, where the query was in a different language from the documents being searched. It basically worked by translating the query but with probability weights attached to each term (how likely the word is to translate the one you typed) and carrying those weights through to the language model for each document. It was pretty similar to the vector approach but used a weighted "bag of words" instead.
Hi Jon
Thanks for pointing us to that handy piece of software! I'm uploading release 1.2.8 as I'm writing this. Had no login issues with the version 1.1 I installed from the repositories (apt-get install releaseforge on Ubuntu 7.10 universe)
cheers, reto