I've got a great algorithm, now what?


10

I have an algorithm for doing OCR (Optical Character Recognition) in a way that no one has ever done before. I've built rudimentary prototype software and I've tested my algorithm against the existing OCR softwares on the market today and the results were all very positive. I want to turn this algorithm into a business venture and I have some seed money that I'm ready to invest, but I don't know where to begin. I've taught myself c++ and Qt but I have no other programming or project management experience.

I'm thinking about developing a software to sell and distribute online. Outsourcing might be an inexpensive way to go about the software development project (I could even keep the method of the algorithm a secret from my programmers) but I don't know which skills I would need to hire?

Any suggestions? Other ways of doing this? Anything else I should know before venturing down this path?

Outsourcing Seed Funding

asked Apr 12 '10 at 08:08
Blank
Amichai
241 points
  • For anyone else wondering, OCR stands for Optical character recognition – Olivier Lalonde 14 years ago
  • A quick question have you considered using your OCR in CAPTCHA solving (for legitimate reason). Does your algorithm handles many CAPTCHAs? I'm especially asking because in our software we need such solution :) So I think one of the option selling your algorithm as library to software companies like me. – The Dictator 14 years ago
  • I've thought about tackling CAPTCHAs. I'm worried about supporting spammers and I wasn't able to find a good dataset. Any ideas? – Amichai 14 years ago
  • Good to hear you don't want to support spammers trying to avoid CAPTCHAs, but in the long run, how can you prevent it? – Jeff O 14 years ago
  • What's a legitimate reason to have a computer solve a CAPTCHA? – Endolith 13 years ago

7 Answers


9

Phew, that's a hard one. Maybe I'll come back with better ideas later. For now, some initial thoughts:

  • You have written about this algorithm you have, but not much about your own dreams and desires. What do you want? Maybe your invention can lead to a pile of money, but money alone isn't a good reason for becoming an entrepreneur.
  • Absolutely keep your algorithm secret for the time being. And go see a good patent & intellectual property lawyer. In most jurisdictions you cannot patent an algorithm per se, but there are other means of legal protection that might be available to you. Take a hard look at what legal protection you can get, and how much it will cost you in fees. (Keeping secrecy now may be a requirement for obtaining a patent. And if nothing else, it keeps your competitors from copying your algorithm, and keeping your invention as a trade secret may be the final solution you'll end up with.)

About what to do with the algorithm... Maybe it's just me, but I think it's surprisingly hard to say...

  • The market for desktop OCR software is not that big; most users are already invested in one of the existing OCR software systems; and it's not clear to me that current users are looking for a better solution. And from hard-won experience, I can say that a B2C desktop application takes a lot of work to mature. There is always opportunity in an existing market, so don't let this discourage you.
  • Webapps are the new hotness, and a webapp doing OCR isn't a completely horrible idea, but it's not easy either ... big files that need to be uploaded to the server, usability challenges, market risk.
  • You could license your invention to someone. That's often quoted as the entrepreneurs dream, sitting in your living room sipping good wine while your invention is doing all the work for you... But licensing inventions can often fail due to the 'not invented here' syndrome.

Some thoughts about what you could think about:

  1. What would motivate you personally, is there anything in the above or elsewhere that is looking super-exiting to you? Building a company or product is hard work, and if you do it in an area that is motivating for you personally, then you improve your chances considerably.
  2. Without telling us secrets, are there any elements in your technology that naturally point it in a specific direction? Does it excel at uncommon words (scanning texts in small languages, nice business jargon), handwritten text (scanning medical prescriptions?), or is there something else that your tech naturally lends itself to?
answered Apr 12 '10 at 08:51
Blank
Jesper Mortensen
15,292 points

9

I've recently brainstormed up an OCR related technology - here are my notes.

  1. Create an API that third party folks can use - A lot of data is currently being captured at the consumer level and used at Evernote. They are still small and growing rapidly - create a model that allows their users to use your algorithm.
  2. Create an iPhone/Android app that can utilize your algorithm. You are tied to one niche, but that's okay - if your algo is better - as folks such as Evernote grow very intensely, you'll be in the sweet spot for them to acquire.
  3. Adapt your API to work with Dropbox - while Dropbox has yet to release their API yet - their user growth is tremendous (more than 4 million now in less than a year?) Ride that wave.
  4. Legal firms - look up the Discovery market - they deploy a lot of OCR and manual person reading - these guys need solutions that can scale very well. May require a combination of recognition and search - but with third party search coming up - you can probably get away with just concentrating on OCR. Discovery market is fantastic - has big expenses - the only issue may be that they have lots of regulation issues on where the data resides (they are usually in the midst of litigation or lawsuits). If you can provide a rack solution, pretty much ready to rock. Each project easily goes from 100,000 pages to 5 million pages. They tend to employ hundreds of people per project and are now faced with international discovery - which if your OCR can scale to handle multilingual well (bi directional, Chinese, Japanese, Logo, signatures) - you are set.
  5. Google's book scanning project - they bought out the re-captcha folks. If you can build out a proof case and plug it online - you might just get calls.
  6. Jesper had some good questions/suggestions - if it's successful in the medical area - there's so much of the lab process/research portions to investigate in.
  7. Prominent background impression - If your algo can recognize words in photos - run it through on flickr see what emerges. You might just hit on a "trend" list like that of twitter. It's a project that would consume a lot of bandwidth and processing - but that's what cloud computing is for. If you can recognize the trends of things coming up - write a paper, do a case study and call Malcolm Gladwell quick. The fashion market would love this data. Heck, if you can do a logo count (ie Canon vs Nikon) and chart it out - this is very interesting market data.
  8. Take screenshots of websites and measure the words. Now, web crawlers already can recognize text out there - but there are some content that's embedded inside the graphics - which until this idea is available - it's essentially a market that's been largely ignored by search engines. There are folks like CoolIris who are looking at a visual search of things - but it's still missing something - searching on text stuck inside graphics.
  9. Logo count - but on film - How much product placement hours did Coca-cola have last year? Break down films to each frame and then recognize and count. It's a branding research data which advertisers would love to get hold off.
  10. Employee tag recognition on CCTV - It's competing against RFID - but yours is better as its less paranoia inducing.
  11. Sports analysis - you'll have to be passionate about this - but if you are, check out Michael Lewis's book - Moneyball. (Or wait for the movie coming out next year!) Once you can recognize players in videos, photos, you'd get fame measurement, branding, game analysis, et al.

If you're keen to explore further, feel free to contact me.

Alternative business models -

  • Consulting -
The legal firm work would be able to throw a lot of business your way with solutions - if you enjoy the algorithm work and have exception cases floating up to you - then you can work with a group that already services these folks. Talk to KPMG or any of the big accounting firms.
answered Apr 12 '10 at 16:24
Blank
Alex Lam
699 points

1

You may have a product superior to the ones currently on the market, but you you have the resources to out-market the competition?
Is there a single competitor of multiple? Can you sell it to established players? Perhaps you can pitch it to several at the same time and not make a secret of it; that way you may get them bid higher against each other.

answered Apr 13 '10 at 02:43
Blank
Usabilitest
1,698 points

0

I may start by analyzing every OCR software in the market, examine each segment's potential and challenges.

By the end of that, pick one particular segment and build a competing product for it.

The result of your research above will determine the direction you take with project management, costs, outsourcing options, marketing and pretty everything your business will be down the line.

answered Apr 12 '10 at 13:07
Blank
G Rex
683 points

0

OCR software is needed very much for AR.
Imagine someone can take a foto of a chinese sign and your software gives him the translation.

answered Apr 12 '10 at 20:36
Blank
Philobus
116 points

0

You got some great ideas and input so far.

Jasper's idea of building a web app seems like a good way to go.

A web app will extend your market reach and simplify the sign-up/licensing process to use your solution. It will also allow you to test your business model and get feedback from beta users.

If you decide to try this channel out and need to get a prototype built fast to test the idea and show to investors, you might want to take a look at the Agile Platform and the special offers for startups. I work for OutSystems so feel free to DM me if you want to discuss it in more detail.

Hope this helps

answered Apr 12 '10 at 22:15
Blank
Mike
825 points

0

  1. Even though I hate software patents, but - patent your algorithm now, before someone else discovers it.
  2. Aim at licensing your algorithm to big players. Build some free apps for desktop/Web/mobile to popularize it. Stay away from spammers.
  3. Outsourcing the development is OK, but do not expect it to be that cheap. Disclosure: we did create core IP for a handful of startups.
answered Apr 13 '10 at 18:40
Blank
Dmitry Leskov
606 points

Your Answer

  • Bold
  • Italic
  • • Bullets
  • 1. Numbers
  • Quote
Not the answer you're looking for? Ask your own question or browse other questions in these topics:

Outsourcing Seed Funding