Thursday, March 24, 2011

Engineering vs. Liberal Arts: Who’s Right—Bill or Steve?

It’s commonly believed that engineers dominate Silicon Valley and that there is a correlation between the capacity for innovation and an education in mathematics and the sciences. Both assumptions are false.......
My advice to my students—and to my own children—is to study what interests them the most; to excel in fields in which they have the most passion and ability; to change the world in their own way and on their own terms. Once they master their domain, they can find the path to entrepreneurship. They can then come up with creative ways of solving the problems that they have encountered, and apply their ideas to other fields where their knowledge adds value. Maybe they can team up with the hard-core engineers who develop the clunky, inelegant, over-engineered products that Bill is famous for; maybe work with Steve to create the next iPhone or iPad.


Link to the article: http://techcrunch.com/2011/03/21/engineering-vs-liberal-arts-who%E2%80%99s-right%E2%80%94bill-or-steve/

Thursday, March 17, 2011

Digitizing Urdu: Software Will Improve Analysis of Documents, Social Networks in Pakistan's National Language


Computer scientists at the University at Buffalo and at Janya Inc. have developed the first software system that will allow for computational processing of documents in Urdu, Pakistan's national language and one of the world's five most-spoken languages.
View a video about the new software system http://www.youtube.com/watch?v=pxZoHlpTIn8 here.

Wednesday, March 16, 2011

Voice®: Patented speech technology services for websites


Voice® offers speech technology services for websites. The patented technology makes it possible to speak to a web page to introduce information or to control the interaction with the website. By Voice® speech synthesis services make it possible to generate voice messages in real time and allow website contents to be heard through speakers. The key feature is that By Voice® is difficult to distinguish from a real human voice.


Types of business relationship sought by the owner of Voice® (Lingua e-solutions):
• Comercial agreement
• License agreement
• Technical cooperation - further development
• Technical cooperation - testing of new applications
• Technical cooperation - adaptation to specific needs

LINGUA E-SOLUTIONS is a company that specializes in developing applications and services related to linguistic technologies.

If you want to know more about Voice® please visit LINGUA E-SOLUTIONS athttp://tinyurl.com/67l9d53. After registering free at InnoGet* you will find a detailed description and contact details of LINGUA E-SOLUTIONS.

Tuesday, March 8, 2011

Ontology revisited

Employing Hyper-graph or Graph to build the Knowledge Net:

Hypergraph (Formal Definition): A hypergraph G can be defined as a pair (V, E), where V is a set of vertices, and E is a set of hyper-edges between the vertices. Each hyperedge is a set of vertices: E ⊆ {{u, v, ...} ∈ 2V}. (Hyperedges are undirected.)

A Hyper Graph database is a generalized graph of entities. The generalization is two-fold:
  • Links/edges "point to" an arbitrary number of elements instead of just two as in regular graphs.
  • Links can be pointed to by other links as well.


  • A graph can cater the basic ontological problems (e.g. Storing Synonyms, Super-concepts – sub concepts and other basic relations). Using graph instead of traditional relational database to implement the Knowledge Net can provide solutions for the following problems:
  1. Knowledge Net need not be strictly predefined. It can be flexible enough to enhance or    augment later.
  2. Previously extracted Information or processed Text need not be suffered (re-processed) by further enhancement and/or augmentation of Knowledge Net.
  3. Normalization may be reduced enough and hence saving processing time.
  • But a Hypergraph can be utilized to cater more realistic approach in the same area. E.g:

Case 1: If the input Query is something related to Terrorism the system is supposed to search for all the terrorism related incidents including bomb blasts from the processed documents.
In the current system to cater such problem blast have to be declared as a sub-concept or synonym of terrorism though blast is not a terrorism in all cases. So by doing this “blast of a gas cylinder” will also be qualified as terrorism will be displayed as a relevant result which is not correct. Instead if the knowledge is stored like the following then the two different types of blast can be distinguished and results can be filtered properly.

  • Storing Processed Paragraph as an Object or as an equivalent XML Structure in the Database:
  1. A Collection of ISentence (as shown in the following Diagram) Objects can be constructed from the finally processed Paragraphs.
Case 1: If the database of use is SQL Server 2008 or like, then it can be stored directly in the Database.
 
Case 2: Alternatively, since the ISentence Objects can be converted into XML and Vice Versa, the paragraph can be stored as XML String into the Data Base. At the time of retrieval the objects have to be reconstructed again to facilitate the retrieval.

Class Diagram for Sentence Representation
[* DateTimeOfOccurance will be another Property in the IsemanticAction interface]
  • Summarized Benefit:
  1. Better Knowledge Abstraction approach (Flexible, Upgradable and Optimized).
  2. Faster Retrieval 
  3. No need for further reprocessing of extracted information in case of Knowledge Update. 
  4. More Appropriate Retrieval and Relevance ranking. 
  5. Knowledge and information representation in terms of Hypergraph or Graph leads to the road to NLG (Natural language generation) and hence can lead to develop more appropriate Summarizer, Question-Answering System etc.