|[Date Prev] [Date Next] [Thread Prev] [Thread Next]||Indexes: Main | Date | Thread | Author|
John, (01) Thank you for drawing this important relationship, it certainly resonates with me. Bayesian inference is a "version" of pattern recognition (knowledge pattern recognition) that for me is the kind of thing that Doug is pointing to when he talks about augmenting human intelligence. My own background includes some time in the Operations research department as an OR analyst, so this makes a great deal of sense to me, that you would identify these relationships. (02) I'll take the opportunity to point out the degree of interest that has occured due to the SWITCH interview. It was "slashdotted" which resulted in 14,860 downloads of the transcript of Doug's interview (35 printed pages), in the last week. Not sure how many people actually read the whole thing. (03) Here's the slashdot reference for those interested in seeing the comments - which are all over the map. One post: wondered if it was a hoax, because they couldn't believe that all this was true and they didn't know about Doug until that post. (04) http://slashdot.org/articles/03/04/15/0437224.shtml?tid=189&tid=95 (05) Mei Lin -------Original Message------- From: "John J. Deneen" <email@example.com> Sent: 04/16/03 01:17 PM To: firstname.lastname@example.org Subject: Re: [ba-ohs-talk] We the Networked People.... (06) > > Jack and Mei Lin, (07) I believe I've discovered something very important in the following quotes regarding 1) "We the Networked People" by Jim Downing April 10th (08) 2003. 2) Doug's hyperscope (OHS) "compass", 3) Prof. Pierre Levy's Project - Is a Science of Collective Intelligence Possible? <http://www.collectiveintelligence.info/>, 4) the invitational transcript by SWITCH, 5) the Keynote address at the World Library Summit in Singapore, and 6) Autonomy technology <http://www.autonomy.com/Content/Technology/>: (09) (snip) (010)
"Computers most radically and usefully extend our capabilities when they extend our ability to collaborate to solve problems beyond the compass of any single human mind..” - Englebart 2002
"Wisdom has to do with not only intuiting the long view, understanding systems in the context of their larger whole, but also acting in resonance with what is known as true and lasting. Only wisdom can guide effective decisions on how we invest our attention, both individual and organizational, in the conditions of galloping "complexity multiplied by urgency." - Doug Engelbart
"Starting from this position, the CIG will simulate evolutionary scenarios about the ideas ecosystem of the community. These simulations will give relevant indications on the course to be followed in order to progress - in each different situation - towards an increase in collective intelligence (the "compass" shows "North"), while avoiding the reefs of imbalance and unsustainability."
Engelbart: "That would be interesting but I never have. I would really like to talk to some deep theoretical linguist who has thought about how our sensory perceptual machinery works and how much beyond the terms and words we are using now could that sensory perceptual thing cope with more complex stuff. One thing I think would be really fun is I'm going to take up the Artificial Intelligence guys that have been a plague in my life for all these years and say, "great, you bring me your sentence parser and I am going to have that parser running ahead of a person who is reading and parse the sentences and then give the different parts of speech a different color brightness or maybe actually a diagram." Just to see if you got used to it if it would speed up your comprehension of a successive chain of sentences. It could."
Slayton: "This is similar to one of the claims that PARC made when they did the "future of reading" exhibition at The Tech Museum a few years ago. One of the exhibits that they had was a demonstration of reading that would display one word at a time to you very, very quickly. Through their research they claimed that they had made comprehension levels increase dramatically when the text was taken out of context in that way. What do you think about that?"
Engelbart: "I wish I would have known about that. Can somebody write down a note and give me..."
Slayton: "I'll give you the person that was working on that. I'm not sure if they're there anymore."
Engelbart: "I can track him down."
... "Another difference between innovation at the C level and innovation that is more focused on specific results is that, at the C level, context is tremendously important. We are not trying to solve a specific problem, but, instead, are reaching for insight into a broad class of activities and opportunities for improvement. That means attending to external information as well as to the specifics of the particular work at hand. In fact, in my own work, I have routinely found that when I seem to reach a dead end in my pursuit of a problem, the key is usually to move up a level of abstraction, to look at the more general case." - Doug Engelbart
Autonomy's architecture combines innovative high-performance pattern-matching algorithms with sophisticated contextual analysis and concept extraction to automate the categorization and cross-referencing of information, improve the efficiency of information retrieval and enable the dynamic personalization of digital content.
Autonomy's strength lies in a unique combination of technologies that employs advanced pattern matching techniques (non-linear adaptive digital signal processing), utilizing Bayesian Inference and Claude Shannon's principles of information theory. Autonomy software identifies the patterns that naturally occur in text, based on the usage and frequency of words or terms that correspond to specific ideas or concepts. Based on the preponderance of one pattern over another in a piece of unstructured information, Autonomy enables computers to understand that there is X% of probability that a document in question is about a specific subject. In this way, Autonomy is able to extract a document's digital essence, encode the unique "signature" of the key concepts, then enable a host of operations to be performed on that text, automatically.
The innovative high-performance pattern-matching algorithms that provide the sophisticated contextual analysis and concept extraction, automate the categorization and cross-referencing of information, thereby dynamically improving the efficiency of information retrieval and enabling the dynamic personalisation of digital content. For the first time, computers can now be enabled to automatically form an understanding of a page of text, web pages, e-mails, voice, documents and people and automate operations dependant upon them.
The theoretical underpinnings for Autonomy's approach can be traced back to Thomas Bayes, an 18th century English cleric whose works on mathematical probability were not published until after his death ( "Philosophical Transactions of the Royal Society of London" 1763). Bayes' work centred on calculating the probabilistic relationship between multiple variables and determining the extent to which one variable impacts on another.
A typical problem is to judge how relevant a document is to a given query or agent profile. Bayesian theory aids in this calculation by relating this judgement to details that we already know,such as the model of an agent. More formally, the resulting, "a posteriori" distribution p (. q1/2x ),which is applicable in judging relevance can be given as a function of the known "a priori" models and likelihood
Extensions of the theory go further than relevance information for a given query against a text.
Adaptive probabilistic concept modelling (APCM) analyses correlation between features found in documents relevant to an agent profile, finding new concepts and documents. Concepts important to sets of documents can be determined, allowing new documents to be accurately classified.
Information Theory is the mathematical foundation for all digital communications systems.
Claude Shannon's innovation as described in his "Mathematical Theory of Communication" (1949) was to discover that "information" could be treated as a quantifiable value in communications.
Consider the basic case where the units of communication (for example, words or phrases) are independent of each other. If p i is the probability of the ith unit of communication, the average quantity of information conveyed by a unit, Shannon's entropy or measure of uncertainty is:
This formula reaches its maximum when the probabilities are all equal; in this case the resulting text would be random. If this is not the case the information conveyed by the text will be less than this maximum; in other words there is some redundancy. This result is then extended, by more sophisticated mathematical arguments, to when units are related.
Natural languages contain a high degree of redundancy. A conversation in a noisy room can be understood even when some of the words cannot be heard; the essence of a news article can be obtained by skimming over the text. Information theory provides a framework for extracting the concepts from the redundancy.
Autonomy's approach to concept modelling relies on Shannon's theory that the less frequently a unit of communication occurs, the more information it conveys. Therefore ideas which are more rare within the context of a communication, tend to be more indicative of its meaning. It is this theory which enables Autonomy's software to determine the most important (or informative) concepts within a document.
At the heart of Autonomy's software is the Dynamic Reasoning Engine (DRE™). The DRE™ is based on advanced pattern-matching technology that exploits high-performance probabilistic modelling techniques.
The DRE™ performs the core information operations:
* piece of content refers to a sentence, paragraph or page of text, the body of an e-mail, a record containing human readable information, or the derived contextual information of an audio or speech snippet.
By understanding concepts as expressed within a particular context, Autonomy's technology represents a significant advance beyond simple text searching, collaborative filtering, parsing or manual tagging. As the amount of unstructured text available to users explodes, companies are in greater need of an intelligent infrastructure that can automate the management, personalisation and delivery of information while reducing the level of manual effort required to get the right information to the right people at the right time.
Computers need to be smarter. They need to understand more about the information being communicated, and less about where and how information is stored.
Autonomy looks forward to continuing progress in enabling computers to better understand the meaning of digital information in any form.
I came across this recent post related to Doug’s World Library Summit talk and thought it might be interesting for some on this list. For those of you who weren’t on the list in April last year, Henry van Eyken has posted the entire talk at http://www.fleabyte.org/eic-11.html ) , Thank you for continuing to make it available to everyone, Henry!
We the Networked People
By Jim Downing April 10th 2003
Douglas Engelbart, the man that invented the computer mouse, wrote a paper in 1962 entitled “Augmenting Human Intellect: A Conceptual Framework” Taking Vannevar Bush’s idea of the mind working by association where:
“We refer to a way of life in an integrated domain where hunches, cut-and-try, intangibles, and the human "feel for a situation" usefully co-exist with powerful concepts, streamlined terminology and notation, sophisticated methods, and high-powered electronic aids.”
He calls for man's intellect to be augmented by computers.
“Man's population and gross product are increasing at a considerable rate, but the complexity of his problems grows still faster, and the urgency with which solutions must be found becomes steadily greater in response to the increased rate of activity and the increasingly global nature of that activity. Augmenting man's intellect, in the sense defined above, would warrant full pursuit by an enlightened society if there could be shown a reasonable approach and some plausible benefits. “
In April last year delivering the keynote address to the World Library Summit held in Singapore Engelbart continued his vision for computers to augment what people do, which he has had for over 50 years. At the core of this vision lies the idea that people can make use of computers to work collaboratively together to solve complex problems, something which is going on everyday with people using the internet.
“So, it is not the computer, working alone, that produces a solution. It is the combination of people, augmented by computers. The key word here is "augment." The reason I was interested in interactive computing, even before we knew what that might mean, arose from this conviction that we would be able to solve really difficult problems only through using computers to extend the capability of people to collect information, create knowledge, manipulate and share it, and then to put that knowledge to work. Just as the tractor extends the human's ability to work the earth, and planes extend our ability to move, so does the computer extend our ability to process and use knowledge. And that knowledge production is a group activity, not an individual one. Computers most radically and usefully extend our capabilities when they extend our ability to collaborate to solve problems beyond the compass of any single human mind..”
With the arrival of the Internet it would seem that much of Engelbart's vision has been realized. Yet at it’s core the ideas of his work on "augmentation" remain unrealized. The personal computer has allowed us to work better, but we still work, for the most part, alone. Today's computers and the internet and the World Wide Web are advances, but the challenges still remain. My area of study would be to explore how the internet can be used to increase co-operation amongst people. Cooperation amplification if you like.
….. (MLF: it goes on to reflect on other ideas that extend this line of thought.)
Mei Lin Fung