亚洲精品久久久久久一区二区_99re热久久这里只有精品34_久久免费高清视频_一区二区三区不卡在线视频

Home
Letters to Editor
Domestic
World
Business & Trade
Culture & Science
Travel
Society
Government
Opinions
Policy Making in Depth
People
Investment
Life
Books/Reviews
News of This Week
Learning Chinese
Novel Way to Identify Author of Text

That notable quotable is instantly recognizable among people as a line from Shakespeare's Hamlet. But soon, even computers may be able to automatically identify strings of text with their appropriate authors -- and by using a free computer program already available on the Internet.

According to a report to be published in the Physical Review Letters magazine, researchers at La Sapienza University in Rome have found that a computer file compression program called Gzip provided an unusual means of analyzing strings of data.

Typically, computer compression programs such as Gzip shrink large computer files -- text files, for instance -- by searching for repetitive strings of information. By finding and identifying those patterns, the compression program can reduce the original file to a smaller one that contains just the basic "building blocks" of data and instructions on how to use those blocks to recreate the original, larger file.

But Emanuele Caglioti, an associate professor of mathematics at the university and one of the report's authors says that the program's compression process is also the key that helps identify files of unknown data.

When a program such as Gzip shrinks or "zips" a file, "it is learning something about the file," says Caglioti. Specifically, it is learning the file's so-called entropy, or the minimum number of bits needed to encode the file. Files of similar content would share similar entropies since they share the same common "building blocks."

"If you zip a file -- say one composed of English text -- while [the Gzip program] is reading the file, it's learning the statistics of English," says Caglioti. "The more it reads it, the more it can compress it." And adding additional English files wouldn't produce a great change in the file's size since the basic pattern -- its entropy -- is already known.

But, if the second file turns out to be Italian, Caglioti says the process has to start all over again and a new entropy is created. "It has to learn [the] Italian," says Caglioti. And "This effort has a cost in terms of bits. It takes more space to incorporate the Italian file because it's a different language."

And Caglioti and his team of researchers discovered that this same process and principle can be used to "identify' works by author. In their research, the Italian scientists collected 90 texts by 11 Italian authors and in 93 percent of the cases; the method correctly matched small text samples with the authors.

"It's pretty clever what they did," said James Riordon, a physicist with the American Institute of Physics, the group that publishes the Physical Review Letters. "Effectively, it's like you're training someone in a language to identify it."

And Caglioti say that there's no reason to believe that the compression process couldn't be used in other means. "Aside from text recognition, it can be used to compare Web pages and find ones that are similar," he says. In addition to creating a better Web search engine, Caglioti notes, "there is the challenge of biological DNA sequencing." He said genetic researchers have already reported in Bioinformatics of using similar zipper approaches to map the human genome.

Mark Adler, the programmer who co-created Gzip in early 1990 as an alternative to other file compression programs, said he was surprised someone had used his program in such a manner. "It is impressive and a little surprising that simply comparing the length of the compressed output from concatenated known and unknown text provides such high accuracy," he says.

But he remains skeptical that the Italians' research paves the way to foolproof text identifiers -- at least until more studies are done.

"At some point using entropy as a measure may not be fine enough to distinguish between authors with similar styles or use of words and phrases," he says. "I'd wonder how well it would work for author recognition if you tried to distinguish between a thousand authors instead of a dozen."

"Up to now, this is more theoretical than practical," Caglioti conceeds. But he says he and his team will continue to work with the program and see what else turns up. "We ought to try and see where it can work."

(China Daily January 31, 2002)

Copyright ? China Internet Information Center. All Rights Reserved
E-mail: webmaster@china.org.cn Tel: 86-10-68996214/15/16
亚洲精品久久久久久一区二区_99re热久久这里只有精品34_久久免费高清视频_一区二区三区不卡在线视频
欧美日本网站| 亚洲图片在区色| 欧美日韩国产首页在线观看| 亚洲一区欧美激情| 99成人免费视频| 亚洲精品一区二区三区四区高清 | 亚洲福利视频免费观看| 欧美自拍偷拍午夜视频| 久久成人av少妇免费| 欧美与黑人午夜性猛交久久久| 亚洲欧美综合国产精品一区| 性久久久久久久| 欧美一区二区三区喷汁尤物| 午夜日韩视频| 欧美亚洲一区二区在线观看| 欧美一级久久| 久久精品国产亚洲aⅴ| 亚洲第一色中文字幕| 亚洲国产精品一区二区三区| 91久久线看在观草草青青| 亚洲激情电影在线| 99re热精品| 亚洲无线视频| 亚洲伊人色欲综合网| 西瓜成人精品人成网站| 久久er精品视频| 久久久久久欧美| 免费不卡中文字幕视频| 欧美激情一区二区三区全黄 | 夜夜狂射影院欧美极品| 亚洲午夜av在线| 午夜精品亚洲一区二区三区嫩草| 久久www成人_看片免费不卡| 老色批av在线精品| 欧美精品在线一区二区| 国产精品xxxxx| 国产亚洲成av人在线观看导航| 一区二区三区在线高清| 亚洲激情网站| 亚洲私人影院| 欧美一区二区三区四区视频| 亚洲精品久久久久久久久| 亚洲一区二区毛片| 久久国产精品99国产精| 亚洲精品一区二区三区av| 亚洲一区二区三区精品动漫| 久久精品道一区二区三区| 裸体一区二区三区| 欧美日韩一区二区精品| 国产午夜久久久久| 亚洲激情网站免费观看| 亚洲日产国产精品| 国产精品成av人在线视午夜片| 国产精品中文字幕欧美| 在线观看日韩av电影| 亚洲免费成人av电影| 亚洲夜间福利| 亚洲国产日本| 亚洲欧美国产另类| 免费久久99精品国产自在现线| 欧美日韩精品久久久| 国产欧美一级| 亚洲精品视频中文字幕| 亚洲欧美中文在线视频| 亚洲精品视频在线| 欧美在线一二三| 欧美区日韩区| 国产欧美一区二区三区在线看蜜臀| 亚洲第一黄色网| 亚洲永久免费| 99精品视频免费观看| 久久国产99| 欧美日韩国产麻豆| 伊人成人网在线看| 亚洲一区二三| 亚洲人成人77777线观看| 亚洲欧美日韩天堂一区二区| 欧美.com| 国产亚洲亚洲| 亚洲视频香蕉人妖| 日韩视频免费观看高清在线视频| 欧美在线亚洲在线| 欧美午夜精品久久久久免费视 | 在线视频欧美日韩精品| 久久综合伊人77777麻豆| 国产精品久久久久aaaa| 亚洲欧洲精品成人久久奇米网| 久久成人免费| 久久丁香综合五月国产三级网站| 欧美日韩国产系列| 1769国内精品视频在线播放| 欧美一区二区精品在线| 亚洲欧美国产77777| 欧美日韩久久不卡| 91久久国产精品91久久性色| 久久精品论坛| 久久久777| 国产日韩精品久久久| 亚洲午夜激情| 亚洲午夜一区| 欧美日韩高清在线| 亚洲欧洲一区二区三区在线观看| 亚洲国产精品va在线观看黑人| 欧美综合二区| 国产日韩欧美在线视频观看| 亚洲无限av看| 亚洲综合精品四区| 欧美日韩另类视频| 日韩一区二区高清| 一区二区欧美国产| 欧美理论片在线观看| 亚洲六月丁香色婷婷综合久久| 亚洲精品久久久久中文字幕欢迎你| 久久深夜福利免费观看| 狠狠88综合久久久久综合网| 久久国产精品久久久久久| 久久国产欧美精品| 国产亚洲欧美一区二区| 欧美在线亚洲在线| 久久综合亚州| 亚洲国产成人在线| 亚洲精品国产系列| 欧美激情综合五月色丁香| 亚洲欧洲在线一区| aa亚洲婷婷| 欧美日韩亚洲一区二区三区在线观看| 亚洲精品日韩精品| 一区二区三区高清在线观看| 欧美日韩亚洲一区二区| 亚洲一二三区精品| 香蕉久久一区二区不卡无毒影院| 国产伦精品一区二区三区照片91 | 久久av一区| 快射av在线播放一区| 亚洲高清网站| 亚洲另类在线一区| 欧美日韩在线观看一区二区| 一区二区精品在线观看| 亚洲综合三区| 国产麻豆9l精品三级站| 欧美在线亚洲| 欧美黑人在线观看| av成人免费在线| 欧美一级免费视频| 一区在线免费| 中日韩高清电影网| 国产精品亚洲人在线观看| 性欧美大战久久久久久久免费观看 | 欧美日韩ab片| 亚洲一区二区免费| 久久噜噜噜精品国产亚洲综合| 在线观看三级视频欧美| 中文精品视频| 国产欧美精品日韩精品| 亚洲国产免费| 欧美日韩免费观看一区| 亚洲综合丁香| 老司机成人网| 日韩视频在线免费| 久久电影一区| 亚洲欧洲精品一区二区三区| 亚洲在线视频网站| 国产一区在线免费观看| 亚洲乱码国产乱码精品精98午夜 | 久久狠狠久久综合桃花| 亚洲另类春色国产| 99热这里只有成人精品国产| 国产精品乱码久久久久久| 午夜国产精品视频| 欧美成人一品| 亚洲视屏在线播放| 毛片基地黄久久久久久天堂| 一本久久综合亚洲鲁鲁五月天| 久久精品视频在线看| 亚洲人成在线影院| 欧美专区中文字幕| 亚洲日本视频| 欧美在线看片a免费观看| 亚洲欧洲一区二区三区久久| 欧美在线综合| 亚洲精品永久免费| 久久免费视频网| 99国产精品久久| 久久免费视频在线| 亚洲色诱最新| 嫩草成人www欧美| 亚洲一区二区黄| 欧美激情第六页| 欧美在线免费一级片| 欧美无乱码久久久免费午夜一区| 久久国产99| 国产精品五月天| 一本色道婷婷久久欧美| 狠狠入ady亚洲精品| 亚洲欧美视频在线观看视频| 亚洲黄色视屏| 久久久一区二区| 亚洲一区精品电影| 欧美日本亚洲| 亚洲欧洲综合|