文本訊息視覺化
視覺化是資料科學的一個重點。
以數值訊息為主的作圖套件有很多,如
ggplot2
,ggvis
,rCharts
,d3Network
。
文字雲 Word cloud
製作簡易
文字雲之外的文字視覺化 Beyond the word cloud
詞泡 Word bubble
詞網 Word Network
詞樹 word tree
線上工具試玩
treemap
of words. check this tutorial
Linguistic Motion Charts
論證視覺化
文本的網路科學
文本的【多奇異點】(polysingularity) 是一種較為神奇的觀點,要處理將動態複雜塞進線性敘事結構的過程。很有量子語言學的味道。
This is a very important process, because it allows expression to be specific (to the particular time and space) and at the same time maintain co-isolated multiplicities (the underlying experience of the text). We call this process polysingularity because it has several possible “solutions” that co-exist simultaneously and yet only one solution is available at each point of time and space for actualization (Gabdulkhaev, 2005; Simonenko, 1965; Boikov, 2000). Polysingularity emerges when our experience meets the commonly accepted notion of linear time. Therefore it’s an expression of a certain purpose from the multitude of simultaneously existing possibilities. The question of what is real gets a totally different aspect when we think of it in terms of polysingularity.
文本可以視為知覺與特定表達目的的介面。有很多的詮解可能同時存在,但一次一個。
18 秒的短期記憶。
將文本表示成圖形 (visual representation of text as a graph) 的直接想法,是把詞當節點,之間的關係作為節點之間的鄰近性。
InfraNodus open-source text to network visualization tool, where the text is scanned twice using 5- and 2-word “windows” that record co-occurrences between the words depending on their proximity to each other in these windows.
有無可能可以藉此看出主題結構 topical structure ? 群組情緒?
字詞的關聯網路可以某個程度揭示 歷史觀
Big data analysis of state of the union remarks changes view of American History
Last updated