Modeling narrative structure and dynamics with networks, sentiment analysis, and topic modeling
Autoři:
Semi Min aff001; Juyong Park aff001
Působiště autorů:
Graduate School of Culture Technology, Korea Advanced Institute of Science & Technology, Daejeon, Republic of Korea
aff001; BK21 Plus Postgraduate Program for Content Science, Daejeon, Republic of Korea
aff002; Sainsbury Laboratory, University of Cambridge, Cambridge, United Kingdom
aff003
Vyšlo v časopise:
PLoS ONE 14(12)
Kategorie:
Research Article
prolekare.web.journal.doi_sk:
https://doi.org/10.1371/journal.pone.0226025
Souhrn
Human communication is invariably executed in the form of a narrative, an account of connected events comprising characters, actions, and settings. A coherent and well-structured narrative is therefore essential for effective communication, confusion caused by a haphazard attempt at storytelling being a common experience. This also suggests that a scientific understanding of how a narrative is formed and delivered is key to understanding human communication and dialog. Here we show that the definition of a narrative lends itself naturally to network-based modeling and analysis, and they can be further enriched by incorporating various text analysis methods from computational linguistics. We model the temporally unfolding nature of narrative as a dynamical growing network of nodes and edges representing characters and interactions, which allows us to characterize the story progression using the network growth pattern. We also introduce the concept of an interaction map between characters based on associated sentiments and topics identified from the text that characterize their relationships explicitly. We demonstrate the methods via application to Victor Hugo’s Les Misérables. Going beyond simple, aggregate occurrence-based methods for narrative representation and analysis, our proposed methods show promise in uncovering its essential nature of a highly complex, dynamic system that reflects the rich structure of human interaction and communication.
Klíčová slova:
Network analysis – Community structure – Emotions – Culture – Built structures – Language – Complex systems – Computational linguistics
Zdroje
1. Michel Jean-Baptiste and Shen Yuan Kui and Aiden Aviva Presser and Veres Adrian and Gray Matthew K and Pickett Joseph P and Hoiberg Dale and Clancy Dan and Norvig Peter and Orwant Jon and others. Quantitative analysis of culture using millions of digitized books. Science. 2011;331(6014):176–182. doi: 10.1126/science.1199644 21163965
2. Project Gutenberg;. https://www.gutenberg.org.
3. Dodds Peter Sheridan and Clark Eric M and Desu Suma and Frank Morgan R and Reagan Andrew J and Williams Jake Ryland and Mitchell Lewis and Harris Kameron Decker and Kloumann Isabel M and Bagrow James P and others. Human language reveals a universal positivity bias. Proceedings of the National Academy of Sciences. 2015;112(8):2389–2394. doi: 10.1073/pnas.1411678112
4. Schich Maximilian and Song Chaoming and Ahn Yong-Yeol and Mirsky Alexander and Martino Mauro and Barabási Albert-László and Helbing Dirk. A network framework of cultural history. Science. 2014;345(6196):558–562. doi: 10.1126/science.1240064 25082701
5. Kim Daniel and Son Seung-Woo and Jeong Hawoong. Large-scale quantitative analysis of painting arts. Scientific reports. 2014;4:7370. doi: 10.1038/srep07370 25501877
6. Lee Byungwhee and Kim Daniel and Sun Seunghye and Jeong Hawoong and Park Juyong. Heterogeneity in chromatic distance in images and characterization of massive painting data set. PLoS ONE. 2018;13(9):e0204430. doi: 10.1371/journal.pone.0204430 30252919
7. Abbott HP. The Cambridge introduction to narrative. Cambridge University Press, Cambridge; 2008.
8. Moretti F. Network theory, plot analysis. New Left Review. 2011;81:80–102.
9. Moretti F. Distant reading. Verso, New York; 2013.
10. Phamplets by Stanford Literary Lab;. https://litlab.stanford.edu/pamphlets/.
11. Box Office Mojo;. http://www.boxofficemojo.com/.
12. The Numbers;. http://www.the-numbers.com/.
13. Newman M. Networks: an introduction. Oxford University Press, New York; 2010.
14. Albert Réka and Barabási Albert-László. Statistical mechanics of complex networks. Reviews of modern physics. 2002;74(1):47. doi: 10.1103/RevModPhys.74.47
15. Easley D, Kleinberg J. Networks, crowds, and markets: Reasoning about a highly connected world. Cambridge University Press, Cambridge; 2010.
16. Han Jiawei and Kamber Micheline and Pei Jian. Data mining: concepts and techniques: concepts and techniques. Elsevier, New York; 2011.
17. Adamic LA, Huberman BA. Power-law distribution of the world wide web. Science. 2000;287(5461):2115–2115. doi: 10.1126/science.287.5461.2115a
18. Albert R, Jeong H, Barabási AL. Internet: Diameter of the world-wide web. Nature. 1999;401(6749):130–131. doi: 10.1038/43601
19. Jeong H, Tombor B, Albert R, Oltvai ZN, Barabási AL. The large-scale organization of metabolic networks. Nature. 2000;407(6804):651–654. doi: 10.1038/35036627 11034217
20. Borgatti SP, Foster PC. The network paradigm in organizational research: A review and typology. Journal of management. 2003;29(6):991–1013. doi: 10.1016/S0149-2063(03)00087-4
21. Grimm Volker and Revilla Eloy and Berger Uta and Jeltsch Florian and Mooij Wolf M and Railsback Steven F and Thulke Hans-Hermann and Weiner Jacob and Wiegand Thorsten and DeAngelis Donald L. Pattern-oriented modeling of agent-based complex systems: lessons from ecology. Science. 2005;310(5750):987–991. doi: 10.1126/science.1116681 16284171
22. Park D, Bae A, Schich M, Park J. Topology and evolution of the network of western classical music composers. EPJ Data Science. 2015;4(1):1–15. doi: 10.1140/epjds/s13688-015-0039-z
23. Bae Arram and Park Doheum and Ahn Yong-Yeol and Park Juyong. The Multi-Scale Network Landscape of Collaboration. PLoS ONE. 2016;11(3):e0151784. doi: 10.1371/journal.pone.0151784 26990088
24. Newman ME, Girvan M. Finding and evaluating community structure in networks. Physical review E. 2004;69(2):026113. doi: 10.1103/PhysRevE.69.026113
25. Elson DK, Dames N, McKeown KR. Extracting social networks from literary fiction. In: Proceedings of the 48th Annual Meeting of the Association for Computational Linguistics. Association for Computational Linguistics; 2010. p. 138–147.
26. Mac Carron P, Kenna R. Universal properties of mythological networks. EPL (Europhysics Letters). 2012;99(2):28002. doi: 10.1209/0295-5075/99/28002
27. Mac Carron P, Kenna R. Network analysis of the Íslendinga sögur–the Sagas of Icelanders. The European Physical Journal B. 2013;86(10):1–9. doi: 10.1140/epjb/e2013-40583-3
28. Kydros D, Notopoulos P, Exarchos G. Homer’s Iliad-A Social Network Analytic Approach. International Journal of Humanities and Arts Computing. 2015;9(1):115–132. doi: 10.3366/ijhac.2015.0141
29. Waumans Michaël C and Nicodème Thibaut and Bersini Hugues. Topology Analysis of Social Networks Extracted from Literature. PloS one. 2015;10(6):e0126470. doi: 10.1371/journal.pone.0126470 26039072
30. Rimmon-Kenan S. Narrative fiction: Contemporary poetics. Routledge, London; 2003.
31. Bal M, Van Boheemen C. Narratology: Introduction to the theory of narrative. University of Toronto Press, Toronto; 2009.
32. Field S. Screenplay: The foundations of screenwriting. Delta, New York; 2007.
33. Vogler C. The Writer’s journey. Michael Wiese Productions, Seattle; 2007.
34. Welsh A. Opening and Closing Les Misérables. Nineteenth-Century Fiction. 1978;33:8–23. doi: 10.2307/2932924
35. Propp V. Morphology of the Folktale. University of Texas Press, Austin, Texas; 2010.
36. Hugo V. Les misérables. vol. 5. Lassalle; 1862.
37. Knuth DE. The Stanford Graphbase. Addison-Wesley; 1993.
38. Tausczik YR, Pennebaker JW. The psychological meaning of words: LIWC and computerized text analysis methods. Journal of language and social psychology. 2010;29(1):24–54. doi: 10.1177/0261927X09351676
39. Gonçalves P, Araújo M, Benevenuto F, Cha M. Comparing and combining sentiment analysis methods. In: Proceedings of the first ACM conference on Online social networks. ACM; 2013. p. 27–38.
40. Jurgens D, Stevens K. The S-Space package: an open source package for word space models. In: Proceedings of the ACL 2010 System Demonstrations. Association for Computational Linguistics; 2010. p. 30–35.
41. Van de Cruys T, Apidianaki M. Latent semantic word sense induction and disambiguation. In: Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies-Volume 1. Association for Computational Linguistics; 2011. p. 1476–1485.
42. Stevens K, Kegelmeyer P, Andrzejewski D, Buttler D. Exploring topic coherence over many models and many topics. In: Proceedings of the 2012 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning. Association for Computational Linguistics; 2012. p. 952–961.
43. Lee DD, Seung HS. Learning the parts of objects by non-negative matrix factorization. Nature. 1999;401(6755):788–791. doi: 10.1038/44565 10548103
44. Lee DD, Seung HS. Algorithms for non-negative matrix factorization. In: Advances in neural information processing systems; 2001. p. 556–562.
45. Xu W, Liu X, Gong Y. Document clustering based on non-negative matrix factorization. In: Proceedings of the 26th annual international ACM SIGIR conference on Research and development in informaion retrieval. ACM; 2003. p. 267–273.
46. Zhao Y, Karypis G. Empirical and theoretical comparisons of selected criterion functions for document clustering. Machine Learning. 2004;55(3):311–331. doi: 10.1023/B:MACH.0000027785.44527.d6
47. Min S, Park J. Network Science and Narratives: Basic Model and Application to Victor Hugo’s Les Misérables. In: Complex Networks VII: Studies in Computational Intelligence. Springer, New York; 2016. p. 257–266.
48. McKee R. Substance, Structure, Style, and the Principles of Screenwriting. HarperCollins, New York; 1997.
Článok vyšiel v časopise
PLOS One
2019 Číslo 12
- Metamizol jako analgetikum první volby: kdy, pro koho, jak a proč?
- Masturbační chování žen v ČR − dotazníková studie
- Nejasný stín na plicích – kazuistika
- Těžké menstruační krvácení může značit poruchu krevní srážlivosti. Jaký management vyšetření a léčby je v takovém případě vhodný?
- Somatizace stresu – typické projevy a možnosti řešení
Najčítanejšie v tomto čísle
- Methylsulfonylmethane increases osteogenesis and regulates the mineralization of the matrix by transglutaminase 2 in SHED cells
- Oregano powder reduces Streptococcus and increases SCFA concentration in a mixed bacterial culture assay
- The characteristic of patulous eustachian tube patients diagnosed by the JOS diagnostic criteria
- Parametric CAD modeling for open source scientific hardware: Comparing OpenSCAD and FreeCAD Python scripts