The Role of Text Analytics and Information Retrieval in the Accounting Domain

Document Type : Review Paper

Authors

1 Assistant Professor of Accounting Department at Al-Zahra University

2 Ph.D. student of Alzahra University

Abstract

Today, much of the financial information is released through text documents and various groups use the written information to make financial decisions. So far, several studies in the field of analysis and retrieval of accounting documents and texts have been undertaken worldwide which could have an impact on the perception of the users of this information. Given the problems of users in utilizing in financial reports, accessibility and the ability to understand the information and the issues related information retrieval are investigated as the main objectives in this study. Therefore, The present paper examines the present body of literature in two major areas relevant to text analytics and information retrieval in the domain of accounting as follows: 1. the content analysis and readability of accounting narratives, and related text-mining work, and 2. the information retrieval literature stream that addresses the extraction of both text elements as well as quantities imbedded in text from accounting documents. The problem of the lack of a uniform terminology effect of issues about accessibility and ability to understand the information. Finally this study tries to offer a road map for future research in the field of accounting documents and data retrieval that includes 1) the success of standard setters in achieving of their goals, 2) construction of glossary of accounting, 3) identify of new words entered to glossary and 4) standardization of accounting terminology

Keywords

Main Subjects


Abrahamson, E. and E. Amir. 1996. The information content of the president’s letter to shareholders. Journal of Business Finance & Accounting 23 _8_: 1157–1182.
Back, B. J. Toivonen, H. Vanharanta, and A. Visa. 2001. Comparing numerical data and text information from annual reports using self-organizing maps. International Journal of Accounting InformationSystems 2: 249–269.
Belkaoui, A. 1980. The interprofessional linguistic communication of accounting concepts: An experiment in sociolinguistics. Journal of Accounting Research 18 _2_: 362–374.
Beynon, M. J. M. A. Clatworthy, and M. J. Jones. 2005. The prediction of profitability using accounting narratives: A variable-precision rough set approach. Intelligent Systems in Accounting and FinanceManagement 12: 227–242.
Boo, E. and R. Simnett. 2002. The information content of management’s prospective comments in financially distressed companies: A note. Abacus 38 _2_: 280–295.
Borthick, A. F. P. L. Bowen, D. R. Jones, and M. H. K. Tse. 2001. The effects of information request ambiguity and construct incongruence on query development. Decision Support Systems 32: 3–25.
Brennan, N. M. E. Guillamon-Saorin, and A. Pierce. 2009. Impression management: Developing and illustrating a scheme of analysis for narrative disclosures—A methodological note. Accounting, Auditing & Accountability Journal 22 _5_: 789–832.
Burke, J. F. 1997. Report on standards overload. The CPA Journal 66 _3_: 11.
Clatworthy, M. and M. J. Jones and. 2003. Financial reporting of good news and bad news: Evidence from accounting narratives. Accounting and Business Research 33 3: 171–185.
Clatworthy, M. and M. J. Jones and. 2006. Differential patterns of textual characteristics and company performance in the chairman’s statement. Accounting, Auditing & Accountability Journal 19 4: 493–511.
Clatworthy, M. and M. J. Jones. 2001. The effect of thematic structure on the variability of annual report readability. Accounting, Auditing & Accountability Journal 14 3: 311–326.
Cong, Y. A. Kogan, and M. Vasarhelyi. 2007. Extraction of structure and content from the EDGAR database: A template-based approach. Journal of Emerging Technologies in Accounting 4 1: 69–86.
Davis, A. J. Piger, and L. Sedor. 2008. Beyond the numbers: Managers’ use of optimistic and pessimistic tone in earnings press releases. Working paper, University of Oregon.
Demers, E. and C. Vega. 2008. Soft information in earnings announcements: News or noise? Working paper,Board of Governors of the Federal Reserve System.
Ding, C. and P. Chen. 2006. Mining executive compensation data from SEC filings. In Proceedings of the nd International Conference on Data Engineering Workshops, 1–7.
Engelberg, J. 2008. Costly information processing: Evidence from earnings announcements. Working paper, Northwestern University.
Fidel, R. 1991. Searchers’ selection of search keys: II. Controlled vocabulary or free-text searching. Journal of the American Society for Information Science 42 7: 501–514.
Fisher, I. E and M. R. Garnsey and. 2010. Improving information retrieval from accounting documents: A prototype digital thesaurus for employee benefits. Paper presented at the 2010 AAA Midyear Meeting of the Information Systems and the Strategic and Emerging Technologies Sections January, Clearwater, FL
Fisher, I. E and M. R. Garnsey. 2006. The semantics of change as revealed through an examination of financial accounting standards amendments. Journal of Emerging Technologies in Accounting 3 1: 41–59.
Fisher,I. Garnsey,R. Tam,K. (2010). The Role of Text Analytics and Information Retrieval in the Accounting Domain. Journal of emerging technologies in accounting. Vol. 7. pp. 1–24
Gangolly, J. 1995. Some thoughts on the engineering of financial accounting standards. In Artificial Intelligence in Accounting and Auditing III, edited by M. A. Vasarhelyi, pp. 177–190. New York, NY: Markus Weiner Publishing.
Garnsey, M. R A. 2008 preliminary examination of differences in language between U. S. GAAP and international GAAP. Collected Papers of the 17th Annual Research Workshop on Artificial Intelligence and Emerging Technologies in Accounting Auditing and Tax, Anaheim, CA.
Garnsey, M. R A. 2009 comparison of language in the Codification and the FASB and its predecessors pronouncements,paper presented at the AAA Information Systems Section Midyear Meeting, Charleston, SC.
Garnsey, M. R and A. W. Hotaling. 2007. Improving identification of search terms in the FARS database. Review of Business Information Systems. 11 1: 45–55.
Garnsey, M. R and I. E. Fisher. 2008. Appearance of new terms in accounting language: A preliminary examination of accounting pronouncements and financial statements. Journal of Emerging Technologies in Accounting 5 1: 17–36.
Garnsey, M. R. 2006. Automatic classification of financial accounting concepts. Journal of Emerging Technologies in Accounting 3 1: 21–39.
Gerdes, J. 2003. EDGAR-analyzer: Automating the analysis of corporate data contained in the SEC’s EDGAR database. Decision Support Systems 35 1: 7–29.
Goel, S. 2008. Qualitative information in annual reports and the detection of corporate fraud: A natural language processing perspective. Doctoral dissertation, University at Albany–SUNY.
Goldberg, L. 1965. An Inquiry into the Nature of Accounting. Sarasota, FL: AAA.
Ingrid E. Fisher, Margaret R. Garnsey, Sunita Goel, and Kinsun Tam (2010) The Role of Text Analytics and Information Retrieval in the Accounting Domain. Journal of Emerging Technologies in Accounting: December 2010, Vol. 7, No. 1, pp. 1-24.
Jacob, E. K. 1994. Classification and crossdisciplinary communication: Breaching the boundaries imposed by classificatory structure. In Advances in Knowledge Organization, Vol.: Knowledge Organization and Quality Management, edited by H. Albrechtsen, and S. Oernager, 101–108. Frankfurt/Main, Germany: Indeks Verlag.
Jones, M. J –, and P. A. Shoemaker. 1994. Accounting narratives: A review of empirical studies of content and readability. Journal of Accounting Literature 14: 142–184.
Kloptchenko, A. T. Eklund, J. Karlsson, B. Back, J. Vanharanta, and A. Visa. 2004. Combining data and text mining techniques for analyzing financial reports. Intelligent Systems in Accounting and Finance Management 12 1: 29–41.
Leinnemann, C. F. Schlottmann, D. Seese, and T. Stuempert. 2001. Automatic extraction and analysis of The Role of Text Analytics and Information Retrieval 21 Journal of Emerging Technologies in Accounting Volume 7, 2010 American Accounting Association
Li, F. 2006. Do stock market investors understand the risk sentiment of corporate annual reports? Working paper, University of Michigan.
Miller, G. S. 2002. Earnings performance and discretionary disclosure. Journal of Accounting Research 40 1: 173–203.
Muddamalle, M. R. 1998. Natural language versus controlled vocabulary in information retrieval: A case study in soil mechanics. Journal of the American Society for Information Science 49 10: 881–887.
Schroeder, N. and C. Gibson. 1990. Readability of management’s discussion and analysis. Accounting Horizons 4 4: 78–87.
Tetlock, C. M. Saar-Tsechansky, and S. Macskassy. 2008. More than words: Quantifying language to measure firms’ fundamentals. The Journal of Finance 63: 1437–1467.