Abstract
An inclusive history of the novel during the long 19th century, one involving the demography of novels and novelists, has been widely discussed. A precondition for this kind of historical research is an exhaustive list of novels published during the period. The contribution of this paper are estimates of yearly rates of new novel production between 1837 and 1919 in the British Isles.
You have full access to this open access chapter, Download chapter PDF
Similar content being viewed by others
1 Introduction
An inclusive history of the novel during the long 19th century, one involving the demography of novels and novelists, has been widely discussed. A precondition for this kind of historical research is an exhaustive list of novels published during the period. The contribution of this paper are estimates of yearly rates of new novel production between 1837 and 1919 in the British Isles.Footnote 1 James Raven, Antonia Forster,Footnote 2 Peter Garside, Rainer Schöwerling,Footnote 3 Peter Garside et al.Footnote 4 (hereafter ‘RFGS’) have already provided an exhaustive bibliography covering the years 1770 to 1836. The approach used to estimate novelistic production here is applicable to other geographic and linguistic contexts. Moreover, year-to-year changes in the rate of new novel publication in one region are likely to be informative about how quickly (or slowly) novelistic production is changing in neighboring regions. Yearly estimates of the rate of novelistic production are essential in a variety of projects associated with literary and publishing history. For example, these estimates facilitate bibliographic work as they permit literary historians to assess the completeness of existing bibliographies for a given year.
This contribution is usefully seen as participating in a broader project: a data-intensive, sociologically-inclined literary history seeking to addresses long-standing questions in the history of the novel using digital surrogates of surviving literary works.Footnote 5 Although sociology of literature in the 1950s and 1960s foundered due to a lack of information about writers, works, and related documents—or representative samples thereof—contemporary library-scale digitization, inexpensive computational resources, and widespread sharing of machine-readable datasets permit resuming a variety of research projects on more secure footing.
Before presenting and discussing the new estimates of yearly novelistic production in the British Isles, this paper mentions existing research which stands to benefit from an exhaustive bibliography of the novel in the 19th century.
2 Related Work
Projects likely to benefit from the availability of an exhaustive bibliography of published novels tend to be affiliated with the sociology of literature or sociologically-inclined literary history. Sociologically-inclined literary history has long been associated with inclusionist demands as well as the use of new methods. To the extent that previous research agendas linked to sociology of literature failed or were abandoned due to the lack of reliable information about novelistic production, these agendas merit revisiting for reasons mentioned earlier—notably the relative abundance of book page images and book reviews as well as machine-readable bibliographic and biographical records. These agendas hold out the promise of studying literature at multiple scales and of enlarging the available vocabularies for discussing the history of literature.
A characteristic demand of sociologically-inclined literary history is the demand for an inclusive literary history. The most recent advocate for an inclusive history of the novel is Franco Moretti.Footnote 6 Of the ca. 25,000 new novels published in the British Isles during the 19th century, Moretti estimates only about 200—less than 1%—figure in teaching and research.Footnote 7 The demand for an inclusive approach to literary history and many research questions discussed under the heading of sociology of literature are present in earlier works as well, in particular Daniel Mornet’s work from the 1910s and in Robert Escarpit’s Sociologie de la littérature (1958).Footnote 8 Mornet embraced an inclusive approach to the study of literary taste, explored patterns in reading broadly, and suggested attending to the potential influence of ‘minor’ authors. The research agendas of Escarpit, Mornet and sociologically-inclined literary history in general suffered due to the lack of accessible and trusted bibliographic data.Footnote 9 The resources available to them did not include, needless to say, machine-readable bibliographic records of all surviving 19th-century books much less digital facsimiles of the pages of tens of thousands of surviving works.
The testimony of John SutherlandFootnote 10 to the lack of available bibliographical and biographical details is informative. Sutherland mentions the “sheer unavailability of necessary empirical knowledge” as an obstruction to literary sociology, adding, “[O]ne of the things that makes literary sociology so easy to do at the moment is that we don’t know enough to make it difficult.”Footnote 11 These sentiments echo Eliot’s description of difficulties encountered in his work during the late 1990s.Footnote 12
A second characteristic of sociologically-inclined literary history, tied to the demand for an inclusive literary history, is an openness to new methods. The demand for a diversification of methods—in particular, beyond close reading (“direct textual reading”)Footnote 13—is familiar in Moretti’s work and one linked explicitly to the need to analyze the morphology of the hundreds of thousands of literary works.Footnote 14 The openness to a mixture of methods in sociologically-inclined literary history is easy to distinguish from recent forms of interdisciplinarity in literary studies because, as observers have noted, the latter is sharply constrained: methods are allowed safe passage into literary studies only if they do not use numbers.Footnote 15 Sociologically-inclined literary history, by contrast, entertains the borrowing of survey methods from the social sciences and a range of techniques from statistics, computational linguistics, complex systems, and biological systematics.Footnote 16
3 The Rise of the Text Industry
Between 1840 and 1860 the rate at which new novels were published in the British Isles grew approximately twice as fast as the population of individuals able to read (‘readers’) (Fig. 1).Footnote 17 (The method used to estimate the rate of growth during this period are described in the Methodological Appendix.) Such a development would have required a doubling of numerous processes of interest to literary historians, not least a doubling of the number of publishable manuscripts. Existing narratives of the period do not observe this growth or, to the extent that they do, attribute growth to factors such as changing tastes among readers and clever marketing strategies by publishers. This section argues that factors such as declining unit costs and changes in the industrial organization of publishing deserve to play a leading role in the narrative of the growth in novelistic production in the 19th century.
The pace of growth in new novels outstripped that of book publishing in general which itself was growing faster than the reading population between 1840 and 1860.Footnote 19 Cataloging the yearly production of new titles is an obvious first step towards addressing a range of unanswered questions concerning the material processes by which literature is produced, distributed, marketed, and consumed. As SutherlandFootnote 20 notes, the extent of scholarly ignorance on these matters is near total. For example, nobody knows how many people pursued careers as novelists in the British Isles in the 19th century.Footnote 21 Narrow intervals containing the number of new titles published each year allow us to begin to answer these and related questions.Footnote 22
Between 1840 and 1860, the rate at which new titles appeared grew appreciably. After 1860 growth relative to the reading population is distinctly slower. It is something of a mystery, then, why the rate of new novel publication, adjusted for the reading population, would rise dramatically between 1840 and 1860. Such change involved a variety of other processes of interest: a doubling of manuscripts received, a doubling of paper consumed by publishing books, a doubling of the labor of compositors, and so forth. Given the fixed costs associated with writing or printing a novel, the increase in the number of new novels appearing in the literary market merits a thorough accounting. How was this extra demand (on the publisher’s side) for manuscripts met? Assuming that the demand could not be met entirely by the existing population of novelists, where did the new novelists come from? Did their recruitment diversify the socio-economic or regional background of the existing pool of writers? (Table 1)
Two developments deserve to be part of any account of the increase in the rate at which new novels were being published. These developments have tended not to feature prominently in stories of the growth in new novels during the period or have tended to be equated with other factors such as changing tastes and clever advertising strategies. Both developments would have contributed to a decline in the unit cost associated with publishing a novel. This, in turn, would have led to a higher rate of new novel publication. After describing the two developments, we will discuss how declining unit costs would have likely led to an increased rate of new novel publication.
The first development is declining factor costs associated with the emergence of steam-powered printing and steam-powered paper production. It is clear that the cost of paper and the cost of printing declined tremendously over the 19th century, although precisely when and where the decline occurred needs to be determined.Footnote 24 Steam-power was introduced in the production of paper around 1807 and steam-powered printing was introduced around 1814.Footnote 25 By 1825 half of all paper in England was made by machine.Footnote 26 If paper and printing costs were not already declining by 1830, it seems likely there was at least the expectation of their doing so in the near future. This is consequential because paper and machining costs made up a considerable fraction of the cost of publishing a book.Footnote 27 estimates suggest that in 1850 paper and machining costs, taken together, exceeded the cost of compositing in print run of 1,000.Footnote 28
New and maturing forms of industrial organization are the second factor likely contributing to declining unit costs. Something which increasingly resembles the modern publishing industry emerges between 1840 and 1860.Footnote 29 Associated developments likely contributed to declining unit costs via economies of scale, declining overhead, accounting improvements, and declining cost of capital.
In 1844 regulation was introduced that required companies to post audited balance sheets with the Registrar of Companies, a requirement which did not directly affect publishers, which were typically organized as partnerships, but gives a sense of broader changes in the business environment. WeedonFootnote 30 considers 1844 a turning point, writing that “from this time [1844] on recognizable and systematic accounting systems began to take shape in many publishing houses.”Footnote 31 (Capital was indeed difficult to raise in the 1830s and grew easier over time.Footnote 32) One piece of evidence which would be consistent with an account focused on economies of scale would be evidence suggesting that larger publishers were steadily increasing the number of new novels they published. Since we lack comprehensive bibliographies for the period of interest there is no way to demonstrate this directly. We can, however, show that there is no evidence of any significant decline in concentration among publishers before 1840 (Fig. 2 and 3). We should also recall that measuring the output of a publisher using new novels understates production—and, with it, economies of scale—as many publishers of novels began to reissue previously published novels. And reissuing novels was an activity which grew in significance only after 1830. For example, Richard Bentley, who is among the top four publishers for 1835–1836, published 126 reissued works in the Standard Novels series between 1831 and 1855, 19 of which were published (with Henry Colburn) before 1833.Footnote 33
One well-known anecdote suggests the potential magnitude of efficiency gains between 1840 and 1860. Dickens’ Dombey and Son (1846) and Our Mutual Friend (1865) were printed in similar quantities (30,000 and 32,000 respectively). Adjusting for deflation, the unit cost of Our Mutual Friend was roughly half (58%) of that of Dombey and Son.Footnote 36 If this example is typical, there is little need to search for additional factors contributing to the acceleration observed between 1840 and 1860. Publishing a book simply becomes less expensive.
We should expect to see individual publishers publishing more books if per unit costs fall. Why this should be is not immediately obvious because nothing forces a publisher, given positive expected returns and the opportunity to publish more books at the same cost, to take advantage of the opportunity. In a competitive environment, however, a variety of pressures would favor the publisher which prints the greater number of titles. Since publishing more books would bring with it increased revenue, lower variability in revenue, and economies of scale, firms printing more books would tend to accrue more financial resources, something which tends to be a competitive advantage. Moreover, bringing a greater number of books to market is, by itself, a costly and advantageous signal of quality to readers and to potential writers. Although there’s considerable uncertainty about when and by how much unit costs declined during the period, we should be more confident that publishers who elected to print more rather than fewer titles would be more likely to survive in a competitive environment. In light of this, we should anticipate observing a greater number of novels being published when unit costs decline.
Competing narratives which attribute the increase in the per-reader rate of new novel appearance between 1840 and 1860 to growing demand for novels in the population deserve to be viewed with scepticism. Such accounts suggest that the increase is due to one or both of the following: 1) an increase in the number of novel readers (in excess of what would be expected from a growing reading population) and 2) an intensification of reading among existing novel readers. The precise mechanism proposed varies but can include, for example, publishers’ ‘stimulating demand’ with clever advertisements or typography. It would also include the suggestion that publishers, individually or collectively, managed to appeal to “public taste” in a previously unknown manner which bolstered demand.Footnote 37 An expanding market for books is often simply assumed, although typically without clarifying how fast the expansion is occurring—in particular, if it is expanding faster than the reading population:Footnote 38 is not exceptional in mentioning “[t]he burgeoning mass market of the first half of the nineteenth century.”Footnote 39
One straightforward accounting of the increase in new titles would look to an increase in the number of readers of new novels among the existing population of readers. Such an increase is easy to imagine if we are persuaded that not everyone who can read and afford access to novels does indeed spend time reading novels. The strongest reason to discount this narrative, however, is that time spent on reading is inelastic. Given a finite number of hours in each day, time spent on reading is constrained and, with it, the number of books one is able to read in a given year. That the population of readers might, collectively, increase time spent reading new novels by as much as ten percent between 1840 and 1860 is difficult to credit. And a lateral shift away from non-novel reading to novel reading is incompatible with the evidence: the rate at which all books (non-novels and novels) is also accelerating over much of the period 1840–1860.Footnote 40
Another accounting of the growth in new novels focuses on the improving economic fortunes of the population. Given the considerable expense of a new novel (ca. 21 s 6d) or an annual subscription to a circulating library (ca. 20s)—roughly a week’s wages for an unskilled worker worker in 1850—it is certain that a large fraction of the reading population could not afford access to new novels before 1840. This narrative, however, is implausible because there is no evidence of rising wages until well after the acceleration in the rate of new novel publication. Contrary to expectations about a period firmly within the industrial revolution, typical incomes did not start to rise until after 1830 and then only at a relatively slow pace: an average yearly rate of growth of 0.86% between 1830 and 1860.Footnote 41
The theory that the population of existing novel readers intensified their reading of new novels after 1840 and that this new demand influenced the rate of publication of new novels is difficult to credit for reasons already mentioned. Time available for reading sharply constrains the scope for intensification of novel reading among those who were already avid novel readers in 1840. For those who did not read many novels, increasing the rate at which they read novels more than even ten percent would encounter considerable friction. New novels remained luxury goods and circulating libraries’ subscriptions limited the number of novels which could be borrowed at any given time.
What about the export market for novels? Total book exports do increase during the 19th century but there does not appear to be any particular departure or development of interest prior to or during the 1830s and 1840s.Footnote 42
The rapid growth between 1840 and 1860 in the per-reader rate of new novel publication deserves greater attention than it has received. This period of growth is surprising because it occurs before mass literacy and the general growth in incomes associated with the industrial revolution. It also deserves attention from literary historians because the sustained rate of growth is high enough that it could not have occurred without widely-felt changes in a variety of processes at the heart of the literary market.
Novel writers, in particular, must have felt the consequences of a doubling in the rate of new novel publication. For example, a doubling of the rate of new novels published would require either an intensification of labor by existing writers, a broadening of the population of writers, or some combination of the two. Any of these would likely be associated with substantial and durable changes in the relationship between writers and publishers. For example, an intensification of work or a broadening of the population of writers might have favored certain kinds of writers or the production of certain kinds of novels (e.g., novels in certain genres or with certain formal features) if these novels were easier (in some sense) to produce.
4 Conclusion
The digitization of bibliographic data and surviving novels makes a variety of tasks involved in the study of literary history less time-consuming and less resource intensive. In many cases this development enables research that would otherwise be abandoned as impractical.
With distance from the arrival of—and optimism surrounding—large-scale library digitization projects in the late 2000s, it is perhaps easier to reflect soberly on the necessary supports for doing data-intensive literary history at scale. Without some knowledge of how many novels were published each year—along with related information such as publisher concentration and the demography of novelists—is difficult to make use of digital facsimiles. This paper contributes a description of the growth of novelistic production during the long 19th century, and, in particular, new estimates of the number of novels published each year between 1837 and 1919. These estimates will facilitate current bibliographic work and support future research in sociologically-inclined literary history.
Notes
- 1.
Following Peter Garside/James Raven/Rainer Schöwerling, “General Introduction,” in: Peter Garside, James Raven, Rainer Schöwerling (eds.), The English Novel, 1770–1829: A Bibliographical Survey of Prose Fiction Published in the British Isles, Oxford (2000), novels are defined descriptively: printed works referred to as novels by contemporary readers. As ibid. note, this definitional strategy becomes particularly viable after 1770 as the characteristics of works referred to as novels stabilize. There is, however, a concern about the tendency of this definitional strategy to exclude literary works plausibly characterized as novels. But different definitions of the novel tend to agree on particulars in most cases. And the cases where disagreement is likely to occur tends to be easy to predict. For example, a definition of the novel which tends to include novel-like juvenile fiction and novel-like religious fiction – both excluded by the definition in use in RFGS – would be likely to agree on particulars 90% of the time (Troy Bassett, personal communication, November 9, 2015).
- 2.
James Raven/Antonia Forster, The English Novel, 1770–1829: A Bibliographical Survey of Prose Fiction Published in the British Isles, Vol. 1.: 1770–1799, eds. Peter Garside/James Raven/Rainer Schöwerling, Oxford (2000).
- 3.
Peter Garside/Rainer Schöwerling, The English Novel, 1770–1829: A Bibliographical Survey of Prose Fiction Published in the British Isles, Vol. 2: 1880–1829, eds. Peter Garside, James Raven, Rainer Schöwerling, Oxford (2000).
- 4.
Peter Garside/Anthony Mandal/Verena Ebbes et al., The English Novel, 1830–36: A Bibliographic Survey of Fiction Published in the British Isles, Cardiff/Paderborn (2003).
- 5.
The practical barriers to this genre of research have been slowly disappearing. Before the advent of library-scale digitization in 2002, accessing a representative sample of 19th century novels would have been prohibitively expensive and time-consuming. (Google Inc.’s book digitization project was launched internally at Google in 2002 History of Google Books, https://books.google.com/intl/com/googlebooks/history.html [retrieved on April 25, 2018]) With print runs often in the very low thousands, surviving works exist, often in small numbers and often in special collections, in libraries across North America and Europe. In this setting, the only way to access a book was to be in physical proximity to the work. This situation has improved remarkably in the last decade. The Internet Archive, the most significant institution providing unrestricted access to scans of books from North American libraries, has digitized over 2.9 million books from North American libraries alone and continues to add thousands of scans a month. On August 17, 2016 the Internet Archive had 2,375,270 scans in its North American libraries collection and 531,641 scans in its Canadian Libraries collection (https://archive.org [retrieved on April 25, 2018]).
- 6.
- 7.
Franco Moretti, „The Slaughterhouse of Literature,” in: MLQ: Modern Language Quarterly 61/1 (2000), 207–227, here: 207.
- 8.
- 9.
Robert Darnton, “Reading, Writing, and Publishing in Eighteenth-Century France: A Case Study in the Sociology of Literature,” in: Daedalus 100/1 (1971), 214–256 enumerates the shortcomings of Escarpit’s statistics in Sociologie de la littérature: “To take 937 writers over 410 years [1490–1900] is to spread the sampling pretty thin – an average of 2.3 writers a year. Adding or subtracting a single man could shift the graph by 5% or more, […]” ibid., 217. By the standards of the time, Escarpit’s coverage is impressive. Escarpit covers more than twice as many writers per year as Raymond Williams, who in The Social History of English Writers considers 350 writers born between 1740 and 1920 (so, less than 1 writer a year) Raymond Williams, The Long Revolution, Harmondsworth, Middlesex, 1965, 254. (Williams used the Oxford Introduction to English Literature and the Dictionary of National Biography.)
- 10.
John Sutherland, „Publishing History: A Hole at the Centre of Literary Sociology,” in: Critical Inquiry 14/3 (1988), 574–589.
- 11.
Ibid., 588.
- 12.
Simon Eliot, “Very Necessary but Not Quite Sufficient: A Personal View of Quantitative Analysis in Book History,” in: Book History 5/1 (2002), 283–293, DOI https://doi.org/10.1353/bh.2002.0006.
- 13.
Franco Moretti: “Conjectures on World Literature,” in: New Left Review 1 (2000).
- 14.
Moretti (Ann. 6), 208–209.
- 15.
James F. English, “Everywhere and Nowhere: The Sociology of Literature After ‘the Sociology of Literature’,” in: New Literary History 41/2 (2010), v–xxiii, DOI https://doi.org/10.1353/nlh.2010.0005, xii.
- 16.
- 17.
The rate of publication of new novels grew at twice the rate of growth of the population and roughly twice the rate of the reading public. In 1820 approximately 6.0 new novels were published per million readers. By 1850, 10.5 new novels per million readers were appearing each year.
- 18.
Cross marks indicate known data from RFGS. Points are median estimates and error bars indicate 80% credible intervals. The model used for estimating intervals is described in the Methodological Appendix. UK Population figures are from Angus Maddison (Statistics on World Population, GDP and Per Capita GDP, 1–2008 AD, n.p. 2009). Literacy figures are collected in Gregory Clark (A Farewell to Alms: A Brief Economic History of the World, Princeton 2007). Population figures prior to 1820 are only available for 1800 and 1810. Especially prior to 1820, population and literacy estimates are somewhat unreliable. Adjusting for population and literacy is, however, essential due to the pace and scale of change. Between 1800 and 1900 population likely increased by between 300 and 400%. Basic literacy in 1800 was likely around 61% for men and 42% for women. By 1900 literacy was approaching 100%.
- 19.
The proportion of all editions which were new novels is increasing during this period. Figure 2.3 in Weedon (2003) shows the growth of titles in Publishers’ Circular and NSTC relative to the number of people able to read Alexis Weedon, Victorian Publishing: The Economics of Book Production for a Mass Market, 1836–1916, Aldershot, UK, (2003), 50.
- 20.
Sutherland (Ann. 10).
- 21.
Ibid., 574–575; John Sutherland, Victorian Fiction: Writers, Publishers, Readers. Houndmills, Basingstoke, Hampshire (1995), 151–164.
- 22.
Evidence that accounting for the material conditions of publication can yield useful insights is available in existing research. For example, Lee Erickson, The Economy of Literary Form: English Literature and Theindustrialization of Publishing, 1800–1850, Baltimore (1996), argues declining paper costs contributed to poetry’s decline and John Sutherland, „Chips off the Block: Dickens’s Serialising Imitators,” in: n.e., Dickens and Other Victorians, London (1988), 97–119, DOI https://doi.org/10.1007/978-1-349-19503-9_7 (retrieved on April 25, 2018), argues market forces frustrated imitators of Dickensian serialization.
- 23.
Sources: 1779–1836, RFGS; ‘Novels (model)’ shows the 80 percent credible interval predicted by the model; Nineteenth-Century Short Title Catalog (NSTC) is NSTC (LOCED) from Eliot (1997). The duplicate values in NSTC (LOCED) for the 1850s and 1860s are not typos; the number of titles associated with each decade are the same.
- 24.
Weedon (Ann. 18), 66.
- 25.
Ibid., 64; James Raven, The Business of Books: Booksellers and the English Book Trade, 1450–1850, New Haven, CT, London (2007), 224.
- 26.
Weedon (Ann. 18), 64.
- 27.
Ibid.
- 28.
Ibid., 87.
- 29.
Raven (Ann. 25), 328 f.
- 30.
Weedon (Ann. 18).
- 31.
Ibid., 62.
- 32.
Ibid.
- 33.
Roger P. Wallins, “Richard Bentley,” in: Patricia Anderson, Jonathan Rose (eds.), British Literary Publishing Houses, 1820–1880, Detroit (1991, 39–52), here: 43 f.
- 34.
Calculations rely on manually identifying the publisher using the ‘publication’ field in a random sample of 360 titles published between 1810 and 1836 appearing in RFGS. Error bars (80% credible intervals) reflect uncertainty due to sampling. Yearly rate is calculated by dividing the number of novels published in each period by the number of years in the period.
- 35.
Calculations rely on manually identifying the publisher using the ‘publication’ field in a random sample of 360 titles published between 1810 and 1836 appearing in RFGS. Error bars (80% credible intervals) reflect uncertainty due to sampling.
- 36.
Sutherland (Ann. 10), 19.
- 37.
Raven (Ann. 25), writing about the period leading up to 1840, discusses strategies used to “stimulate demand”, including a growing “sophistication” in booksellers’ advertising tactics ibid., 269–270. Missing from this account is any way to assess the relative importance of these tactics in the growing sales of books. Were these tactics essential or marginally important? Absent booksellers’ stoking of demand, would sales of novels have grown at half the rate that they did? It seems possible that in a competitive environment improvements in typography and product design might have had zero influence on overall demand. That is, if one firm begins advertising, competitors will face pressure to do so as well even if the advertising has a negligible (or negative) effect on the overall quantity of goods demanded. Here advertising only potentially influences which firms receive business – not how much business they receive.
- 38.
Ibid.
- 39.
Ibid., 334.
- 40.
Weedon (Ann. 18), 50.
- 41.
Robert C. Allen., “Engels’ Pause: Technical Change, Capital Accumulation, and Inequality in the British Industrial Revolution,” in: Explorations in Economic History 46/4 (2009), 418–435, DOI https://doi.org/10.1016/j.eeh.2009.04.004.
- 42.
Weedon (Ann. 18), 40 f.
- 43.
Raven/Forster (Ann. 2); Garside/Schöwerling (Ann. 3).
- 44.
Garside/Mandal/Ebbes et al. (Ann. 4).
- 45.
Ibid.; Raven/Forster (Ann. 2); Garside/Schöwerling (Ann. 3).
- 46.
Simon Eliot, “‘Patterns and Trends’ and the ‘NSTC’: Some Initial Observations. Part One,” in: Publishing History; Cambridge 42 (1997), 79–104.
- 47.
Ibid., 86.
- 48.
Simon Eliot, Some Patterns and Trends in British Publishing, 1800–1919, London (1994).
- 49.
Paul H. Garthwaite/Joseph B. Kadane/Anthony O’Hagan, “Statistical Methods for Eliciting Probability Distributions,” in: Journal of the American Statistical Association 100/470 (2005), 680–700, DOI https://doi.org/10.2307/27590587.
- 50.
At the Circulating Library: A Database of Victorian Fiction, 1837–1901, http://www.victorianresearch.org/atcl/ (retrieved on April 25 2018).
- 51.
Ellen Miller Casey, “Edging Women out?: Reviews of Women Novelists in the Athenaeum, 1860–1900,” in: Victorian Studies 39/2 (1996), 151–171.
- 52.
Andrew Gelman/John B Carlin/Hal S. Stern et al., Bayesian Data Analysis, Boca Raton (32013), 501–516.
- 53.
If \(Y\) is distributed according to a \(NegativeBinomial_{2} \left( {\mu ,\phi } \right)\) distribution then \(E\left( Y \right) = \mu\) and \(Var\left( Y \right) = \mu + \frac{{\mu^{2} }}{\phi }\).
- 54.
Stan: A C + + Library for Probability and Sampling, Version 2.16. O. O. 2017, DOI https://doi.org/10.5281/zenodo.814234.
- 55.
All digital references were last viewed on April 25, 2018.
References
All digital references were last viewed on April 25, 2018.
Allen, Robert C., “Engels’ Pause: Technical Change, Capital Accumulation, and Inequality in the British Industrial Revolution,” in: Explorations in Economic History 46/4 (2009), 418–435, DOI https://doi.org/10.1016/j.eeh.2009.04.004.
Miller Casey, Ellen, “Edging Women out?: Reviews of Women Novelists in the Athenaeum, 1860-1900,” in: Victorian Studies 39/2 (1996), 151–171.
Clark, Gregory, A Farewell to Alms: A Brief Economic History of the World, Princeton 2007.
Darnton, Robert, “Reading, Writing, and Publishing in Eighteenth-Century France: A Case Study in the Sociology of Literature,” in: Daedalus 100/1 (1971), 214–256.
Eliot, Simon, “‘Patterns and Trends’ and the ‘NSTC’: Some Initial Observations. Part One,” in: Publishing History; Cambridge 42 (1997), 79–104.
Eliot, Simon, Some Patterns and Trends in British Publishing, 1800–1919, London 1994.
Eliot, Simon, “Very Necessary but Not Quite Sufficient: A Personal View of Quantitative Analysis in Book History,” in: Book History 5/1 (2002), 283–293, DOI https://doi.org/10.1353/bh.2002.0006.
English James F., “Everywhere and Nowhere: The Sociology of Literature After ‘the Sociology of Literature’,” in: New Literary History 41/2 (2010), v–xxiii, DOI https://doi.org/10.1353/nlh.2010.0005.
Erickson Lee, The Economy of Literary Form: English Literature and Theindustrialization of Publishing, 1800–1850, Baltimore 1996.
Escarpit Robert, Sociologie de la littérature, Paris 1958.
Garside, Peter/Mandal, Anthony/Ebbes, Verena et al., The English Novel, 1830–36: A Bibliographic Survey of Fiction Published in the British Isles, Cardiff/Paderborn 2003.
Garside, Peter/Raven, James/Schöwerling, Rainer, „General Introduction,” in: Peter Garside, James Raven, Rainer Schöwerling (eds.), The English Novel, 1770–1829: A Bibliographical Survey of Prose Fiction Published in the British Isles, Oxford 2000.
Garside, Peter/Schöwerling Rainer, The English Novel, 1770–1829: A Bibliographical Survey of Prose Fiction Published in the British Isles, Vol. 2: 1880–1829, eds. Peter Garside, James Raven, Rainer Schöwerling, Oxford 2000.
Garthwaite, Paul H./Kadane, Joseph B./O’Hagan, Anthony, “Statistical Methods for Eliciting Probability Distributions,” in: Journal of the American Statistical Association 100/470 (2005), 680–700, DOI https://doi.org/10.2307/27590587
Gelman, Andrew/Carlin, John B./Stern, Hal S. et al., Bayesian Data Analysis, Boca Raton 32013.
Maddison, Angus, Statistics on World Population, GDP and Per Capita GDP, 1–2008 AD, n.p. 2009.
Moretti, Franco, „Conjectures on World Literature,” in: New Left Review 1 (2000).
Moretti, Franco, Graphs, Maps, Trees: Abstract Models for Literary History, London 2005.
Moretti, Franco, „Network Theory, Plot Analysis,” in: New Left Review 68 (2011), 80–102.
Moretti, Franco, „The Slaughterhouse of Literature,” in: MLQ: Modern Language Quarterly 61/1 (2000), 207–227.
Mornet, Daniel, „Les Enseignements Des Bibliothèques Privées (1750-1780),” in: Revue d’Histoire littéraire de la France 17/3 (1910), 449–496.
Radway, Janice A., Reading the Romance: Women, Patriarchy, and Popular Literature, Chapel Hill 1984.
Raven, James/Forster, Antonia, The English Novel, 1770–1829: A Bibliographical Survey of Prose Fiction Published in the British Isles, Vol. 1.: 1770–1799, eds. Peter Garside/James Raven/Rainer Schöwerling, Oxford 2000.
Raven, James, The Business of Books: Booksellers and the English Book Trade, 1450–1850, New Haven, CT, London 2007.
Sutherland, John, „Chips off the Block: Dickens’s Serialising Imitators,” in: n.e., Dickens and Other Victorians, London 1988, 97–119, DOI https://doi.org/10.1007/978-1-349-19503-9_7.
Sutherland, John, „Publishing History: A Hole at the Centre of Literary Sociology,” in: Critical Inquiry 14/3 (1988), 574–589.
Sutherland, John, Victorian Fiction: Writers, Publishers, Readers. Houndmills, Basingstoke, Hampshire 1995.
Wallins, Roger P., „Richard Bentley,” in: Patricia Anderson, Jonathan Rose (eds.), British Literary Publishing Houses, 1820–1880, Detroit 1991, 39–52.
Weedon, Alexis, Victorian Publishing: The Economics of Book Production for a Mass Market, 1836–1916, Aldershot, UK, 2003.
Williams Raymond, The Long Revolution, Harmondsworth, Middlesex, 1965.
Online Resources
About Google Books, History of Google Books, https://books.google.com/intl/com/googlebooks/history.html.
At the Circulating Library: A Database of Victorian Fiction, 1837–1901, http://www.victorianresearch.org/atcl/.
Internet Archive, https://archive.org.
Stan: A C++ Library for Probability and Sampling, Version 2.16. O. O. 2017, DOI https://doi.org/10.5281/zenodo.814234.
Acknowledgements
We gratefully acknowledge Troy Bassett for his feedback, his work on At the Circulating Library, and for providing estimates of the number of new novels published in 1886, 1891, and 1894. We would like to thank Cosma Shalizi for valuable comments on an early version of this paper. We are also grateful to Franco Moretti for providing machine-readable novel publication data for years prior to 1800.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Methodological Appendix: Inferring the Rate of New Novel Publication, 1800–1919
Methodological Appendix: Inferring the Rate of New Novel Publication, 1800–1919
The quantities which will be estimated are the underlying yearly rates at which new novels (i.e., previously unpublished novels) appeared between 1800 and 1919. The number of novels published between 1800 and 1836 (inclusive) are known and can be used to check the predictions of the model.
1.1 1. Data
New Titles 1800–1836: Garside/Schöwerling and Garside et al. (‘RFGS’).
Thanks to an exhaustive survey we know the number of new novels appearing between 1770 and 1836 (inclusive).Footnote 43 (Only the numbers associated with the years 1800–1836 are used by the model.) The bibliography by RFGS both provides the most important time series used by the model and anchors the effort more generally by providing a particular definition of the novel. The counts of new novel publications during these years will be denoted with the variable \(y_{i}\) where \(i\) indexes the year. For example, the first year, 1800, is associated with the index 1. A sequence of counts will be indicated by joining the starting index and ending index with a colon, e.g., \(y_{1:120}\).
NSTC Titles 1800–1870 with London, Oxford, Cambridge, Edinburgh, and Dublin Imprint Locations (NSTC (LOCED))
Simon EliotFootnote 44 extracted yearly totals of titles listed in the Nineteenth-Century Short Title Catalog (NSTC) whose publication information indicates the place of publication was either London, Oxford, Cambridge, Edinburgh, or Dublin (LOCED). The difficulty encountered in using these figures is that for years after 1836 we do not know what percentage of NSTC (LOCED) titles are new novels. These percentages must be inferred. The NSTC (LOCED) counts are recorded in the sequence \(n_{1:71}\), where the index indicates the year.
The NSTC assigns undated material to the nearest half-decade (to a year ending with a ‘0’ or a ‘5’).Footnote 45 Rather than modify the model to accommodate this idiosyncrasy, years which end in a ‘0’ or a ‘5’ are not used in the model.
Publishers’ Circular Yearly Totals for 1843 to 1919 (PC)
Publishers’ Circular yearly totals are counts of the number of new editions reported in Publishers’ Circular between 1843 and 1919. These totals are reported in Simon Eliot.Footnote 46 These counts are recorded in the sequence \(p_{1:120}\), where the index indicates the year. No reliable counts are available before 1843, so the only the subsequence \(p_{44:120}\) is used.
The difficulty with using the PC data is the same difficulty encountered with the NSTC (LOCED) titles. The percentage of PC editions which are new novels is unknown. Although the specific percentage is unknown, we do have a considerable amount of prior information. From one year to the next, for example, it seems a safe assumption that the percentage is unlikely to change dramatically. Even over a longer period, we can be confident that wild swings are highly unlikely. We do not anticipate that the percentage of all editions which are new novels would swing from 2% between 1820 and 1824 to 20% between 1845 and 1849.
Elicited Quartiles for 1886, 1891, and 1894
Following the procedure described in Paul H. Garthwaite, Joseph B. Kadane, Anthony O’Hagan,Footnote 47 we elicited from Troy Bassett information about the number of new novels published during three years 1886, 1891, and 1894. For each year, quartiles of the distribution matching Bassett’s beliefs about the total number of new novels published in that year were elicited. As editor of At the Circulating Library: A Database of Victorian Fiction, 1837–1901,Footnote 48 Bassett has a range of information valuable in making accurate estimates. For example, Bassett has precise information about lower bounds for the number of novels published during these years (i.e., the number of novels already recorded for these years in the database) and the rate at which new novels are currently being identified.
The elicited quartiles used are the following:
-
1886: 393.75, 481.25, 612.5
-
1891: 481.25, 525.0, 612.5
-
1894: 656.25, 787.5, 962.5
Because At the Circulating Library uses a broader definition of the novel than, the quartiles shown above reflect a discount of 12.5% from the original elicited quartiles. This number was chosen as an approximation of the discount reported by Bassett. Bassett states that between 10 and 15% of the novels recorded in At the Circulating Library would be excluded from RFGS.
Given the quartiles of a distribution, a probability distribution which has approximately the same quartiles can be found. In this case, Gamma distributions are used as the approximating distributions.
Athenaeum Reviews of Novels: 1860, 1865, …, 1900: Ellen Miller Casey
Ellen Miller Casey provides counts for the number of novels reviewed in The Athenaeum during nine years: 1860, 1865, 1870, 1875, 1880, 1885, 1890, 1895, 1900. (The Athenaeum was a London literary magazine published from 1828 to 1921.) This series shows a steep increase in the number of titles reviewed. Like every source of book reviews published at the time, these counts underestimate the total number of new novels published as not all new novels are reviewed. These nine counts are recorded in the sequence \(a_{1:120}\) where the index indicates the year. Only values associated with the nine years are used.
1.2 2. Model
The model offers a narrative of novelistic production in terms of the yearly rate of new novel publication. This narrative can be broken down into two main pieces: an underlying trend and periodic deviations from the trend.
The model assumes that the rate of new novel publication tends to grow at a fixed percentage each year during the period. The model also makes assumptions about deviations from this growth trend. Accomodating deviations is essential because there is no doubt that a variety of events affected the rate of (new) novel publications during the period–e.g., a cholera epidemic in the 1840s, a recession in the 1850s, and World War I. In order to make accurate yearly predictions of the number of novels, the model must be flexible enough to accommodate significant deviations from trend.
The growth trend and deviations are modeled using a Gaussian process.Footnote 49 Instead of having year-specific deviations be independently distributed (as in conventional linear models), deviations are assumed to be correlated with each other in a manner specified by a Gaussian process. The Gaussian process is characterized by a mean function \(m\) and a covariance function \(k\) (sometimes called a kernel). The covariance function used in the model is the squared exponential covariance function. An important parameter in the covariance function is the characteristic length-scale \(l\), which governs to what extent deviations in nearby years are correlated with each other.
In symbols, the yearly rates of new novel publication are modeled on a log-scale as follows:
\(K\) is an \(n \times n\) covariance matrix whose elements are specified by the covariance function \(k\). An element \(\left( {i,j} \right)\) of \(K\) corresponds to \(k\left( {x_{i} ,x_{j} } \right)\). \(t_{1:120}\) is a column vector of integers associated with the years (1 for 1800, 2 for 1801, …, 120 for 1919). The rate of new novel publication in year \(i\) is \(exp\left( {\lambda_{i} } \right)\). Generic weakly informative priors are placed on all the parameters not explicitly mentioned above. The characteristic length-scale \(l_{\lambda }\) is modeled with a Gamma distribution placing 90% of its mass on values between 1 and 10. This prior distribution expresses the belief that deviations will tend to persist for between 1 and 10 years. Such a prior is, for example, consistent with the belief that a financial crisis might affect the rate of publication in the short term but would cease to influence publication rates in years which are more than ten years distant from the crisis.
The yearly rates of new novel publication \(exp\left( {\lambda_{1} } \right),exp\left( {\lambda_{2} } \right), \ldots exp\left( {\lambda_{120} } \right)\) are connected to observations of the number of new novels published with a Negative Binomial sampling distribution. The total number of new novels published is known for years between 1800 and 1836 (inclusive). In symbols, this sampling model reads:
The negative binomial distribution has a variety of parameterizations. \(NegativeBinomial_{2}\) is parameterized by a mean (i.e., location) parameter and a parameter controlling dispersion.Footnote 50
A second Gaussian process models the yearly proportion of PC titles which are new novels. In order to use the Gaussian process, these proportions must be expressed on the log odds scale. (The log odds is the logarithm of the odds, \(log\left( {\frac{p}{1 - p}} \right)\), where \(p\) is a proportion between 0 and 1.) In contrast to changes in the yearly rate of new novel publication, the proportion of PC titles which are new novels is expected to change slowly. Whereas an economic depression might affect the rate of new novel publication over a period of several years, it would be far less likely to affect the proportion of PC titles which are novels. In symbols, the yearly proportions are modeled as follows:
The proportion of PC titles which are novels in the year with index \(i\) is \(logit^{ - 1} \left( {\nu_{i} } \right)\). (\(logit^{ - 1} \left( x \right)\), the inverse logistic function, inverts the transformation of a proportion to the log odds scale.) The characteristic length-scale \(l_{\nu }\) is modeled with a Gamma distribution placing 90% of its mass on values between 8 and 36. This prior distribution expresses the belief that deviations will tend to persist for between 8 and 36 years.
In symbols, the sampling model for the observed yearly counts of PC titles is the following:
Similarly, the sampling model for the observed yearly counts of NSTC (LOCED) titles is as follows:
The NSTC (LOCED) counts are always higher than the PC title counts. We make the assumption that the yearly rate for NSTC (LOCED) counts is equal to the yearly rate for PC title counts multiplied by a constant factor, \(\pi_{n}\).
The distributions elicited from Bassett are incorporated as follows:
These distributions approximate the elicited quartiles described above. Note that \(\lambda_{i}\) is the logarithm of the yearly rate of publication. The approximating distributions therefore also use the log scale.
Finally, the counts of new novels reviewed in The Antheneum are incorporated as follows:
Because The Antheneum does not review all new novels published, the rate of new novel publication is discounted by a factor \(\pi_{a}\). An informative Gamma prior placing 90% of mass on a value between 30 and 70% is used as it seems very likely that The Antheneum reviews a fraction of new novels each year lying in this range.
1.3 3. Inference
Posterior inference is performed using Stan.Footnote 51
Rights and permissions
Open Access Dieses Kapitel wird unter der Creative Commons Namensnennung 4.0 International Lizenz (http://creativecommons.org/licenses/by/4.0/deed.de) veröffentlicht, welche die Nutzung, Vervielfältigung, Bearbeitung, Verbreitung und Wiedergabe in jeglichem Medium und Format erlaubt, sofern Sie den/die ursprünglichen Autor(en) und die Quelle ordnungsgemäß nennen, einen Link zur Creative Commons Lizenz beifügen und angeben, ob Änderungen vorgenommen wurden.
Die in diesem Kapitel enthaltenen Bilder und sonstiges Drittmaterial unterliegen ebenfalls der genannten Creative Commons Lizenz, sofern sich aus der Abbildungslegende nichts anderes ergibt. Sofern das betreffende Material nicht unter der genannten Creative Commons Lizenz steht und die betreffende Handlung nicht nach gesetzlichen Vorschriften erlaubt ist, ist für die oben aufgeführten Weiterverwendungen des Materials die Einwilligung des jeweiligen Rechteinhabers einzuholen.
Copyright information
© 2022 Der/die Autor(en)
About this chapter
Cite this chapter
Riddell, A., Betancourt, M. (2022). Reassembling the Novel. The English Novel, 1789–1919. In: Jannidis, F. (eds) Digitale Literaturwissenschaft. Germanistische Symposien. J.B. Metzler, Stuttgart. https://doi.org/10.1007/978-3-476-05886-7_33
Download citation
DOI: https://doi.org/10.1007/978-3-476-05886-7_33
Published:
Publisher Name: J.B. Metzler, Stuttgart
Print ISBN: 978-3-476-05885-0
Online ISBN: 978-3-476-05886-7
eBook Packages: J.B. Metzler Humanities (German Language)