Thank you for visiting nature.com. You are using a browser version with limited support for CSS. To obtain the best experience, we recommend you use a more up to date browser (or turn off compatibility mode in Internet Explorer). In the meantime, to ensure continued support, we are displaying the site without styles and JavaScript.

  • View all journals
  • My Account Login
  • Explore content
  • About the journal
  • Publish with us
  • Sign up for alerts
  • Open access
  • Published: 18 July 2022

ESG performance and stock prices: evidence from the COVID-19 outbreak in China

  • Zengfu Li 1 ,
  • Liuhua Feng 1 ,
  • Zheng Pan 2 &
  • Hafiz M. Sohail   ORCID: orcid.org/0000-0002-7885-5549 1  

Humanities and Social Sciences Communications volume  9 , Article number:  242 ( 2022 ) Cite this article

18k Accesses

14 Citations

3 Altmetric

Metrics details

  • Social science

This paper investigates the role of environmental, social, and governance (ESG) performance in stock prices during the market financial crisis caused by the COVID-19 pandemic. We use the Chinese listed company data as the bases for adopting an event-study method to identify the impact of ESG performance on cumulative abnormal returns. Empirical results suggest that ESG performance significantly increases firms’ cumulative abnormal returns and has asymmetric effects during the pandemic. Our results are robust to various robustness checks that consider the replacement of event window period, ESG measurement, adding other control variables, and sample exclusion of Hubei Province. We further find that reputation and insurance effects are important mechanisms through which ESG performance influences stock prices. Lastly, heterogeneous analyses show that ESG effects are considerably pronounced among firms with low human capital and bad image and in high-impact regions.

Similar content being viewed by others

research paper on share prices

Worldwide divergence of values

research paper on share prices

The impact of Russia–Ukraine war on crude oil prices: an EMC framework

research paper on share prices

Bayesian statistics and modelling

Introduction.

In recent years, environmental, social, and governance (ESG) investments, frequently called ethical or sustainable investments, have rapidly increased globally (Galbreath, 2013 ). ESG investing is an investment process that integrates ESG considerations into investment decisions (Mǎnescu, 2011 ). Given the COVID-19 pandemic, the necessity of ESG investing has been highlighted again (Demers et al., 2021 ; Manabe and Nakagawa, 2022 ). Investors are interested in ESG investments for at least two reasons (Renneboog et al., 2008 ). First, by focusing on ESG investments, ethical investment practices are actively promoted (Baldini et al., 2018 ; Broadstock et al., 2021 ). Second, ESG investments are increasingly recognized as improving the performance of managed portfolios, reducing portfolio risks, and increasing returns (Albuquerque et al., 2020 ; Díaz et al., 2021 ; Broadstock et al., 2021 ).

Early literature on ESG investing has been partially inspired by studies of the eminent economist Milton Friedman (Friedman, 2007 ), who argued that ESG practices constitute a misallocation and misappropriation of valuable corporate resources. Renneboog et al. ( 2008 ) concluded that existing studies hint, but do not explicitly demonstrate, that ethical investors are willing to accept sub-optimal financial performance to pursue social or ethical objectives. Subsequently, a series of studies have expanded on the preceding literature. Some studies on ESG investing have focused on the application of returns and risk management. Hartzmark and Sussman ( 2019 ) found that investors make positive predictions on sustainable assets, steering money away from funds with low portfolio sustainability ratings to those with high ratings. They also found no evidence that high-sustainability funds outperform low-sustainability funds. Demers et al. ( 2021 ) determined that ESG performance facilitates the accumulation of intangible assets but does not serve as protection against downside risk.

However, emerging studies have supported the view that ESG-themed investments have low downside risks and are minimally volatile in price during turbulent times. Hoepner et al. ( 2021 ) and Pedersen et al. ( 2021 ) obtained empirical evidence that ESG engagement reduces firms’ downside risks and their exposure to downside risk factors. Albuquerque et al. ( 2020 ) developed a theoretical framework to show that stocks with high ESG ratings have significantly higher returns, lower return volatilities, and higher trading volumes than other stocks. Broadstock et al. ( 2021 ) showed that high-ESG portfolios typically outperform low-ESG portfolios, thereby mitigating financial risks during financial crises.

Although there is limited research on the specific role of ESG performance during times of crisis, some insights have been gained from the 2008–2009 global financial crisis. Nejati et al. ( 2010 ) noted that the root causes of the current economic crisis could be moderated by a global transparency and accountability system and a public reporting of ESG performance. Erragraguy and Revelli ( 2015 ) showed that the adoption of ESG standards by firms during the crisis increased transparency, mitigated information asymmetries, and improved stock market liquidity and quality. Henke ( 2016 ) demonstrated that high-ESG-rated funds outperformed low-ESG-rated funds during the crisis, further supporting the view that investors place intrinsic value on ESG investments.

In the first few months of 2020, the sudden market-wide financial crisis was triggered in response to the emerging global health crisis (i.e., COVID-19), the consequences of which were more severe than those of the Great Depression in 1929–1933 and the global financial crisis in 2007/2008 (Broadstock et al., 2021 ). The current study shows that stock prices are empirically tested for negative shocks during the COVID-19 pandemic in some products and firms but not in others (Al-Awadhi et al., 2020 ; Shen et al., 2020 ). This result leads to a compromise opinion that the role of ESG generally depends on exposure to crisis shocks. Moreover, we are motivated to question how ESG effects vary along with different products and firms during public crises. However, only a few studies have indicated the specific role of ESG performance in crisis periods. Therefore, the goal of this paper is to fill in this research gap.

We use the Chinese listed company data as the bases for adopting an event-study method to identify the impact of ESG performance on cumulative abnormal returns. The empirical results show that ESG performance is positively associated with cumulative abnormal returns during the COVID-19 pandemic. When decomposing firms with positive and negative shocks, we find that cumulative abnormal returns are positively related to ESG among firms with negative shocks but not positive shocks. These results suggest that the importance of ESG performance is reinforced in times of crisis, and is consistent with the inference that investors use ESG performance as a signal of future returns and risk mitigation.

Our work builds on the current literature on the role of ESG performance in stock prices (Duuren et al., 2016 ; Remmer Sassen et al., 2016 ; Jagannathan et al., 2017 ) and extends the results on ESG to public crisis events. We use an event study approach to analyze the volatility of ESG performance on stock prices during the COVID-19 pandemic. Compared with the existing literature (Nejati et al., 2010 ; Erragraguy and Revelli, 2015 ; Henke, 2016 ), this study provides a more comprehensive perspective on the importance of ESG performance in major crises. Understanding the importance of ESG is necessary because it is a crucial indicator of risk management, non-financial performance, and sustainability. Through ESG practices, firms can obtain significant reputation and risk protection to reduce price volatility in times of crisis, thereby contributing to their long-term operations and sustainability.

This study adds to the limited number of prior studies that have examined the impact of firms’ ESG performance on their stock prices but have provided conflicting results (Friedman, 2007 ; Renneboog et al., 2008 ; Hartzmark and Sussman, 2019 ; Demers et al., 2021 ). The current study also provides additional empirical evidence on the important mechanisms of ESG performance through further analysis, thereby opening a black box for a positive relationship between ESG performance and stock prices. The results of this study suggest that reputation and insurance effects are important mechanisms by which ESG performance influences stock prices. This outcome reflects the fact that sustainability investments have low downside risks and are minimally volatile in price during turbulent periods. This finding clarifies the important role of current ESG practices in guiding investors’ decision-making and provides empirical evidence for investors to focus on sustainable investment.

This paper also complements the literature on the effects of public crises on financial markets. Existing literature has discussed the influence on financial markets mainly in terms of the 2008–2009 financial crisis (Nejati et al., 2010 ; Erragraguy and Revelli, 2015 ; Henke, 2016 ). We differ from these studies by specifically focusing on the COVID-19 pandemic. Note that the subject of our study is Chinese listed companies. Given that China is the second-largest economy in the world where the COVID-19 public crisis spread earlier and was interrupted by containment measures, a reasonable undertaking is to investigate the influence of this crisis on financial markets in the Chinese sample we use compared with other countries.

The remainder of this paper is organized as follows. Section “Data and methodology” presents the sample and variables. Section “Empirical results” discusses the main empirical results and robustness analyses. Section “Further analysis” provides further mechanisms. Lastly, section “Conclusion” concludes the study.

Data and methodology

Data of this study cover Chinese non-financial A-share listed firms in 2020. We collect data from several resources. First, we acquire ESG data from China Sino-Securities Index Information Service (Shanghai) Company Limited, a third-party data provider based in China specializing in ESG data. Second, stock prices, firm financial data, and firm management data are obtained from China Securities Markets and Accounting Research Database. In particular, stock prices are measured using cumulative abnormal returns calculated by utilizing the event-study method. Third, we obtain media coverage data mainly from the Chinese Research Data Service database to measure the level of media attention.

Main variables

Data of this study cover Chinese non-financial A-share listed firms in 2020. We collect data from several resources. First, we acquire ESG data from China Sino-Securities Index Information Service (Shanghai) Company Limited, a third-party data provider based in China specializing in ESG data. Second, stock prices, firm financial data, and firm management data are obtained from the China Securities Markets and Accounting Research Database. Specifically, stock prices are measured using cumulative abnormal returns calculated by the event-study method. Third, we obtain media coverage data mainly from the Chinese Research Data Service database to measure the level of media attention.

Stock price (CAR)

Cumulative abnormal return (CAR) is widely used as a measure of stock price. We follow the previous literature (Demers et al., 2021 ; Zhang et al., 2021 ; Broadstock et al., 2021 ) and use CAR estimated through an event study approach as the dependent variable. CAR is an impartial estimate of additional or reduced firm value that accrues as a result of the occurrence (McWilliams and Siegel, 1997 ; Campbell et al., 1998 ; Fernando et al., 2012 ). In the COVID-19 context, this paper uses the event study method to analyze the impact of ESG performance on stock prices.

In an event study, we need to determine the event date, event window, estimation window, and estimation model. The details are as follows. (1) Event date : As COVID-19 received national attention on 20 January 2020, we draw on existing literature (Zhang et al., 2021 ) and use this date as an event date in the event study. On 20 January 2020, China’s central government provided important instructions on the prevention and containment of COVID-19. On the same day, panic ensued after the announcement by pulmonary experts that COVID-19 was more transmissible than previous diseases and could be passed from one person to another person. As of 20 January 2020, the Chinese National Health Council officially issued the number of new cases in each province and incorporated COVID-19 into the Infectious Diseases Act and Sanitary and Isolation Act (Zhang et al., 2021 ). (2) Event window : This paper follows previous studies (Kanas, 2005 ; Miyajima and Yafeh, 2007 ; Fernando et al., 2012 ) and selects the event window consisting of 11 days from t −5 to t +5 days. (3) Estimation window : Campbell et al. ( 1998 ) documented that the selection of estimation window for short-term events can be 120 days or even longer. Thus, we choose the estimated window period of 175 days from t −210 to t −36 days. (4) Estimation model : We use the existing literature (Campbell et al., 1998 ) as a basis to take the OLS market model for calculating the expected returns.

This study requires the calculation of expected, abnormal, and cumulative abnormal returns (Campbell et al., 1998 ). We first calculate each firm’s expected return (ER i,t ) during the event period:

where ER i,t is the expected return of firm i on day t during the event period, R M , t represents the market return on day t during the event period, and β 0 and β 0 are the estimated parameters in Model (1). Thereafter, we use expected returns (ER i , t ) gained from Model (1) to calculate the abnormal returns during the event period. This study estimates abnormal returns (AR i,t ) based on the Fama et al. ( 1969 ) market model:

where AR i,t is the abnormal return of firm i on day t during the event period, R i,t is the actual return of firm i on day t during the event period, and ER i , t is the expected return obtained from Model (1). Lastly, we use abnormal returns (AR i,t ) obtained from Model (2) to calculate the cumulative abnormal returns:

where CAR i,t is a calculation from the period of days between k and j . In this paper, the event windows to calculate CAR is [−5, 5].

ESG performance (ESG)

The Independent variable used in this study is the quarterly score of ESG performance, which is calculated based on three dimensions: environmental, social, and governance. Given that ESG data are disclosed quarterly, we calculate the average of ESG over four quarters to measure the annual ESG performance. We also follow the previous literature (Zhang et al., 2021 ; Broadstock et al., 2021 ; Díaz et al., 2021 ; Demers et al., 2021 ) and include the following set of control variables in the estimation: leverage ratio (Lev), firm profitability (Roa), firm size (Size), nature of equity (Soe), degree of risk (Beta), shareholder structure (Top1), number of board members (Board), the duality of CEO and chairman (Dual), the proportion of independent directors (Independ), intangible asset (Intangible), and tangible asset (Tangible). The detailed definitions of the variables are presented in Table 1 .

Summary statistics

In response to the COVID-19 pandemic, this paper selects Chinese A-share listed companies as a study sample. This study selects the sample based on the following considerations. First, the financial sector is excluded owing to the uniqueness of its business, financial reporting, and regulatory structure. Second, we exclude firms with losses and those specially treated by stock exchanges. Third, we exclude samples with an estimation window of under 175 trading days. Fourth, our article removes samples with missing values in the ESG, CAR, and control variables. Lastly, continuous variables are winsorized at the 1% and 99% levels to mitigate concerns with extreme values. The resulting sample in our study is 2188 observations.

Table 2 provides the descriptive statistics of the variables. CAR index ranges from −0.2496 to 0.4665. In addition, the mean and median of CAR are −0.0115 and −0.0367, respectively, with a standard deviation of 0.1223. This result suggests that the level of cumulative abnormal returns varies considerably across firms and that overall cumulative abnormal returns are low. Moreover, this situation implies that firms are generally subject to negative shocks during the COVID-19 pandemic. The average ESG is 0.8447, which is within the range of good. For the control variables, the average leverage is 44.21%, ROA is 5.13%, and firm size is 22.59. Our sampled chairman has ~27.06% of the CEO, and the largest shareholder holder accounts for 34.61% of firm stocks. The average board size is about 8 (= e 2.1260 ) members, 37.61% of whom are independent directors. This result is consistent with the CSRC requirement for board independence (Wang et al., 2021 ). The distribution of the control variables in this paper is similar to that reported in previous research (Zhang et al., 2021 ; Broadstock et al., 2021 ; Díaz et al., 2021 ).

Figure 1 shows a graphical representation of AARs and CARs for the event window [−5, 5]. As shown in Fig. 1 , the two trend lines rise initially and fall thereafter from the announcement date. In the 4 days from 20 to 23 January 2020, firms reacted positively to the pandemic market. In the early stage of the COVID-19 pandemic, the demand for products to prevent pandemic infection was greater than the supply. This phenomenon prompted speculators to use arbitrage opportunities to invest, thereby explaining the positive response of the capital market.

figure 1

Trend of AARs and CARs for the event window ( t −5 – t +5 days).

On the 4th day of the COVID-19 outbreak (23 February 2020), the city of Wuhan, which was the most affected city, was closed. Shortly after the lockdown, the stock market was closed for the Chinese Spring Festival from 24 January to 2 February. The market reopened on 3 February. Note that on the 5th day of trading following the COVID-19 outbreak, which was the first trading day after the closure of Wuhan (3 February 2020), the response of firms to the pandemic was extremely negative. The pandemic caused chaos and weakened the economy of China, particularly after the closure of Wuhan. The closure of the city to curb the further spread of the disease will destroy the entire logistics and supply chain system (Tang et al., 2021 ). Most factories faced shutdowns, production stoppages, and even closures, thereby causing a significant downward trend across the capital markets. The downward trend of the stock market was consistent with the previous statistical description that the COVID-19 epidemic in 2019 has had a significant negative impact on the financial market.

Empirical results

Effect of esg performance on firms’ stock price.

We use the following multiple regression model to investigate the general relationship between ESG performance and firm stock prices during the COVID-19 pandemic:

where i denotes the firm, CAR represents the firm’s cumulative abnormal return during the COVID-19 pandemic, and ESG is the firm’s ESG performance measure proxied by its ESG score. Our results focus on β 1 , which captures how ESG performance affects a firm’s stock price. The vector Controls i stacks a series of control variables that account for the impact of firm characteristics on the stock price. Details of these variables are described in the section “ESG performance (ESG)”. We also include province-fixed effects to control for unobservable regional characteristics. ESG is a highly industry-related variable (Yu and Luu, 2021 ). To control for industry characteristics, we control for industry fixed effects, in which ε i is the error term. Variables and definitions used in the model are shown in Table 1 .

We report the results of the baseline OLS regressions in Table 3 . In column (1), we include only the ESG and CAR indicators after controlling for the province- and industry-fixed effects. The coefficient of ESG is 0.1479 ( t -stat 3.5072), which is positive and statistically significant at the 1% level. This result indicates that ESG practices play an important role in reducing price volatility in times of crisis. Our results show that ESG performance has a significant positive impact on the cumulative excess return. In other cases, we include firm control variables in column (2). However, estimates remain positive and statistically significant. For control variables, we find that the coefficient of Size is significantly positive, suggesting that large firms reduce their risk of stock price declines. Conversely, the coefficient of Lev is negative, implying that firms with more leverages will suffer a greater risk of stock price declines. Our results are consistent with those of previous studies regarding control variables (Demers et al., 2021 ; Broadstock et al., 2021 ). In summary, our results provide suggestive evidence that firms with high ESG ratings are conducive to mitigating the downside risk of stock prices during the COVID-19 pandemic.

To significantly understand whether or not the role of ESG depends on the risk exposure characteristics of crisis shock, this paper examines the influence of ESG performance on stock prices of firms subject to positive and negative shocks during the COVID-19 pandemic. We choose two approaches to classify the sample into firms with positive and negative shocks during the pandemic. On the one hand, this paper draws on the existing literature (Kanas, 2005 ; Miyajima and Yafeh, 2007 ) in classifying the positive and negative shock groups by determining whether or not the cumulative abnormal return is above 0. On the other hand, we use the previous literature (Al-Awadhi et al., 2020 ; Shen et al., 2020 ) as a basis for identifying the following industries as positive groups: information technology and medicine manufacturing industries. Moreover, we select the following industries to be identified as negative groups: tourism, transportation, restaurants, wholesale and retail trade, realty business, and export manufacturing industries.

Table 4 shows the results of the tests of exposure characteristics during the crisis. The first two columns and last two columns show that the results are similar whether the sample is divided in the first or second way. Coefficients of ESG for firms in the negative group are positive and statistically significant. Conversely, coefficients of ESG for firms in the positive group is not statistically significant. These results suggest that the positive effect of ESG performance on stock prices is more significant among firms that are more severely affected by negative shocks. That is, our findings provide evidence that ESG performance acts as a risk protection tool that contributes to the sustainability of operations in turbulent times, specifically among firms with severe negative shocks.

Robustness tests

Alternative event window periods.

In the baseline regression, we mainly use an event window of [−5, 5] for the test. However, the role of ESG performance for the shorter or longer event windows is unclear. This section investigates whether or not our results hold after replacing the shorter and longer event windows. These event windows are as follows: [−3, 3], [−7, 10], [−7, 11], and [−7, 14]. The first four columns of Table 5 present the results of the tests for alternative event window periods, in which each column represents a different event window. Column (1) shows that the coefficient of ESG is 0.0752 ( t -stat 2.8186) in the short-term event window [−3, 3], which is positive at the 1% level. We also find that the coefficient of ESG is 0.1327 ( t -stat 2.6966) in the long-term event window [−7, 10], which is positive at the 1% level. Coefficient of ESG is also significantly positive in the event windows [−7, 11] and [−7, 14], albeit at a weaker level of significance (5%, 10%). In summary, these findings provide strong evidence that ESG plays an important role in preventing downside risk during turbulent times.

Alternative proxies for ESG performances

Given that raw data for the ESG score is quarterly, this paper uses it for testing the annual data of firms that are susceptible to measurement error and raise concerns about their validity. To address these potential issues, we consider two alternative proxies for ESG performance. First, we choose the median of the quarterly data on ESG scores (ESG1) to measure ESG performance. Second, we measure ESG performance as ESG score in the first quarter of year t  + 1 (ESG2). Similarly, we find a robust positive association between ESG performance and stock price, as shown in columns (5) and (6) of Table 5 .

Excluding the Hubei sample

This paper considers a subsample that excludes Hubei Province, which was the most affected by COVID-19. The early outbreak of COVID-19 in China was concentrated in Wuhan (Wuhan), Hubei Province, and spread rapidly to other provinces. Hubei has many listed firms. Thus, we draw on the existing literature (Ren et al., 2021 ) and exclude the Hubei sample to emphasize that our results are not driven by firms in the province most affected by COVID-19. We re-estimate our regression using the subsamples. Column (7) of Table 5 shows that the coefficient of ESG is positive and significant, which remains consistent with our findings.

Inclusion of other variables

Existing research has suggested that institutional investor ownership and cash holdings of firms may be associated with stock price volatility (Bushee and Noe, 2000 ; Chang et al., 2017 ). Whether or not our regression results are sensitive to include institutional investor ownership and cash holdings has aroused our concern. To mitigate this concern, we control for institutional investor ownership and cash holdings of firms in the regression process. In column (8) of Table 5 , empirical results remain positive and statistically significant for the cumulative abnormal returns. The inclusion of firms’ institutional investor ownership and cash holdings does not induce material change in coefficient magnitudes. That is, empirical evidence implies that the inclusion of institutional investor ownership and cash holdings should not be an issue in our research.

Further analysis

Potential mechanism.

This section extends the preceding results to clarify the potential mechanisms of why ESG performance positively affects firms’ cumulative abnormal returns.

Reputation effect

Demers et al. ( 2021 ) suggested that ESG performance is an intangible asset, which plays a positive role in optimizing supply chain partnerships, enhancing consumer product satisfaction, and improving employee productivity. Given the attention of social media and stakeholders, ESG practices are gradually becoming a tool for corporate impression management that can enhance social visibility and expand market share (Lokuwaduge and Heenetigala, 2017 ; Xie et al., 2019 ). Thus, we expect ESG performance to be conducive to increasing cumulative abnormal corporate returns through reputation enhancement.

In particular, we take the number of positive online media and financial newspaper reports to represent the reputation gained by the firm. Thereafter, we use the natural logarithm of the number of positive online media and financial newspaper reports as our variables New1 and New2, respectively. To test the reputation effect mechanism, we introduce an interaction term between ESG and New1 or New2 in our model. As shown in columns (1) and (2) of Table 6 , the coefficients of New1 and New2 are significantly positive, suggesting that ESG practices are beneficial in enhancing corporate reputation. We also find that the coefficients of ESG × New1 and ESG × New2 are significantly negative, implying that the reputation effect of ESG performance is more pronounced among firms with lower reputations. Our findings demonstrate that ESG practices are beneficial in enhancing corporate reputation to improve cumulative excess returns during turbulent times.

Insurance effect

Academics and practitioners agree that firms’ risk exposures are linked to their ESG profiles. Albuquerque et al. ( 2020 ) presented empirical evidence suggesting that firms’ increasing product differentiation through ESG investments reduces systemic risk and improves firm value. Hoepner et al. ( 2021 ) found corroborating evidence that ESG practices reduce firms’ exposure to downside risk factors. These results support practitioner arguments that including ESG factors in investment decisions can mitigate uncompensated portfolio risks (Jagannathan et al., 2017 ; Pandey and Kumari, 2021 ; Broadstock et al., 2021 ). Hence, we expect ESG performance to be considered a risk management tool during the COVID-19 pandemic to increase cumulative excess returns by reducing business risk.

This paper Utilizes the previous literature (Ghosh and Olsen, 2009 ) as basis in using the standard deviation of sales revenue over the past 5 years (Risk1) to measure business operating risk. To remove the effect of industry, we also use the standard deviation of sales revenue over the past 5 years adjusted for industry (Risk2) to measure business operating risk. We introduce an interaction term between ESG and EU1 or EU2 in the model to test the insurance effect mechanism. The results in columns (3) and (4) of Table 6 show that the coefficients of Risk1 and Risk2 are negative and significant. This result is consistent with findings of previous studies that ESG performance reduces downside risk. We also find that coefficients of ESG × Risk1 and ESG × Risk2 are significantly positive, suggesting that the insurance effect of ESG performance is markedly pronounced among firms with high operational risk. The results of this study indicate that ESG practices can be used as a risk management tool in turbulent times, thereby increasing cumulative abnormal returns. These findings further support the insurance-enhancing effects of ESG practices.

Cross-sectional analysis

In this subsection, we conduct several cross-sectional tests to examine how the impact of ESG performance on cumulative abnormal returns varies with firm characteristics during the COVID-19 pandemic.

Cross-sectional tests of human capital characteristics

We investigate whether or not the human capital characteristics of firms affect the contribution of ESG performance in terms of cumulative abnormal returns during the COVID-19 pandemic. Apart from the detrimental influence of the virus on staff safety, the lockdown and physical separation measures damage firms’ financial performance. Fahlenbrach et al. ( 2020 ) showed that labor-intensive firms, in which work-from-home policies are difficult to implement, have high exposure to COVID-19. By contrast, firms with high technological equipment are less affected and even have the opportunity to expand their business (Li et al., 2021 ). Thus, we predict that firms with low human capital are considerably at risk during the COVID-19 pandemic, thereby possibly increasing the sensitivity of ESG performance to cumulative abnormal returns.

This paper draws on prior literature (Li et al., 2021 ) and selects the ratio of the number of employees to sales as a measure of human capital intensity (Labor). The more value of Labor, the higher the productivity of employees and the higher the human capital of the firm. In addition, we divide the two groups of high and low human capital according to whether or not Labor is above the median value of the sample. Columns (1) and (2) of Table 7 show that the coefficient of ESG is significantly positive at the 1% level among firms with low human capital, but it is not statistically significant among firms with high human capital. Thus, the role of ESG performance in preventing stock price downside is markedly significant among firms with low human capital during the COVID-19 pandemic. This result is consistent with our expectations.

Cross-sectional test of public image characteristics

This subsection further investigates whether or not our main results are influenced by corporate image. The concepts of corporate image and trust have obtained special relevance, which significantly influence individual behavior (Rindell et al., 2015 ). Flavián et al. ( 2005 ) argued that the image perceived by consumers makes the factors existing in the transaction visible, thereby reducing the risk perceived by individuals and increasing the possibility of purchase. Compared with firms with bad image, firms with good image are expected to have stronger protection against downside risks (Lee et al., 2022 ). Consequently, we predict that firms with bad image have a high downside risk during the COVID-19 pandemic, thereby possibly increasing the sensitivity of ESG performance to cumulative abnormal returns.

This study chooses the number of negative reports in online media (Bad_image) measure corporate image. Columns (3) and (4) of Table 7 report that the coefficient of ESG is significantly positive at the 1% level among firms with numerous negative online media reports (firms with Bad_image above the sample median), but it is not statistically significant at a low number of negative online media reports. This finding is consistent with our expectation that firms with bad image face high downside risk during the COVID-19 pandemic, thereby increasing the sensitivity of ESG performance to cumulative abnormal returns.

Cross-sectional test of regional characteristics

Lastly, we explore the heterogeneous effect of ESG performance on cumulative abnormal returns for firms with different geographical locations. Shen et al. ( 2020 ) noted that by using the region as a criterion, COVID-19 has a major negative influence on the serious-impact regions. After China began its comprehensive campaign against COVID-19, seven provinces, namely, Hubei, Hunan, Henan, Jiangxi, Anhui, Guangdong, and Zhejiang, enforced harsh labor restrictions and the resumption of work. These restrictions have led to a decline in consumption levels and closure of many firms in high-impact areas. By contrast, for cities far from the infected areas, the resumption of operations will be significantly earlier. Fu and Shen ( 2021 ) showed that the early resumption of work sends a signal of reduced risk to stakeholders, thereby promoting firms to obtain more investment capital. In summary, we expect that the protection of ESG performance against downside risk during the COVID-19 pandemic is more significant in high-impact regions.

We draw on the previous literature (Shen et al., 2020 ) in selecting seven provinces (i.e., Hubei, Hunan, Henan, Jiangxi, Anhui, Guangdong, and Zhejiang) as high-impact regions and other provinces as low-impact regions. Columns (5) and (6) of Table 7 show that the coefficient of ESG is significantly positive at the 1% level for firms in high-impact regions, but it is not statistically significant for firms in low-impact regions. This result suggests that ESG practices of firms in high-impact regions play a key role in risk management and reduce stock price volatility during the COVID-19 pandemic.

During the COVID-19 pandemic, the significant decrease in global equity values reflects strong negative investor sentiments (Broadstock et al., 2021 ). Hence, we ask if this negative sentiment transfers asymmetrically across the firms, or whether or not ESG performance may be used as a valuable signal for systematically avoiding negative risk during the crisis. However, only a few studies have provided the specific role of ESG performance in this crisis period. Therefore, the goal of this paper is to fill in this research gap.

We use a unique environmental setting and find that ESG performance is positively associated with cumulative abnormal returns around the COVID-19 pandemic and has an asymmetric impact. We contribute to the literature with empirical evidence on the resilience of stocks with high ESG performance during financial crises. This finding is consistent with the view that investors may take ESG performance as a signal of risk mitigation during the crisis. We further find that the reputation and insurance effects are important mechanisms through which ESG performance influences stock price. Moreover, heterogeneous analyses show that ESG effects are considerably pronounced among firms with low human capital, bad image, and in high-impact regions. Overall, we conclude that ESG practices can be used as a risk management tool to enhance share price resilience, particularly in turbulent times.

Our findings are of particular relevance to business managers, investors, and policy makers. For managers, this paper provides empirical evidence supporting ESG investing as a value-enhancing strategy. In addition, we find evidence that ESG practices act as impression and risk management tools to reduce risk downside in turbulent times. Thus, firms should elevate their ESG performance to make them markedly attractive targets in the market to expand their market share. For investors, ESG investments improve the performance of managed portfolios, reduce portfolio risk, and increase returns. Investors should consider ESG factors in their investment decisions to enhance investment returns in turbulent times. Lastly, policy makers should advocate the adoption of ESG practices and encourage companies to disclose information on ESG performance, which are essential for economic sustainability.

Several limitations should be considered in future research. One of the limitations of this study is that the data set only includes large listed firms selected from the China Securities Market and Accounting Research Database. In our opinion, limited resources may play a key role in determining the ESG performance of small firms. Consequently, the inclusion of small- and medium-sized firms may provide different results, which will be left for future research. Furthermore, this study is mainly based on a sample of Chinese listed companies, thereby limiting the generalizability of this research. Lastly, future research could investigate whether or not these results are valid in the context of developed countries or international markets, in which business strategies and ESG disclosures of firms are influenced by the institutional environment.

Data availability

Data set used in this study is available from the corresponding author on the reasonable request. Further data is publically available on China Securities Markets and Accounting Research Databases.

Al-Awadhi AM, Alsaifi K, Al-Awadhi A, Alhammadi S (2020) Death and contagious infectious diseases: impact of the COVID-19 virus on stock market returns. J Behav Exp Finance 27:100326. https://doi.org/10.1016/j.jbef.2020.100326

Article   PubMed   PubMed Central   Google Scholar  

Albuquerque R, Koskinen Y, Yang S, Zhang C (2020) Love in the time of COVID-19: the resiliency of environmental and social stocks. CEPR Discussion Papers

Baldini M, Maso LD, Liberatore G, Mazzi F, Terzani S (2018) Role of country- and firm-level determinants in environmental, social, and governance disclosure. J Bus Eth 150:79–98. https://doi.org/10.1007/s10551-016-3139-1

Article   Google Scholar  

Broadstock DC, Chan K, Cheng LTW, Wang X (2021) The role of ESG performance during times of financial crisis: evidence from COVID-19 in China. Finance Res Lett 38:101716. https://doi.org/10.1016/j.frl.2020.101716

Bushee BJ, Noe CF (2000) Corporate disclosure practices, institutional investors, and stock return volatility. J Account Res 38:171–202. https://doi.org/10.2307/2672914

Campbell JY, Mackinlay AC, Lo AW (1998) The econometrics of financial markets. J Empir Finance 3(1):15–102

MATH   Google Scholar  

Chang Y, Benson K, Faff R (2017) Are excess cash holdings more valuable to firms in times of crisis? Financial constraints and governance matters. Pac-Basin Finance J 45:157–173

Demers E, Hendrikse J, Joos P, Lev B (2021) ESG did not immunize stocks during the COVID-19 crisis, but investments in intangible assets did. J Bus Finance Account 48:433–462. https://doi.org/10.1111/jbfa.12523

Díaz V, Ibrushi D, Zhao J (2021) Reconsidering systematic factors during the Covid-19 pandemic—the rising importance of ESG. Finance Res Lett 38:101870

Duuren EV, Plantinga A, Scholtens B (2016) ESG integration and the investment management process: fundamental investing reinvented. J Bus Eth 138:525–533

Erragraguy E, Revelli C (2015) Should Islamic investors consider SRI criteria in their investment strategies. Finance Res Lett 14:11–19. https://doi.org/10.1016/j.frl.2015.07.003

Fahlenbrach R, Rageth K, Stulz RM (2020) How valuable is financial flexibility when revenue stops? Evidence from the COVID-19 crisis. Working Paper Series

Fama E, Fisher L, Jensen M, Roll R (1969) The adjustment of stock prices to new information. Int Econ Rev 10:1–21. https://doi.org/10.2139/ssrn.321524

Fernando CS, May AD, Megginson WL (2012) The value of investment banking relationships: evidence from the collapse of lehman brothers. J Finance 67:235–270

Flavián C, Guinalíu M, Torres E (2005) The influence of corporate image on consumer trust: a comparative analysis in traditional versus internet banking. Internet Res 15:447–470. https://doi.org/10.1108/10662240510615191

Friedman M (2007) The social responsibility of business is to increase its profits. In: Zimmerli WCh, Holzinger M, Koordination CSR und Nachhaltigkeit (eds). Corporate ethics and corporate governance. Springer, pp. 173–178

Fu M, Shen H (2021) COVID-19 and corporate performance in the energy industry. Energy Res Lett 1:1–4. https://doi.org/10.46557/001c.12967

Galbreath J (2013) ESG in focus: the Australian evidence. J Bus Eth 118:529–541. https://doi.org/10.1007/s10551-012-1607-9

Ghosh D, Olsen L (2009) Environmental uncertainty and managers’ use of discretionary accruals. Account Organ Soc 34:188–205. https://doi.org/10.1016/j.aos.2008.07.001

Hartzmark SM, Sussman AB (2019) Do investors value sustainability? A natural experiment examining ranking and fund flows. J Finance 74:2789–2837. https://doi.org/10.1111/jofi.12841

Henke H-M (2016) The effect of social screening on bond mutual fund performance. J Bank Finance 67:69–84. https://doi.org/10.1016/j.jbankfin.2016.01.010

Hoepner AGF, Oikonomou I, Sautner Z, Starks LT, Zhou X (2021) ESG shareholder engagement and downside risk. Soc Sci Res Netw. Finance Working Paper No. 671/2020.

Jagannathan R, Ravikumar A, Sammon M (2017) Environmental, social, and governance criteria: why investors are paying attention. National Bureau of Economic Research

Kanas A (2005) Pure contagion effects in international banking: the case of BCCI’s failure. J Appl Econ 8:101–123. https://doi.org/10.1080/15140326.2005.12040620

Lee S, Lee D, Hong C, Park M-H (2022) Performance of socially responsible firms during the COVID-19 crisis and trading behavior by investor type: evidence from the Korean stock market. Finance Res Lett 45:102660. https://doi.org/10.1016/j.frl.2021.102660

Li K, Liu X, Mai F, Zhang T (2021) The role of corporate culture in bad times: evidence from the COVID-19 pandemic. J Financ Quant Anal 56:2545–2583

Lokuwaduge CSDS, Heenetigala K (2017) Integrating environmental, social and governance (ESG) disclosure for a sustainable development: an Australian study. Bus Strategy Environ 26:438–450. https://doi.org/10.1002/bse.1927

Manabe T, Nakagawa K (2022) The value of reputation capital during the COVID-19 crisis: evidence from Japan. Finance Res Lett 46:102370. https://doi.org/10.1016/j.frl.2021.102370

Mǎnescu C (2011) Stock returns in relation to environmental, social and governance performance: mispricing or compensation for risk? Sustain Dev19:95–118

McWilliams A, Siegel D (1997) Event studies in management research: theoretical and empirical issues. Acad Manag J 40:626–657. https://doi.org/10.2307/257056

Miyajima H, Yafeh Y (2007) Japan’s banking crisis: an event-study perspective. J Bank Finance 31:2866–2885. https://doi.org/10.1016/j.jbankfin.2007.03.006

Nejati M, Shah Bin A, Shahbudin, Bin Amran A (2010) Sustainable development: a competitive advantage or a threat? Bus Strategy Ser 11:84–89

Pandey DK, Kumari V (2021) Event study on the reaction of the developed and emerging stock markets to the 2019-nCoV outbreak. Int Rev Econ Finance 71:467–483. https://doi.org/10.1016/j.iref.2020.09.014

Pedersen LH, Fitzgibbons S, Pomorski L (2021) Responsible investing: the ESG-efficient frontier *. J Financ Econ 142:572–597. https://doi.org/10.1016/j.jfineco.2020.11.001

Ren Z, Zhang X, Zhang Z (2021) New evidence on COVID-19 and firm performance. Econ Anal Policy 72:213–225. https://doi.org/10.1016/j.eap.2021.08.002

Renneboog L, Ter Horst J, Zhang C (2008) Socially responsible investments: institutional aspects, performance, and investor behavior. J Bank Finance 32:1723–1742. https://doi.org/10.1016/j.jbankfin.2007.12.039

Rindell A, Santos FP, de Lima AP (2015) Two sides of a coin: connecting corporate brand heritage to consumers’ corporate image heritage. J Brand Manag 22:467–484. https://doi.org/10.1057/bm.2015.20

Sassen R, Hinze A-K, Hardeck I (2016) Impact of ESG factors on firm risk in Europe. J Bus Econ 86:867–904. https://doi.org/10.1007/s11573-016-0819-3

Shen H, Fu M, Pan H, Yu Z, Chen Y (2020) The impact of the COVID-19 pandemic on firm performance. Emerg Markets Finance Trade 56:2213–2230

Tang C-H, Chin C-Y, Lee Y-H (2021) Coronavirus disease outbreak and supply chain disruption: evidence from Taiwanese firms in China. Res Int Bus Finance 56:101355. https://doi.org/10.1016/j.ribaf.2020.101355

Article   PubMed   Google Scholar  

Wang L, Dai Y, Kong D (2021) Air pollution and employee treatment. J Corporate Finance 70:102067. https://doi.org/10.1016/j.jcorpfin.2021.102067

Xie J, Nozawa W, Yagi M, Fujii H, Managi S (2019) Do environmental, social, and governance activities improve corporate financial performance? Bus Strategy Environ 28:286–300. https://doi.org/10.1002/bse.2224

Yu EP, Luu BV (2021) International variations in ESG disclosure—do cross-listed companies care more. Int Rev Financ Anal 75:101731. https://doi.org/10.1016/j.irfa.2021.101731

Zhang P, Gao J, Li X (2021) Stock liquidity and firm value in the time of COVID-19 pandemic. Emerg Mark Finance Trade 57:1578–1591. https://doi.org/10.1080/1540496X.2021.1898368

Download references

Acknowledgements

This research work was funded by National Social Science Fund of China under the grant (19BGL057).

Author information

Authors and affiliations.

School of Economics & Management, South China Normal University, Guangzhou, Guangdong, China

Zengfu Li, Liuhua Feng & Hafiz M. Sohail

College of Management, Zhongkai University of Agriculture and Engineering, Guangzhou, Guangdong, China

You can also search for this author in PubMed   Google Scholar

Contributions

LHF conceptualized, conducted the econometric analysis, wrote the introduction and conducted the literature review. ZL conceptualized, conducted the econometric analysis and supervised the work. ZP was mainly responsible for writing structure guidance. HMS analyzed the findings and wrote the conclusions and recommendations and edited the entire draft.

Corresponding author

Correspondence to Zengfu Li .

Ethics declarations

Competing interests.

The author declares no competing interests.

Ethical approval

This article does not contain any studies with human participants or animals performed by any of the authors.

Informed consent

Additional information.

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/ .

Reprints and permissions

About this article

Cite this article.

Li, Z., Feng, L., Pan, Z. et al. ESG performance and stock prices: evidence from the COVID-19 outbreak in China. Humanit Soc Sci Commun 9 , 242 (2022). https://doi.org/10.1057/s41599-022-01259-5

Download citation

Received : 01 March 2022

Accepted : 04 July 2022

Published : 18 July 2022

DOI : https://doi.org/10.1057/s41599-022-01259-5

Share this article

Anyone you share the following link with will be able to read this content:

Sorry, a shareable link is not currently available for this article.

Provided by the Springer Nature SharedIt content-sharing initiative

Quick links

  • Explore articles by subject
  • Guide to authors
  • Editorial policies

research paper on share prices

AIP Publishing Logo

Factors affecting share prices: A literature revisit

[email protected]

  • Article contents
  • Figures & tables
  • Supplementary Data
  • Peer Review
  • Reprints and Permissions
  • Cite Icon Cite
  • Search Site

Mohammad Almashaqbeh , Md Aminul Islam , Rosni Bakar; Factors affecting share prices: A literature revisit. AIP Conf. Proc. 3 May 2021; 2339 (1): 020143. https://doi.org/10.1063/5.0045110

Download citation file:

  • Ris (Zotero)
  • Reference Manager

The aim of the paper is to present key factors they are determinants of share prices in the stock markets globally. To achieve the aim of this study many of previous studies have been reviewed. This paper revisits previous studies by various researches on factors affecting the share price in both developed and developing economies. These researchers looked at the relationships between stock prices and selected internal and external factors affecting a firm’s share price. “Some of the factors affecting share prices can be generalized to both developed and developing economies. Some studies have concluded that company fundamentals such as institutional factors and the nature of the firm or business are major factors that affect stock prices”. “However, other studies indicated that firm’s boundaries, organization, governance, ownership pattern, auditing, and value relevance of accounting information is the most influencing factors of stock prices”. “In addition, the effect of inter related factors has been covered in some other studies”. As a revisit, this work derived from secondary sources. This revisit provides researchers and practitioners with the latest improvements of approaches by minimizing their effort in collecting the relevant material and a better understanding of each factor the affecting share prices, draw attention to specific areas for further research. Based on this result the study of ownership structures, audit quality and value relevance of accounting information and relations between them very important and they key factors to determinants of share prices.

Sign in via your Institution

Citing articles via, publish with us - request a quote.

research paper on share prices

Sign up for alerts

  • Online ISSN 1551-7616
  • Print ISSN 0094-243X
  • For Researchers
  • For Librarians
  • For Advertisers
  • Our Publishing Partners  
  • Physics Today
  • Conference Proceedings
  • Special Topics

pubs.aip.org

  • Privacy Policy
  • Terms of Use

Connect with AIP Publishing

This feature is available to subscribers only.

Sign In or Create an Account

  • Open access
  • Published: 28 August 2020

Short-term stock market price trend prediction using a comprehensive deep learning system

  • Jingyi Shen 1 &
  • M. Omair Shafiq   ORCID: orcid.org/0000-0002-1859-8296 1  

Journal of Big Data volume  7 , Article number:  66 ( 2020 ) Cite this article

268k Accesses

164 Citations

91 Altmetric

Metrics details

In the era of big data, deep learning for predicting stock market prices and trends has become even more popular than before. We collected 2 years of data from Chinese stock market and proposed a comprehensive customization of feature engineering and deep learning-based model for predicting price trend of stock markets. The proposed solution is comprehensive as it includes pre-processing of the stock market dataset, utilization of multiple feature engineering techniques, combined with a customized deep learning based system for stock market price trend prediction. We conducted comprehensive evaluations on frequently used machine learning models and conclude that our proposed solution outperforms due to the comprehensive feature engineering that we built. The system achieves overall high accuracy for stock market trend prediction. With the detailed design and evaluation of prediction term lengths, feature engineering, and data pre-processing methods, this work contributes to the stock analysis research community both in the financial and technical domains.

Introduction

Stock market is one of the major fields that investors are dedicated to, thus stock market price trend prediction is always a hot topic for researchers from both financial and technical domains. In this research, our objective is to build a state-of-art prediction model for price trend prediction, which focuses on short-term price trend prediction.

As concluded by Fama in [ 26 ], financial time series prediction is known to be a notoriously difficult task due to the generally accepted, semi-strong form of market efficiency and the high level of noise. Back in 2003, Wang et al. in [ 44 ] already applied artificial neural networks on stock market price prediction and focused on volume, as a specific feature of stock market. One of the key findings by them was that the volume was not found to be effective in improving the forecasting performance on the datasets they used, which was S&P 500 and DJI. Ince and Trafalis in [ 15 ] targeted short-term forecasting and applied support vector machine (SVM) model on the stock price prediction. Their main contribution is performing a comparison between multi-layer perceptron (MLP) and SVM then found that most of the scenarios SVM outperformed MLP, while the result was also affected by different trading strategies. In the meantime, researchers from financial domains were applying conventional statistical methods and signal processing techniques on analyzing stock market data.

The optimization techniques, such as principal component analysis (PCA) were also applied in short-term stock price prediction [ 22 ]. During the years, researchers are not only focused on stock price-related analysis but also tried to analyze stock market transactions such as volume burst risks, which expands the stock market analysis research domain broader and indicates this research domain still has high potential [ 39 ]. As the artificial intelligence techniques evolved in recent years, many proposed solutions attempted to combine machine learning and deep learning techniques based on previous approaches, and then proposed new metrics that serve as training features such as Liu and Wang [ 23 ]. This type of previous works belongs to the feature engineering domain and can be considered as the inspiration of feature extension ideas in our research. Liu et al. in [ 24 ] proposed a convolutional neural network (CNN) as well as a long short-term memory (LSTM) neural network based model to analyze different quantitative strategies in stock markets. The CNN serves for the stock selection strategy, automatically extracts features based on quantitative data, then follows an LSTM to preserve the time-series features for improving profits.

The latest work also proposes a similar hybrid neural network architecture, integrating a convolutional neural network with a bidirectional long short-term memory to predict the stock market index [ 4 ]. While the researchers frequently proposed different neural network solution architectures, it brought further discussions about the topic if the high cost of training such models is worth the result or not.

There are three key contributions of our work (1) a new dataset extracted and cleansed (2) a comprehensive feature engineering, and (3) a customized long short-term memory (LSTM) based deep learning model.

We have built the dataset by ourselves from the data source as an open-sourced data API called Tushare [ 43 ]. The novelty of our proposed solution is that we proposed a feature engineering along with a fine-tuned system instead of just an LSTM model only. We observe from the previous works and find the gaps and proposed a solution architecture with a comprehensive feature engineering procedure before training the prediction model. With the success of feature extension method collaborating with recursive feature elimination algorithms, it opens doors for many other machine learning algorithms to achieve high accuracy scores for short-term price trend prediction. It proved the effectiveness of our proposed feature extension as feature engineering. We further introduced our customized LSTM model and further improved the prediction scores in all the evaluation metrics. The proposed solution outperformed the machine learning and deep learning-based models in similar previous works.

The remainder of this paper is organized as follows. “ Survey of related works ” section describes the survey of related works. “ The dataset ” section provides details on the data that we extracted from the public data sources and the dataset prepared. “ Methods ” section presents the research problems, methods, and design of the proposed solution. Detailed technical design with algorithms and how the model implemented are also included in this section. “ Results ” section presents comprehensive results and evaluation of our proposed model, and by comparing it with the models used in most of the related works. “ Discussion ” section provides a discussion and comparison of the results. “ Conclusion ” section presents the conclusion. This research paper has been built based on Shen [ 36 ].

Survey of related works

In this section, we discuss related works. We reviewed the related work in two different domains: technical and financial, respectively.

Kim and Han in [ 19 ] built a model as a combination of artificial neural networks (ANN) and genetic algorithms (GAs) with discretization of features for predicting stock price index. The data used in their study include the technical indicators as well as the direction of change in the daily Korea stock price index (KOSPI). They used the data containing samples of 2928 trading days, ranging from January 1989 to December 1998, and give their selected features and formulas. They also applied optimization of feature discretization, as a technique that is similar to dimensionality reduction. The strengths of their work are that they introduced GA to optimize the ANN. First, the amount of input features and processing elements in the hidden layer are 12 and not adjustable. Another limitation is in the learning process of ANN, and the authors only focused on two factors in optimization. While they still believed that GA has great potential for feature discretization optimization. Our initialized feature pool refers to the selected features. Qiu and Song in [ 34 ] also presented a solution to predict the direction of the Japanese stock market based on an optimized artificial neural network model. In this work, authors utilize genetic algorithms together with artificial neural network based models, and name it as a hybrid GA-ANN model.

Piramuthu in [ 33 ] conducted a thorough evaluation of different feature selection methods for data mining applications. He used for datasets, which were credit approval data, loan defaults data, web traffic data, tam, and kiang data, and compared how different feature selection methods optimized decision tree performance. The feature selection methods he compared included probabilistic distance measure: the Bhattacharyya measure, the Matusita measure, the divergence measure, the Mahalanobis distance measure, and the Patrick-Fisher measure. For inter-class distance measures: the Minkowski distance measure, city block distance measure, Euclidean distance measure, the Chebychev distance measure, and the nonlinear (Parzen and hyper-spherical kernel) distance measure. The strength of this paper is that the author evaluated both probabilistic distance-based and several inter-class feature selection methods. Besides, the author performed the evaluation based on different datasets, which reinforced the strength of this paper. However, the evaluation algorithm was a decision tree only. We cannot conclude if the feature selection methods will still perform the same on a larger dataset or a more complex model.

Hassan and Nath in [ 9 ] applied the Hidden Markov Model (HMM) on the stock market forecasting on stock prices of four different Airlines. They reduce states of the model into four states: the opening price, closing price, the highest price, and the lowest price. The strong point of this paper is that the approach does not need expert knowledge to build a prediction model. While this work is limited within the industry of Airlines and evaluated on a very small dataset, it may not lead to a prediction model with generality. One of the approaches in stock market prediction related works could be exploited to do the comparison work. The authors selected a maximum 2 years as the date range of training and testing dataset, which provided us a date range reference for our evaluation part.

Lei in [ 21 ] exploited Wavelet Neural Network (WNN) to predict stock price trends. The author also applied Rough Set (RS) for attribute reduction as an optimization. Rough Set was utilized to reduce the stock price trend feature dimensions. It was also used to determine the structure of the Wavelet Neural Network. The dataset of this work consists of five well-known stock market indices, i.e., (1) SSE Composite Index (China), (2) CSI 300 Index (China), (3) All Ordinaries Index (Australian), (4) Nikkei 225 Index (Japan), and (5) Dow Jones Index (USA). Evaluation of the model was based on different stock market indices, and the result was convincing with generality. By using Rough Set for optimizing the feature dimension before processing reduces the computational complexity. However, the author only stressed the parameter adjustment in the discussion part but did not specify the weakness of the model itself. Meanwhile, we also found that the evaluations were performed on indices, the same model may not have the same performance if applied on a specific stock.

Lee in [ 20 ] used the support vector machine (SVM) along with a hybrid feature selection method to carry out prediction of stock trends. The dataset in this research is a sub dataset of NASDAQ Index in Taiwan Economic Journal Database (TEJD) in 2008. The feature selection part was using a hybrid method, supported sequential forward search (SSFS) played the role of the wrapper. Another advantage of this work is that they designed a detailed procedure of parameter adjustment with performance under different parameter values. The clear structure of the feature selection model is also heuristic to the primary stage of model structuring. One of the limitations was that the performance of SVM was compared to back-propagation neural network (BPNN) only and did not compare to the other machine learning algorithms.

Sirignano and Cont leveraged a deep learning solution trained on a universal feature set of financial markets in [ 40 ]. The dataset used included buy and sell records of all transactions, and cancellations of orders for approximately 1000 NASDAQ stocks through the order book of the stock exchange. The NN consists of three layers with LSTM units and a feed-forward layer with rectified linear units (ReLUs) at last, with stochastic gradient descent (SGD) algorithm as an optimization. Their universal model was able to generalize and cover the stocks other than the ones in the training data. Though they mentioned the advantages of a universal model, the training cost was still expensive. Meanwhile, due to the inexplicit programming of the deep learning algorithm, it is unclear that if there are useless features contaminated when feeding the data into the model. Authors found out that it would have been better if they performed feature selection part before training the model and found it as an effective way to reduce the computational complexity.

Ni et al. in [ 30 ] predicted stock price trends by exploiting SVM and performed fractal feature selection for optimization. The dataset they used is the Shanghai Stock Exchange Composite Index (SSECI), with 19 technical indicators as features. Before processing the data, they optimized the input data by performing feature selection. When finding the best parameter combination, they also used a grid search method, which is k cross-validation. Besides, the evaluation of different feature selection methods is also comprehensive. As the authors mentioned in their conclusion part, they only considered the technical indicators but not macro and micro factors in the financial domain. The source of datasets that the authors used was similar to our dataset, which makes their evaluation results useful to our research. They also mentioned a method called k cross-validation when testing hyper-parameter combinations.

McNally et al. in [ 27 ] leveraged RNN and LSTM on predicting the price of Bitcoin, optimized by using the Boruta algorithm for feature engineering part, and it works similarly to the random forest classifier. Besides feature selection, they also used Bayesian optimization to select LSTM parameters. The Bitcoin dataset ranged from the 19th of August 2013 to 19th of July 2016. Used multiple optimization methods to improve the performance of deep learning methods. The primary problem of their work is overfitting. The research problem of predicting Bitcoin price trend has some similarities with stock market price prediction. Hidden features and noises embedded in the price data are threats of this work. The authors treated the research question as a time sequence problem. The best part of this paper is the feature engineering and optimization part; we could replicate the methods they exploited in our data pre-processing.

Weng et al. in [ 45 ] focused on short-term stock price prediction by using ensemble methods of four well-known machine learning models. The dataset for this research is five sets of data. They obtained these datasets from three open-sourced APIs and an R package named TTR. The machine learning models they used are (1) neural network regression ensemble (NNRE), (2) a Random Forest with unpruned regression trees as base learners (RFR), (3) AdaBoost with unpruned regression trees as base learners (BRT) and (4) a support vector regression ensemble (SVRE). A thorough study of ensemble methods specified for short-term stock price prediction. With background knowledge, the authors selected eight technical indicators in this study then performed a thoughtful evaluation of five datasets. The primary contribution of this paper is that they developed a platform for investors using R, which does not need users to input their own data but call API to fetch the data from online source straightforward. From the research perspective, they only evaluated the prediction of the price for 1 up to 10 days ahead but did not evaluate longer terms than two trading weeks or a shorter term than 1 day. The primary limitation of their research was that they only analyzed 20 U.S.-based stocks, the model might not be generalized to other stock market or need further revalidation to see if it suffered from overfitting problems.

Kara et al. in [ 17 ] also exploited ANN and SVM in predicting the movement of stock price index. The data set they used covers a time period from January 2, 1997, to December 31, 2007, of the Istanbul Stock Exchange. The primary strength of this work is its detailed record of parameter adjustment procedures. While the weaknesses of this work are that neither the technical indicator nor the model structure has novelty, and the authors did not explain how their model performed better than other models in previous works. Thus, more validation works on other datasets would help. They explained how ANN and SVM work with stock market features, also recorded the parameter adjustment. The implementation part of our research could benefit from this previous work.

Jeon et al. in [ 16 ] performed research on millisecond interval-based big dataset by using pattern graph tracking to complete stock price prediction tasks. The dataset they used is a millisecond interval-based big dataset of historical stock data from KOSCOM, from August 2014 to October 2014, 10G–15G capacity. The author applied Euclidean distance, Dynamic Time Warping (DTW) for pattern recognition. For feature selection, they used stepwise regression. The authors completed the prediction task by ANN and Hadoop and RHive for big data processing. The “ Results ” section is based on the result processed by a combination of SAX and Jaro–Winkler distance. Before processing the data, they generated aggregated data at 5-min intervals from discrete data. The primary strength of this work is the explicit structure of the whole implementation procedure. While they exploited a relatively old model, another weakness is the overall time span of the training dataset is extremely short. It is difficult to access the millisecond interval-based data in real life, so the model is not as practical as a daily based data model.

Huang et al. in [ 12 ] applied a fuzzy-GA model to complete the stock selection task. They used the key stocks of the 200 largest market capitalization listed as the investment universe in the Taiwan Stock Exchange. Besides, the yearly financial statement data and the stock returns were taken from the Taiwan Economic Journal (TEJ) database at www.tej.com.tw/ for the time period from year 1995 to year 2009. They conducted the fuzzy membership function with model parameters optimized with GA and extracted features for optimizing stock scoring. The authors proposed an optimized model for selection and scoring of stocks. Different from the prediction model, the authors more focused on stock rankings, selection, and performance evaluation. Their structure is more practical among investors. But in the model validation part, they did not compare the model with existed algorithms but the statistics of the benchmark, which made it challenging to identify if GA would outperform other algorithms.

Fischer and Krauss in [ 5 ] applied long short-term memory (LSTM) on financial market prediction. The dataset they used is S&P 500 index constituents from Thomson Reuters. They obtained all month-end constituent lists for the S&P 500 from Dec 1989 to Sep 2015, then consolidated the lists into a binary matrix to eliminate survivor bias. The authors also used RMSprop as an optimizer, which is a mini-batch version of rprop. The primary strength of this work is that the authors used the latest deep learning technique to perform predictions. They relied on the LSTM technique, lack of background knowledge in the financial domain. Although the LSTM outperformed the standard DNN and logistic regression algorithms, while the author did not mention the effort to train an LSTM with long-time dependencies.

Tsai and Hsiao in [ 42 ] proposed a solution as a combination of different feature selection methods for prediction of stocks. They used Taiwan Economic Journal (TEJ) database as data source. The data used in their analysis was from year 2000 to 2007. In their work, they used a sliding window method and combined it with multi layer perceptron (MLP) based artificial neural networks with back propagation, as their prediction model. In their work, they also applied principal component analysis (PCA) for dimensionality reduction, genetic algorithms (GA) and the classification and regression trees (CART) to select important features. They did not just rely on technical indices only. Instead, they also included both fundamental and macroeconomic indices in their analysis. The authors also reported a comparison on feature selection methods. The validation part was done by combining the model performance stats with statistical analysis.

Pimenta et al. in [ 32 ] leveraged an automated investing method by using multi-objective genetic programming and applied it in the stock market. The dataset was obtained from Brazilian stock exchange market (BOVESPA), and the primary techniques they exploited were a combination of multi-objective optimization, genetic programming, and technical trading rules. For optimization, they leveraged genetic programming (GP) to optimize decision rules. The novelty of this paper was in the evaluation part. They included a historical period, which was a critical moment of Brazilian politics and economics when performing validation. This approach reinforced the generalization strength of their proposed model. When selecting the sub-dataset for evaluation, they also set criteria to ensure more asset liquidity. While the baseline of the comparison was too basic and fundamental, and the authors did not perform any comparison with other existing models.

Huang and Tsai in [ 13 ] conducted a filter-based feature selection assembled with a hybrid self-organizing feature map (SOFM) support vector regression (SVR) model to forecast Taiwan index futures (FITX) trend. They divided the training samples into clusters to marginally improve the training efficiency. The authors proposed a comprehensive model, which was a combination of two novel machine learning techniques in stock market analysis. Besides, the optimizer of feature selection was also applied before the data processing to improve the prediction accuracy and reduce the computational complexity of processing daily stock index data. Though they optimized the feature selection part and split the sample data into small clusters, it was already strenuous to train daily stock index data of this model. It would be difficult for this model to predict trading activities in shorter time intervals since the data volume would be increased drastically. Moreover, the evaluation is not strong enough since they set a single SVR model as a baseline, but did not compare the performance with other previous works, which caused difficulty for future researchers to identify the advantages of SOFM-SVR model why it outperforms other algorithms.

Thakur and Kumar in [ 41 ] also developed a hybrid financial trading support system by exploiting multi-category classifiers and random forest (RAF). They conducted their research on stock indices from NASDAQ, DOW JONES, S&P 500, NIFTY 50, and NIFTY BANK. The authors proposed a hybrid model combined random forest (RF) algorithms with a weighted multicategory generalized eigenvalue support vector machine (WMGEPSVM) to generate “Buy/Hold/Sell” signals. Before processing the data, they used Random Forest (RF) for feature pruning. The authors proposed a practical model designed for real-life investment activities, which could generate three basic signals for investors to refer to. They also performed a thorough comparison of related algorithms. While they did not mention the time and computational complexity of their works. Meanwhile, the unignorable issue of their work was the lack of financial domain knowledge background. The investors regard the indices data as one of the attributes but could not take the signal from indices to operate a specific stock straightforward.

Hsu in [ 11 ] assembled feature selection with a back propagation neural network (BNN) combined with genetic programming to predict the stock/futures price. The dataset in this research was obtained from Taiwan Stock Exchange Corporation (TWSE). The authors have introduced the description of the background knowledge in detail. While the weakness of their work is that it is a lack of data set description. This is a combination of the model proposed by other previous works. Though we did not see the novelty of this work, we can still conclude that the genetic programming (GP) algorithm is admitted in stock market research domain. To reinforce the validation strengths, it would be good to consider adding GP models into evaluation if the model is predicting a specific price.

Hafezi et al. in [ 7 ] built a bat-neural network multi-agent system (BN-NMAS) to predict stock price. The dataset was obtained from the Deutsche bundes-bank. They also applied the Bat algorithm (BA) for optimizing neural network weights. The authors illustrated their overall structure and logic of system design in clear flowcharts. While there were very few previous works that had performed on DAX data, it would be difficult to recognize if the model they proposed still has the generality if migrated on other datasets. The system design and feature selection logic are fascinating, which worth referring to. Their findings in optimization algorithms are also valuable for the research in the stock market price prediction research domain. It is worth trying the Bat algorithm (BA) when constructing neural network models.

Long et al. in [ 25 ] conducted a deep learning approach to predict the stock price movement. The dataset they used is the Chinese stock market index CSI 300. For predicting the stock price movement, they constructed a multi-filter neural network (MFNN) with stochastic gradient descent (SGD) and back propagation optimizer for learning NN parameters. The strength of this paper is that the authors exploited a novel model with a hybrid model constructed by different kinds of neural networks, it provides an inspiration for constructing hybrid neural network structures.

Atsalakis and Valavanis in [ 1 ] proposed a solution of a neuro-fuzzy system, which is composed of controller named as Adaptive Neuro Fuzzy Inference System (ANFIS), to achieve short-term stock price trend prediction. The noticeable strength of this work is the evaluation part. Not only did they compare their proposed system with the popular data models, but also compared with investment strategies. While the weakness that we found from their proposed solution is that their solution architecture is lack of optimization part, which might limit their model performance. Since our proposed solution is also focusing on short-term stock price trend prediction, this work is heuristic for our system design. Meanwhile, by comparing with the popular trading strategies from investors, their work inspired us to compare the strategies used by investors with techniques used by researchers.

Nekoeiqachkanloo et al. in [ 29 ] proposed a system with two different approaches for stock investment. The strengths of their proposed solution are obvious. First, it is a comprehensive system that consists of data pre-processing and two different algorithms to suggest the best investment portions. Second, the system also embedded with a forecasting component, which also retains the features of the time series. Last but not least, their input features are a mix of fundamental features and technical indices that aim to fill in the gap between the financial domain and technical domain. However, their work has a weakness in the evaluation part. Instead of evaluating the proposed system on a large dataset, they chose 25 well-known stocks. There is a high possibility that the well-known stocks might potentially share some common hidden features.

As another related latest work, Idrees et al. [ 14 ] published a time series-based prediction approach for the volatility of the stock market. ARIMA is not a new approach in the time series prediction research domain. Their work is more focusing on the feature engineering side. Before feeding the features into ARIMA models, they designed three steps for feature engineering: Analyze the time series, identify if the time series is stationary or not, perform estimation by plot ACF and PACF charts and look for parameters. The only weakness of their proposed solution is that the authors did not perform any customization on the existing ARIMA model, which might limit the system performance to be improved.

One of the main weaknesses found in the related works is limited data-preprocessing mechanisms built and used. Technical works mostly tend to focus on building prediction models. When they select the features, they list all the features mentioned in previous works and go through the feature selection algorithm then select the best-voted features. Related works in the investment domain have shown more interest in behavior analysis, such as how herding behaviors affect the stock performance, or how the percentage of inside directors hold the firm’s common stock affects the performance of a certain stock. These behaviors often need a pre-processing procedure of standard technical indices and investment experience to recognize.

In the related works, often a thorough statistical analysis is performed based on a special dataset and conclude new features rather than performing feature selections. Some data, such as the percentage of a certain index fluctuation has been proven to be effective on stock performance. We believe that by extracting new features from data, then combining such features with existed common technical indices will significantly benefit the existing and well-tested prediction models.

The dataset

This section details the data that was extracted from the public data sources, and the final dataset that was prepared. Stock market-related data are diverse, so we first compared the related works from the survey of financial research works in stock market data analysis to specify the data collection directions. After collecting the data, we defined a data structure of the dataset. Given below, we describe the dataset in detail, including the data structure, and data tables in each category of data with the segment definitions.

Description of our dataset

In this section, we will describe the dataset in detail. This dataset consists of 3558 stocks from the Chinese stock market. Besides the daily price data, daily fundamental data of each stock ID, we also collected the suspending and resuming history, top 10 shareholders, etc. We list two reasons that we choose 2 years as the time span of this dataset: (1) most of the investors perform stock market price trend analysis using the data within the latest 2 years, (2) using more recent data would benefit the analysis result. We collected data through the open-sourced API, namely Tushare [ 43 ], mean-while we also leveraged a web-scraping technique to collect data from Sina Finance web pages, SWS Research website.

Data structure

Figure  1 illustrates all the data tables in the dataset. We collected four categories of data in this dataset: (1) basic data, (2) trading data, (3) finance data, and (4) other reference data. All the data tables can be linked to each other by a common field called “Stock ID” It is a unique stock identifier registered in the Chinese Stock market. Table  1 shows an overview of the dataset.

figure 1

Data structure for the extracted dataset

The Table  1 lists the field information of each data table as well as which category the data table belongs to.

In this section, we present the proposed methods and the design of the proposed solution. Moreover, we also introduce the architecture design as well as algorithmic and implementation details.

Problem statement

We analyzed the best possible approach for predicting short-term price trends from different aspects: feature engineering, financial domain knowledge, and prediction algorithm. Then we addressed three research questions in each aspect, respectively: How can feature engineering benefit model prediction accuracy? How do findings from the financial domain benefit prediction model design? And what is the best algorithm for predicting short-term price trends?

The first research question is about feature engineering. We would like to know how the feature selection method benefits the performance of prediction models. From the abundance of the previous works, we can conclude that stock price data embedded with a high level of noise, and there are also correlations between features, which makes the price prediction notoriously difficult. That is also the primary reason for most of the previous works introduced the feature engineering part as an optimization module.

The second research question is evaluating the effectiveness of findings we extracted from the financial domain. Different from the previous works, besides the common evaluation of data models such as the training costs and scores, our evaluation will emphasize the effectiveness of newly added features that we extracted from the financial domain. We introduce some features from the financial domain. While we only obtained some specific findings from previous works, and the related raw data needs to be processed into usable features. After extracting related features from the financial domain, we combine the features with other common technical indices for voting out the features with a higher impact. There are numerous features said to be effective from the financial domain, and it would be impossible for us to cover all of them. Thus, how to appropriately convert the findings from the financial domain to a data processing module of our system design is a hidden research question that we attempt to answer.

The third research question is that which algorithms are we going to model our data? From the previous works, researchers have been putting efforts into the exact price prediction. We decompose the problem into predicting the trend and then the exact number. This paper focuses on the first step. Hence, the objective has been converted to resolve a binary classification problem, meanwhile, finding an effective way to eliminate the negative effect brought by the high level of noise. Our approach is to decompose the complex problem into sub-problems which have fewer dependencies and resolve them one by one, and then compile the resolutions into an ensemble model as an aiding system for investing behavior reference.

In the previous works, researchers have been using a variety of models for predicting stock price trends. While most of the best-performed models are based on machine learning techniques, in this work, we will compare our approach with the outperformed machine learning models in the evaluation part and find the solution for this research question.

Proposed solution

The high-level architecture of our proposed solution could be separated into three parts. First is the feature selection part, to guarantee the selected features are highly effective. Second, we look into the data and perform the dimensionality reduction. And the last part, which is the main contribution of our work is to build a prediction model of target stocks. Figure  2 depicts a high-level architecture of the proposed solution.

figure 2

High-level architecture of the proposed solution

There are ways to classify different categories of stocks. Some investors prefer long-term investments, while others show more interest in short-term investments. It is common to see the stock-related reports showing an average performance, while the stock price is increasing drastically; this is one of the phenomena that indicate the stock price prediction has no fixed rules, thus finding effective features before training a model on data is necessary.

In this research, we focus on the short-term price trend prediction. Currently, we only have the raw data with no labels. So, the very first step is to label the data. We mark the price trend by comparing the current closing price with the closing price of n trading days ago, the range of n is from 1 to 10 since our research is focusing on the short-term. If the price trend goes up, we mark it as 1 or mark as 0 in the opposite case. To be more specified, we use the indices from the indices of n  −  1 th day to predict the price trend of the n th day.

According to the previous works, some researchers who applied both financial domain knowledge and technical methods on stock data were using rules to filter the high-quality stocks. We referred to their works and exploited their rules to contribute to our feature extension design.

However, to ensure the best performance of the prediction model, we will look into the data first. There are a large number of features in the raw data; if we involve all the features into our consideration, it will not only drastically increase the computational complexity but will also cause side effects if we would like to perform unsupervised learning in further research. So, we leverage the recursive feature elimination (RFE) to ensure all the selected features are effective.

We found most of the previous works in the technical domain were analyzing all the stocks, while in the financial domain, researchers prefer to analyze the specific scenario of investment, to fill the gap between the two domains, we decide to apply a feature extension based on the findings we gathered from the financial domain before we start the RFE procedure.

Since we plan to model the data into time series, the number of the features, the more complex the training procedure will be. So, we will leverage the dimensionality reduction by using randomized PCA at the beginning of our proposed solution architecture.

Detailed technical design elaboration

This section provides an elaboration of the detailed technical design as being a comprehensive solution based on utilizing, combining, and customizing several existing data preprocessing, feature engineering, and deep learning techniques. Figure  3 provides the detailed technical design from data processing to prediction, including the data exploration. We split the content by main procedures, and each procedure contains algorithmic steps. Algorithmic details are elaborated in the next section. The contents of this section will focus on illustrating the data workflow.

figure 3

Detailed technical design of the proposed solution

Based on the literature review, we select the most commonly used technical indices and then feed them into the feature extension procedure to get the expanded feature set. We will select the most effective i features from the expanded feature set. Then we will feed the data with i selected features into the PCA algorithm to reduce the dimension into j features. After we get the best combination of i and j , we process the data into finalized the feature set and feed them into the LSTM [ 10 ] model to get the price trend prediction result.

The novelty of our proposed solution is that we will not only apply the technical method on raw data but also carry out the feature extensions that are used among stock market investors. Details on feature extension are given in the next subsection. Experiences gained from applying and optimizing deep learning based solutions in [ 37 , 38 ] were taken into account while designing and customizing feature engineering and deep learning solution in this work.

Applying feature extension

The first main procedure in Fig.  3 is the feature extension. In this block, the input data is the most commonly used technical indices concluded from related works. The three feature extension methods are max–min scaling, polarizing, and calculating fluctuation percentage. Not all the technical indices are applicable for all three of the feature extension methods; this procedure only applies the meaningful extension methods on technical indices. We choose meaningful extension methods while looking at how the indices are calculated. The technical indices and the corresponding feature extension methods are illustrated in Table  2 .

After the feature extension procedure, the expanded features will be combined with the most commonly used technical indices, i.e., input data with output data, and feed into RFE block as input data in the next step.

Applying recursive feature elimination

After the feature extension above, we explore the most effective i features by using the Recursive Feature Elimination (RFE) algorithm [ 6 ]. We estimate all the features by two attributes, coefficient, and feature importance. We also limit the features that remove from the pool by one, which means we will remove one feature at each step and retain all the relevant features. Then the output of the RFE block will be the input of the next step, which refers to PCA.

Applying principal component analysis (PCA)

The very first step before leveraging PCA is feature pre-processing. Because some of the features after RFE are percentage data, while others are very large numbers, i.e., the output from RFE are in different units. It will affect the principal component extraction result. Thus, before feeding the data into the PCA algorithm [ 8 ], a feature pre-processing is necessary. We also illustrate the effectiveness and methods comparison in “ Results ” section.

After performing feature pre-processing, the next step is to feed the processed data with selected i features into the PCA algorithm to reduce the feature matrix scale into j features. This step is to retain as many effective features as possible and meanwhile eliminate the computational complexity of training the model. This research work also evaluates the best combination of i and j, which has relatively better prediction accuracy, meanwhile, cuts the computational consumption. The result can be found in the “ Results ” section, as well. After the PCA step, the system will get a reshaped matrix with j columns.

Fitting long short-term memory (LSTM) model

PCA reduced the dimensions of the input data, while the data pre-processing is mandatory before feeding the data into the LSTM layer. The reason for adding the data pre-processing step before the LSTM model is that the input matrix formed by principal components has no time steps. While one of the most important parameters of training an LSTM is the number of time steps. Hence, we have to model the matrix into corresponding time steps for both training and testing dataset.

After performing the data pre-processing part, the last step is to feed the training data into LSTM and evaluate the performance using testing data. As a variant neural network of RNN, even with one LSTM layer, the NN structure is still a deep neural network since it can process sequential data and memorizes its hidden states through time. An LSTM layer is composed of one or more LSTM units, and an LSTM unit consists of cells and gates to perform classification and prediction based on time series data.

The LSTM structure is formed by two layers. The input dimension is determined by j after the PCA algorithm. The first layer is the input LSTM layer, and the second layer is the output layer. The final output will be 0 or 1 indicates if the stock price trend prediction result is going down or going up, as a supporting suggestion for the investors to perform the next investment decision.

Design discussion

Feature extension is one of the novelties of our proposed price trend predicting system. In the feature extension procedure, we use technical indices to collaborate with the heuristic processing methods learned from investors, which fills the gap between the financial research area and technical research area.

Since we proposed a system of price trend prediction, feature engineering is extremely important to the final prediction result. Not only the feature extension method is helpful to guarantee we do not miss the potentially correlated feature, but also feature selection method is necessary for pooling the effective features. The more irrelevant features are fed into the model, the more noise would be introduced. Each main procedure is carefully considered contributing to the whole system design.

Besides the feature engineering part, we also leverage LSTM, the state-of-the-art deep learning method for time-series prediction, which guarantees the prediction model can capture both complex hidden pattern and the time-series related pattern.

It is known that the training cost of deep learning models is expansive in both time and hardware aspects; another advantage of our system design is the optimization procedure—PCA. It can retain the principal components of the features while reducing the scale of the feature matrix, thus help the system to save the training cost of processing the large time-series feature matrix.

Algorithm elaboration

This section provides comprehensive details on the algorithms we built while utilizing and customizing different existing techniques. Details about the terminologies, parameters, as well as optimizers. From the legend on the right side of Fig.  3 , we note the algorithm steps as octagons, all of them can be found in this “ Algorithm elaboration ” section.

Before dive deep into the algorithm steps, here is the brief introduction of data pre-processing: since we will go through the supervised learning algorithms, we also need to program the ground truth. The ground truth of this research is programmed by comparing the closing price of the current trading date with the closing price of the previous trading date the users want to compare with. Label the price increase as 1, else the ground truth will be labeled as 0. Because this research work is not only focused on predicting the price trend of a specific period of time but short-term in general, the ground truth processing is according to a range of trading days. While the algorithms will not change with the prediction term length, we can regard the term length as a parameter.

The algorithmic detail is elaborated, respectively, the first algorithm is the hybrid feature engineering part for preparing high-quality training and testing data. It corresponds to the Feature extension, RFE, and PCA blocks in Fig.  3 . The second algorithm is the LSTM procedure block, including time-series data pre-processing, NN constructing, training, and testing.

Algorithm 1: Short-term stock market price trend prediction—applying feature engineering using FE + RFE + PCA

The function FE is corresponding to the feature extension block. For the feature extension procedure, we apply three different processing methods to translate the findings from the financial domain to a technical module in our system design. While not all the indices are applicable for expanding, we only choose the proper method(s) for certain features to perform the feature extension (FE), according to Table  2 .

Normalize method preserves the relative frequencies of the terms, and transform the technical indices into the range of [0, 1]. Polarize is a well-known method often used by real-world investors, sometimes they prefer to consider if the technical index value is above or below zero, we program some of the features using polarize method and prepare for RFE. Max-min (or min-max) [ 35 ] scaling is a transformation method often used as an alternative to zero mean and unit variance scaling. Another well-known method used is fluctuation percentage, and we transform the technical indices fluctuation percentage into the range of [− 1, 1].

The function RFE () in the first algorithm refers to recursive feature elimination. Before we perform the training data scale reduction, we will have to make sure that the features we selected are effective. Ineffective features will not only drag down the classification precision but also add more computational complexity. For the feature selection part, we choose recursive feature elimination (RFE). As [ 45 ] explained, the process of recursive feature elimination can be split into the ranking algorithm, resampling, and external validation.

For the ranking algorithm, it fits the model to the features and ranks by the importance to the model. We set the parameter to retain i numbers of features, and at each iteration of feature selection retains Si top-ranked features, then refit the model and assess the performance again to begin another iteration. The ranking algorithm will eventually determine the top Si features.

The RFE algorithm is known to have suffered from the over-fitting problem. To eliminate the over-fitting issue, we will run the RFE algorithm multiple times on randomly selected stocks as the training set and ensure all the features we select are high-weighted. This procedure is called data resampling. Resampling can be built as an optimization step as an outer layer of the RFE algorithm.

The last part of our hybrid feature engineering algorithm is for optimization purposes. For the training data matrix scale reduction, we apply Randomized principal component analysis (PCA) [ 31 ], before we decide the features of the classification model.

Financial ratios of a listed company are used to present the growth ability, earning ability, solvency ability, etc. Each financial ratio consists of a set of technical indices, each time we add a technical index (or feature) will add another column of data into the data matrix and will result in low training efficiency and redundancy. If non-relevant or less relevant features are included in training data, it will also decrease the precision of classification.

figure a

The above equation represents the explanation power of principal components extracted by PCA method for original data. If an ACR is below 85%, the PCA method would be unsuitable due to a loss of original information. Because the covariance matrix is sensitive to the order of magnitudes of data, there should be a data standardize procedure before performing the PCA. The commonly used standardized methods are mean-standardization and normal-standardization and are noted as given below:

Mean-standardization: \(X_{ij}^{*} = X_{ij} /\overline{{X_{j} }}\) , which \(\overline{{X_{j} }}\) represents the mean value.

Normal-standardization: \(X_{ij}^{*} = (X_{ij} - \overline{{X_{j} }} )/s_{j}\) , which \(\overline{{X_{j} }}\) represents the mean value, and \(s_{j}\) is the standard deviation.

The array fe_array is defined according to Table  2 , row number maps to the features, columns 0, 1, 2, 3 note for the extension methods of normalize, polarize, max–min scale, and fluctuation percentage, respectively. Then we fill in the values for the array by the rule where 0 stands for no necessity to expand and 1 for features need to apply the corresponding extension methods. The final algorithm of data preprocessing using RFE and PCA can be illustrated as Algorithm 1.

Algorithm 2: Price trend prediction model using LSTM

After the principal component extraction, we will get the scale-reduced matrix, which means i most effective features are converted into j principal components for training the prediction model. We utilized an LSTM model and added a conversion procedure for our stock price dataset. The detailed algorithm design is illustrated in Alg 2. The function TimeSeriesConversion () converts the principal components matrix into time series by shifting the input data frame according to the number of time steps [ 3 ], i.e., term length in this research. The processed dataset consists of the input sequence and forecast sequence. In this research, the parameter of LAG is 1, because the model is detecting the pattern of features fluctuation on a daily basis. Meanwhile, the N_TIME_STEPS is varied from 1 trading day to 10 trading days. The functions DataPartition (), FitModel (), EvaluateModel () are regular steps without customization. The NN structure design, optimizer decision, and other parameters are illustrated in function ModelCompile () .

Some procedures impact the efficiency but do not affect the accuracy or precision and vice versa, while other procedures may affect both efficiency and prediction result. To fully evaluate our algorithm design, we structure the evaluation part by main procedures and evaluate how each procedure affects the algorithm performance. First, we evaluated our solution on a machine with 2.2 GHz i7 processor, with 16 GB of RAM. Furthermore, we also evaluated our solution on Amazon EC2 instance, 3.1 GHz Processor with 16 vCPUs, and 64 GB RAM.

In the implementation part, we expanded 20 features into 54 features, while we retain 30 features that are the most effective. In this section, we discuss the evaluation of feature selection. The dataset was divided into two different subsets, i.e., training and testing datasets. Test procedure included two parts, one testing dataset is for feature selection, and another one is for model testing. We note the feature selection dataset and model testing dataset as DS_test_f and DS_test_m, respectively.

We randomly selected two-thirds of the stock data by stock ID for RFE training and note the dataset as DS_train_f; all the data consist of full technical indices and expanded features throughout 2018. The estimator of the RFE algorithm is SVR with linear kernels. We rank the 54 features by voting and get 30 effective features then process them using the PCA algorithm to perform dimension reduction and reduce the features into 20 principal components. The rest of the stock data forms the testing dataset DS_test_f to validate the effectiveness of principal components we extracted from selected features. We reformed all the data from 2018 as the training dataset of the data model and noted as DS_train_m. The model testing dataset DS_test_m consists of the first 3 months of data in 2019, which has no overlap with the dataset we utilized in the previous steps. This approach is to prevent the hidden problem caused by overfitting.

Term length

To build an efficient prediction model, instead of the approach of modeling the data to time series, we determined to use 1 day ahead indices data to predict the price trend of the next day. We tested the RFE algorithm on a range of short-term from 1 day to 2 weeks (ten trading days), to evaluate how the commonly used technical indices correlated to price trends. For evaluating the prediction term length, we fully expanded the features as Table  2 , and feed them to RFE. During the test, we found that different length of the term has a different level of sensitive-ness to the same indices set.

We get the close price of the first trading date and compare it with the close price of the n _ th trading date. Since we are predicting the price trend, we do not consider the term lengths if the cross-validation score is below 0.5. And after the test, as we can see from Fig.  4 , there are three-term lengths that are most sensitive to the indices we selected from the related works. They are n  = {2, 5, 10}, which indicates that price trend prediction of every other day, 1 week, and 2 weeks using the indices set are likely to be more reliable.

figure 4

How do term lengths affect the cross-validation score of RFE

While these curves have different patterns, for the length of 2 weeks, the cross-validation score increases with the number of features selected. If the prediction term length is 1 week, the cross-validation score will decrease if selected over 8 features. For every other day price trend prediction, the best cross-validation score is achieved by selecting 48 features. Biweekly prediction requires 29 features to achieve the best score. In Table  3 , we listed the top 15 effective features for these three-period lengths. If we predict the price trend of every other day, the cross-validation score merely fluctuates with the number of features selected. So, in the next step, we will evaluate the RFE result for these three-term lengths, as shown in Fig.  4 .

We compare the output feature set of RFE with the all-original feature set as a baseline, the all-original feature set consists of n features and we choose n most effective features from RFE output features to evaluate the result using linear SVR. We used two different approaches to evaluate feature effectiveness. The first method is to combine all the data into one large matrix and evaluate them by running the RFE algorithm once. Another method is to run RFE for each individual stock and calculate the most effective features by voting.

Feature extension and RFE

From the result of the previous subsection, we can see that when predicting the price trend for every other day or biweekly, the best result is achieved by selecting a large number of features. Within the selected features, some features processed from extension methods have better ranks than original features, which proves that the feature extension method is useful for optimizing the model. The feature extension affects both precision and efficiency, while in this part, we only discuss the precision aspect and leave efficiency part in the next step since PCA is the most effective method for training efficiency optimization in our design. We involved an evaluation of how feature extension affects RFE and use the test result to measure the improvement of involving feature extension.

We further test the effectiveness of feature extension, i.e., if polarize, max–min scale, and calculate fluctuation percentage works better than original technical indices. The best case to leverage this test is the weekly prediction since it has the least effective feature selected. From the result we got from the last section, we know the best cross-validation score appears when selecting 8 features. The test consists of two steps, and the first step is to test the feature set formed by original features only, in this case, only SLOWK, SLOWD, and RSI_5 are included. The next step is to test the feature set of all 8 features we selected in the previous subsection. We leveraged the test by defining the simplest DNN model with three layers.

The normalized confusion matrix of testing the two feature sets are illustrated in Fig.  5 . The left one is the confusion matrix of the feature set with expanded features, and the right one besides is the test result of using original features only. Both precisions of true positive and true negative have been improved by 7% and 10%, respectively, which proves that our feature extension method design is reasonably effective.

figure 5

Confusion matrix of validating feature extension effectiveness

Feature reduction using principal component analysis

PCA will affect the algorithm performance on both prediction accuracy and training efficiency, while this part should be evaluated with the NN model, so we also defined the simplest DNN model with three layers as we used in the previous step to perform the evaluation. This part introduces the evaluation method and result of the optimization part of the model from computational efficiency and accuracy impact perspectives.

In this section, we will choose bi-weekly prediction to perform a use case analysis, since it has a smoothly increasing cross-validation score curve, moreover, unlike every other day prediction, it has excluded more than 20 ineffective features already. In the first step, we select all 29 effective features and train the NN model without performing PCA. It creates a baseline of the accuracy and training time for comparison. To evaluate the accuracy and efficiency, we keep the number of the principal component as 5, 10, 15, 20, 25. Table  4 recorded how the number of features affects the model training efficiency, then uses the stack bar chart in Fig.  6 to illustrate how PCA affects training efficiency. Table  6 shows accuracy and efficiency analysis on different procedures for the pre-processing of features. The times taken shown in Tables  4 , 6 are based on experiments conducted in a standard user machine to show the viability of our solution with limited or average resource availability.

figure 6

Relationship between feature number and training time

We also listed the confusion matrix of each test in Fig.  7 . The stack bar chart shows that the overall time spends on training the model is decreasing by the number of selected features, while the PCA method is significantly effective in optimizing training dataset preparation. For the time spent on the training stage, PCA is not as effective as the data preparation stage. While there is the possibility that the optimization effect of PCA is not drastic enough because of the simple structure of the NN model.

figure 7

How does the number of principal components affect evaluation results

Table  5 indicates that the overall prediction accuracy is not drastically affected by reducing the dimension. However, the accuracy could not fully support if the PCA has no side effect to model prediction, so we looked into the confusion matrices of test results.

From Fig.  7 we can conclude that PCA does not have a severe negative impact on prediction precision. The true positive rate and false positive rate are barely be affected, while the false negative and true negative rates are influenced by 2% to 4%. Besides evaluating how the number of selected features affects the training efficiency and model performance, we also leveraged a test upon how data pre-processing procedures affect the training procedure and predicting result. Normalizing and max–min scaling is the most commonly seen data pre-procedure performed before PCA, since the measure units of features are varied, and it is said that it could increase the training efficiency afterward.

We leveraged another test on adding pre-procedures before extracting 20 principal components from the original dataset and make the comparison in the aspects of time elapse of training stage and prediction precision. However, the test results lead to different conclusions. In Table  6 we can conclude that feature pre-processing does not have a significant impact on training efficiency, but it does influence the model prediction accuracy. Moreover, the first confusion matrix in Fig.  8 indicates that without any feature pre-processing procedure, the false-negative rate and true negative rate are severely affected, while the true positive rate and false positive rate are not affected. If it performs the normalization before PCA, both true positive rate and true negative rate are decreasing by approximately 10%. This test also proved that the best feature pre-processing method for our feature set is exploiting the max–min scale.

figure 8

Confusion matrices of different feature pre-processing methods

In this section, we discuss and compare the results of our proposed model, other approaches, and the most related works.

Comparison with related works

From the previous works, we found the most commonly exploited models for short-term stock market price trend prediction are support vector machine (SVM), multilayer perceptron artificial neural network (MLP), Naive Bayes classifier (NB), random forest classifier (RAF) and logistic regression classifier (LR). The test case of comparison is also bi-weekly price trend prediction, to evaluate the best result of all models, we keep all 29 features selected by the RFE algorithm. For MLP evaluation, to test if the number of hidden layers would affect the metric scores, we noted layer number as n and tested n  = {1, 3, 5}, 150 training epochs for all the tests, found slight differences in the model performance, which indicates that the variable of MLP layer number hardly affects the metric scores.

From the confusion matrices in Fig.  9 , we can see all the machine learning models perform well when training with the full feature set we selected by RFE. From the perspective of training time, training the NB model got the best efficiency. LR algorithm cost less training time than other algorithms while it can achieve a similar prediction result with other costly models such as SVM and MLP. RAF algorithm achieved a relatively high true-positive rate while the poor performance in predicting negative labels. For our proposed LSTM model, it achieves a binary accuracy of 93.25%, which is a significantly high precision of predicting the bi-weekly price trend. We also pre-processed data through PCA and got five principal components, then trained for 150 epochs. The learning curve of our proposed solution, based on feature engineering and the LSTM model, is illustrated in Fig.  10 . The confusion matrix is the figure on the right in Fig.  11 , and detailed metrics scores can be found in Table  9 .

figure 9

Model prediction comparison—confusion matrices

figure 10

Learning curve of proposed solution

figure 11

Proposed model prediction precision comparison—confusion matrices

The detailed evaluate results are recorded in Table  7 . We will also initiate a discussion upon the evaluation result in the next section.

Because the resulting structure of our proposed solution is different from most of the related works, it would be difficult to make naïve comparison with previous works. For example, it is hard to find the exact accuracy number of price trend prediction in most of the related works since the authors prefer to show the gain rate of simulated investment. Gain rate is a processed number based on simulated investment tests, sometimes one correct investment decision with a large trading volume can achieve a high gain rate regardless of the price trend prediction accuracy. Besides, it is also a unique and heuristic innovation in our proposed solution, we transform the problem of predicting an exact price straight forward to two sequential problems, i.e., predicting the price trend first, focus on building an accurate binary classification model, construct a solid foundation for predicting the exact price change in future works. Besides the different result structure, the datasets that previous works researched on are also different from our work. Some of the previous works involve news data to perform sentiment analysis and exploit the SE part as another system component to support their prediction model.

The latest related work that can compare is Zubair et al. [ 47 ], the authors take multiple r-square for model accuracy measurement. Multiple r-square is also called the coefficient of determination, and it shows the strength of predictor variables explaining the variation in stock return [ 28 ]. They used three datasets (KSE 100 Index, Lucky Cement Stock, Engro Fertilizer Limited) to evaluate the proposed multiple regression model and achieved 95%, 89%, and 97%, respectively. Except for the KSE 100 Index, the dataset choice in this related work is individual stocks; thus, we choose the evaluation result of the first dataset of their proposed model.

We listed the leading stock price trend prediction model performance in Table  8 , from the comparable metrics, the metric scores of our proposed solution are generally better than other related works. Instead of concluding arbitrarily that our proposed model outperformed other models in related works, we first look into the dataset column of Table  8 . By looking into the dataset used by each work [ 18 ], only trained and tested their proposed solution on three individual stocks, which is difficult to prove the generalization of their proposed model. Ayo [ 2 ] leveraged analysis on the stock data from the New York Stock Exchange (NYSE), while the weakness is they only performed analysis on closing price, which is a feature embedded with high noise. Zubair et al. [ 47 ] trained their proposed model on both individual stocks and index price, but as we have mentioned in the previous section, index price only consists of the limited number of features and stock IDs, which will further affect the model training quality. For our proposed solution, we collected sufficient data from the Chinese stock market, and applied FE + RFE algorithm on the original indices to get more effective features, the comprehensive evaluation result of 3558 stock IDs can reasonably explain the generalization and effectiveness of our proposed solution in Chinese stock market. However, the authors of Khaidem and Dey [ 18 ] and Ayo [ 2 ] chose to analyze the stock market in the United States, Zubair et al. [ 47 ] performed analysis on Pakistani stock market price, and we obtained the dataset from Chinese stock market, the policies of different countries might impact the model performance, which needs further research to validate.

Proposed model evaluation—PCA effectiveness

Besides comparing the performance across popular machine learning models, we also evaluated how the PCA algorithm optimizes the training procedure of the proposed LSTM model. We recorded the confusion matrices comparison between training the model by 29 features and by five principal components in Fig.  11 . The model training using the full 29 features takes 28.5 s per epoch on average. While it only takes 18 s on average per epoch training on the feature set of five principal components. PCA has significantly improved the training efficiency of the LSTM model by 36.8%. The detailed metrics data are listed in Table  9 . We will leverage a discussion in the next section about complexity analysis.

Complexity analysis of proposed solution

This section analyzes the complexity of our proposed solution. The Long Short-term Memory is different from other NNs, and it is a variant of standard RNN, which also has time steps with memory and gate architecture. In the previous work [ 46 ], the author performed an analysis of the RNN architecture complexity. They introduced a method to regard RNN as a directed acyclic graph and proposed a concept of recurrent depth, which helps perform the analysis on the intricacy of RNN.

The recurrent depth is a positive rational number, and we denote it as \(d_{rc}\) . As the growth of \(n\) \(d_{rc}\) measures, the nonlinear transformation average maximum number of each time step. We then unfold the directed acyclic graph of RNN and denote the processed graph as \(g_{c}\) , meanwhile, denote \(C(g_{c} )\) as the set of directed cycles in this graph. For the vertex \(v\) , we note \(\sigma_{s} (v)\) as the sum of edge weights and \(l(v)\) as the length. The equation below is proved under a mild assumption, which could be found in [ 46 ].

They also found that another crucial factor that impacts the performance of LSTM, which is the recurrent skip coefficients. We note \(s_{rc}\) as the reciprocal of the recurrent skip coefficient. Please be aware that \(s_{rc}\) is also a positive rational number.

According to the above definition, our proposed model is a 2-layers stacked LSTM, which \(d_{rc} = 2\) and \(s_{rc} = 1\) . From the experiments performed in previous work, the authors also found that when facing the problems of long-term dependency, LSTMs may benefit from decreasing the reciprocal of recurrent skip coefficients and from increasing recurrent depth. The empirical findings above mentioned are useful to enhance the performance of our proposed model further.

This work consists of three parts: data extraction and pre-processing of the Chinese stock market dataset, carrying out feature engineering, and stock price trend prediction model based on the long short-term memory (LSTM). We collected, cleaned-up, and structured 2 years of Chinese stock market data. We reviewed different techniques often used by real-world investors, developed a new algorithm component, and named it as feature extension, which is proved to be effective. We applied the feature expansion (FE) approaches with recursive feature elimination (RFE), followed by principal component analysis (PCA), to build a feature engineering procedure that is both effective and efficient. The system is customized by assembling the feature engineering procedure with an LSTM prediction model, achieved high prediction accuracy that outperforms the leading models in most related works. We also carried out a comprehensive evaluation of this work. By comparing the most frequently used machine learning models with our proposed LSTM model under the feature engineering part of our proposed system, we conclude many heuristic findings that could be future research questions in both technical and financial research domains.

Our proposed solution is a unique customization as compared to the previous works because rather than just proposing yet another state-of-the-art LSTM model, we proposed a fine-tuned and customized deep learning prediction system along with utilization of comprehensive feature engineering and combined it with LSTM to perform prediction. By researching into the observations from previous works, we fill in the gaps between investors and researchers by proposing a feature extension algorithm before recursive feature elimination and get a noticeable improvement in the model performance.

Though we have achieved a decent outcome from our proposed solution, this research has more potential towards research in future. During the evaluation procedure, we also found that the RFE algorithm is not sensitive to the term lengths other than 2-day, weekly, biweekly. Getting more in-depth research into what technical indices would influence the irregular term lengths would be a possible future research direction. Moreover, by combining latest sentiment analysis techniques with feature engineering and deep learning model, there is also a high potential to develop a more comprehensive prediction system which is trained by diverse types of information such as tweets, news, and other text-based data.

Abbreviations

Long short term memory

Principal component analysis

Recurrent neural networks

Artificial neural network

Deep neural network

Dynamic Time Warping

Recursive feature elimination

Support vector machine

Convolutional neural network

Stochastic gradient descent

Rectified linear unit

Multi layer perceptron

Atsalakis GS, Valavanis KP. Forecasting stock market short-term trends using a neuro-fuzzy based methodology. Expert Syst Appl. 2009;36(7):10696–707.

Article   Google Scholar  

Ayo CK. Stock price prediction using the ARIMA model. In: 2014 UKSim-AMSS 16th international conference on computer modelling and simulation. 2014. https://doi.org/10.1109/UKSim.2014.67 .

Brownlee J. Deep learning for time series forecasting: predict the future with MLPs, CNNs and LSTMs in Python. Machine Learning Mastery. 2018. https://machinelearningmastery.com/time-series-prediction-lstm-recurrent-neural-networks-python-keras/

Eapen J, Bein D, Verma A. Novel deep learning model with CNN and bi-directional LSTM for improved stock market index prediction. In: 2019 IEEE 9th annual computing and communication workshop and conference (CCWC). 2019. pp. 264–70. https://doi.org/10.1109/CCWC.2019.8666592 .

Fischer T, Krauss C. Deep learning with long short-term memory networks for financial market predictions. Eur J Oper Res. 2018;270(2):654–69. https://doi.org/10.1016/j.ejor.2017.11.054 .

Article   MathSciNet   MATH   Google Scholar  

Guyon I, Weston J, Barnhill S, Vapnik V. Gene selection for cancer classification using support vector machines. Mach Learn 2002;46:389–422.

Hafezi R, Shahrabi J, Hadavandi E. A bat-neural network multi-agent system (BNNMAS) for stock price prediction: case study of DAX stock price. Appl Soft Comput J. 2015;29:196–210. https://doi.org/10.1016/j.asoc.2014.12.028 .

Halko N, Martinsson PG, Tropp JA. Finding structure with randomness: probabilistic algorithms for constructing approximate matrix decompositions. SIAM Rev. 2001;53(2):217–88.

Article   MathSciNet   Google Scholar  

Hassan MR, Nath B. Stock market forecasting using Hidden Markov Model: a new approach. In: Proceedings—5th international conference on intelligent systems design and applications 2005, ISDA’05. 2005. pp. 192–6. https://doi.org/10.1109/ISDA.2005.85 .

Hochreiter S, Schmidhuber J. Long short-term memory. J Neural Comput. 1997;9(8):1735–80. https://doi.org/10.1162/neco.1997.9.8.1735 .

Hsu CM. A hybrid procedure with feature selection for resolving stock/futures price forecasting problems. Neural Comput Appl. 2013;22(3–4):651–71. https://doi.org/10.1007/s00521-011-0721-4 .

Huang CF, Chang BR, Cheng DW, Chang CH. Feature selection and parameter optimization of a fuzzy-based stock selection model using genetic algorithms. Int J Fuzzy Syst. 2012;14(1):65–75. https://doi.org/10.1016/J.POLYMER.2016.08.021 .

Huang CL, Tsai CY. A hybrid SOFM-SVR with a filter-based feature selection for stock market forecasting. Expert Syst Appl. 2009;36(2 PART 1):1529–39. https://doi.org/10.1016/j.eswa.2007.11.062 .

Idrees SM, Alam MA, Agarwal P. A prediction approach for stock market volatility based on time series data. IEEE Access. 2019;7:17287–98. https://doi.org/10.1109/ACCESS.2019.2895252 .

Ince H, Trafalis TB. Short term forecasting with support vector machines and application to stock price prediction. Int J Gen Syst. 2008;37:677–87. https://doi.org/10.1080/03081070601068595 .

Jeon S, Hong B, Chang V. Pattern graph tracking-based stock price prediction using big data. Future Gener Comput Syst. 2018;80:171–87. https://doi.org/10.1016/j.future.2017.02.010 .

Kara Y, Acar Boyacioglu M, Baykan ÖK. Predicting direction of stock price index movement using artificial neural networks and support vector machines: the sample of the Istanbul Stock Exchange. Expert Syst Appl. 2011;38(5):5311–9. https://doi.org/10.1016/j.eswa.2010.10.027 .

Khaidem L, Dey SR. Predicting the direction of stock market prices using random forest. 2016. pp. 1–20.

Kim K, Han I. Genetic algorithms approach to feature discretization in artificial neural networks for the prediction of stock price index. Expert Syst Appl. 2000;19:125–32. https://doi.org/10.1016/S0957-4174(00)00027-0 .

Lee MC. Using support vector machine with a hybrid feature selection method to the stock trend prediction. Expert Syst Appl. 2009;36(8):10896–904. https://doi.org/10.1016/j.eswa.2009.02.038 .

Lei L. Wavelet neural network prediction method of stock price trend based on rough set attribute reduction. Appl Soft Comput J. 2018;62:923–32. https://doi.org/10.1016/j.asoc.2017.09.029 .

Lin X, Yang Z, Song Y. Expert systems with applications short-term stock price prediction based on echo state networks. Expert Syst Appl. 2009;36(3):7313–7. https://doi.org/10.1016/j.eswa.2008.09.049 .

Liu G, Wang X. A new metric for individual stock trend prediction. Eng Appl Artif Intell. 2019;82(March):1–12. https://doi.org/10.1016/j.engappai.2019.03.019 .

Liu S, Zhang C, Ma J. CNN-LSTM neural network model for quantitative strategy analysis in stock markets. 2017;1:198–206. https://doi.org/10.1007/978-3-319-70096-0 .

Long W, Lu Z, Cui L. Deep learning-based feature engineering for stock price movement prediction. Knowl Based Syst. 2018;164:163–73. https://doi.org/10.1016/j.knosys.2018.10.034 .

Malkiel BG, Fama EF. Efficient capital markets: a review of theory and empirical work. J Finance. 1970;25(2):383–417.

McNally S, Roche J, Caton S. Predicting the price of bitcoin using machine learning. In: Proceedings—26th Euromicro international conference on parallel, distributed, and network-based processing, PDP 2018. pp. 339–43. https://doi.org/10.1109/PDP2018.2018.00060 .

Nagar A, Hahsler M. News sentiment analysis using R to predict stock market trends. 2012. http://past.rinfinance.com/agenda/2012/talk/Nagar+Hahsler.pdf . Accessed 20 July 2019.

Nekoeiqachkanloo H, Ghojogh B, Pasand AS, Crowley M. Artificial counselor system for stock investment. 2019. ArXiv Preprint arXiv:1903.00955 .

Ni LP, Ni ZW, Gao YZ. Stock trend prediction based on fractal feature selection and support vector machine. Expert Syst Appl. 2011;38(5):5569–76. https://doi.org/10.1016/j.eswa.2010.10.079 .

Pang X, Zhou Y, Wang P, Lin W, Chang V. An innovative neural network approach for stock market prediction. J Supercomput. 2018. https://doi.org/10.1007/s11227-017-2228-y .

Pimenta A, Nametala CAL, Guimarães FG, Carrano EG. An automated investing method for stock market based on multiobjective genetic programming. Comput Econ. 2018;52(1):125–44. https://doi.org/10.1007/s10614-017-9665-9 .

Piramuthu S. Evaluating feature selection methods for learning in data mining applications. Eur J Oper Res. 2004;156(2):483–94. https://doi.org/10.1016/S0377-2217(02)00911-6 .

Qiu M, Song Y. Predicting the direction of stock market index movement using an optimized artificial neural network model. PLoS ONE. 2016;11(5):e0155133.

Scikit-learn. Scikit-learn Min-Max Scaler. 2019. https://scikit-learn.org/stable/modules/generated/sklearn.preprocessing.MinMaxScaler.html . Retrieved 26 July 2020.

Shen J. Thesis, “Short-term stock market price trend prediction using a customized deep learning system”, supervised by M. Omair Shafiq, Carleton University. 2019.

Shen J, Shafiq MO. Deep learning convolutional neural networks with dropout—a parallel approach. ICMLA. 2018;2018:572–7.

Google Scholar  

Shen J, Shafiq MO. Learning mobile application usage—a deep learning approach. ICMLA. 2019;2019:287–92.

Shih D. A study of early warning system in volume burst risk assessment of stock with Big Data platform. In: 2019 IEEE 4th international conference on cloud computing and big data analysis (ICCCBDA). 2019. pp. 244–8.

Sirignano J, Cont R. Universal features of price formation in financial markets: perspectives from deep learning. Ssrn. 2018. https://doi.org/10.2139/ssrn.3141294 .

Article   MATH   Google Scholar  

Thakur M, Kumar D. A hybrid financial trading support system using multi-category classifiers and random forest. Appl Soft Comput J. 2018;67:337–49. https://doi.org/10.1016/j.asoc.2018.03.006 .

Tsai CF, Hsiao YC. Combining multiple feature selection methods for stock prediction: union, intersection, and multi-intersection approaches. Decis Support Syst. 2010;50(1):258–69. https://doi.org/10.1016/j.dss.2010.08.028 .

Tushare API. 2018. https://github.com/waditu/tushare . Accessed 1 July 2019.

Wang X, Lin W. Stock market prediction using neural networks: does trading volume help in short-term prediction?. n.d.

Weng B, Lu L, Wang X, Megahed FM, Martinez W. Predicting short-term stock prices using ensemble methods and online data sources. Expert Syst Appl. 2018;112:258–73. https://doi.org/10.1016/j.eswa.2018.06.016 .

Zhang S. Architectural complexity measures of recurrent neural networks, (NIPS). 2016. pp. 1–9.

Zubair M, Fazal A, Fazal R, Kundi M. Development of stock market trend prediction system using multiple regression. Computational and mathematical organization theory. Berlin: Springer US; 2019. https://doi.org/10.1007/s10588-019-09292-7 .

Book   Google Scholar  

Download references

Acknowledgements

This research is supported by Carleton University, in Ottawa, ON, Canada. This research paper has been built based on the thesis [ 36 ] of Jingyi Shen, supervised by M. Omair Shafiq at Carleton University, Canada, available at https://curve.carleton.ca/52e9187a-7f71-48ce-bdfe-e3f6a420e31a .

NSERC and Carleton University.

Author information

Authors and affiliations.

School of Information Technology, Carleton University, Ottawa, ON, Canada

Jingyi Shen & M. Omair Shafiq

You can also search for this author in PubMed   Google Scholar

Contributions

Yes. All authors read and approved the final manuscript.

Corresponding author

Correspondence to M. Omair Shafiq .

Ethics declarations

Competing interests.

The authors declare that they have no competing interests.

Additional information

Publisher's note.

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ .

Reprints and permissions

About this article

Cite this article.

Shen, J., Shafiq, M.O. Short-term stock market price trend prediction using a comprehensive deep learning system. J Big Data 7 , 66 (2020). https://doi.org/10.1186/s40537-020-00333-6

Download citation

Received : 24 January 2020

Accepted : 30 July 2020

Published : 28 August 2020

DOI : https://doi.org/10.1186/s40537-020-00333-6

Share this article

Anyone you share the following link with will be able to read this content:

Sorry, a shareable link is not currently available for this article.

Provided by the Springer Nature SharedIt content-sharing initiative

  • Deep learning
  • Stock market trend
  • Feature engineering

research paper on share prices

Academia.edu no longer supports Internet Explorer.

To browse Academia.edu and the wider internet faster and more securely, please take a few seconds to  upgrade your browser .

Enter the email address you signed up with and we'll email you a reset link.

  • We're Hiring!
  • Help Center

paper cover thumbnail

Determinants of Stock Prices: Empirical Evidence from NSE 100 Companies

Profile image of Shashank Shashu

Related Papers

JMRA Publication

In this study an attempt is made to study and identified corporate fundamental factors which have a significant influence on share price in stock exchange. After review of literature review twenty-one fundamental factors are determined. These factors are classified into six corporate factors applying factor analysis. There are six corporate factors are extracted and are labelled as Profitability Factor, Earning Power and Growth Factor, Return on investment factor, Stock Valuation factor, Volatility Factor and liquidity and solvency factor. After extraction of six factors are subject to correlation and regression analysis. It is evident that the coefficients of 21 fundamental factors of selected companies are represented out of 21 fundamental factors Return on Capital Employed, return on Net Worth, Book Value Per Share, Price Earnings Ratio, return on Assets, Net Profit Margin, size (total assets) are have positive relationship with the market price of the share.

research paper on share prices

Arslan Iqbal

The aim of the study is to explore & examine the determinants of share price in Karachi StockExchangès (KSE) oil & gas and cement sector. The study has employed a panel data approach on oil & gas and cement sector of Karachi Stock Exchange (KSE) over the period of 2008 to 2013. The fixed effect and random effect model has been used to determine the research objective. The study reveals that earning per share and book value per share are positive and significant determinants of share price in both sectors while dividend yield is negatively significant in cement sector. Overall, the conclusion indicating that model is significant and random effect model is appropriate in both sectors of Karachi Stock Exchange (KSE) Keywords: Share price, earning per share, book value per share, dividend yield JEL Classification Codes: G30, G32 Introduction: The determinants of share price are a debatable topic among financial researchers for more than five decades. Stock market is major capital markets which is the biggest source of finance to corporation and provide investment opportunities to the investor in form of share and bonds. Investment in shares contains two types of returns; first one is capital gain and second is dividends but the returns on share investment contain high risk because they are unconfirmed and fluctuations in share price are not explanatory. It depends on internal factors such as dividend per share, earning per share, book value, dividend yield as well as external factors that are interest rate, inflation rate, government policy etc. The initial work on determinants of share price done by Collins (1957) in context of U.S banks, he reveals that dividend, basic earning power and profits affect the stock prices. Empirical studies reveal that various factors in different markets determine the share price. The present study deals with an attempt to study the determinants of share price on the basis of selected accounting formulas. The present study consists of four sections. Section I deals with objective and review of empirical studies. Section II explains data collection and research methodology. Section III is related with descriptive statistics, correlation matrix and regression result which divided into two parts oil & gas and cement sector respectively. Section IV is devoted to the discussion of conclusion.

mihiri wickramasinghe

sciepub.com SciEP

The core purpose of financial management is the maximization of shareholders' wealth with three core decisions that is, dividend, financing and investment decisions. In simple words dividend is the percentage of net profit of a company which is distributed among the shareholders. The research gap identified for this study is that, a comparison of different industries dividend policy impact on the Indian Capital Market and the samples are drawn from the National Stock Exchange (NSE) India. The main objective of this project is to study the influence of dividend policy on the market price of selected company's scrip in NSE. The researcher has chosen Empirical study to prove the hypothesis and 15 companies balance sheet for 10 years (2007-2016) were collected from the respected companies websites and the following ratios have been calculated to test the hypothesis, Net profit ratio, Gross profit ratio, Earning per share, Dividend yield, Dividend payout ratio, Return on equity and the Market price of share. The analyses conducted, the parameter estimates were viewed within AMOS graphics and displayed the standardized parameter estimates. The regression analysis model revealed that the significant impact of the independent variables on the market price of share as well as impact of dividend policy determinants on the outcome variable market price.

Hemal Pandya

The focus of this study is to identify the determinants of share prices in the Indian market. The study uses panel data pertaining to thirty firms comprising BSE SENSEX over the period 2010-2014 and employs the fully modified ordinary least squares method. The results indicate that the variables leverage followed by Price-Earnings Ratio, Profitability and DPS respectively are the major determinants of share prices in Indian stock market. The inverse relationship of Leverage with the share prices indicates that raising capital via debt involves periodic interest payments on part of firms; increased use of debt by a firm would therefore result in higher interest payments and this lowers the earnings available to equity shareholders. Investors therefore generally prefer firms with lower debt.

Muhammad Irfan Chani

Purpose: The volatility in stock prices is one of the most discussed topics in finance. Many studies have been conducted to find the factors which cause fluctuation in stock prices and different results have been found. In this study an attempt has been made to see the affect of dividend yield, dividend payout ratio, return on equity, earning per share and profit after tax on stock prices in Pakistan. For this purpose four non financial sectors (Sugar, Chemical, Food and personal care, Energy) have been selected. A sample of 63 companies listed at Karachi stock exchange was analyzed for the period of 2006-2011. Methodology: Ordinary least square regression model has been applied on panel data. Findings: The results indicate dividend yield and dividend payout ratio which are both measures of dividend policy have significant impact on stock price. Dividend yield is negatively related with stock price and dividend payout ratio is positively related with stock price which means that the...

fatemeh shafie

Marcel Meler

Asian Pacific journal of tropical biomedicine

Sundaram Ravikumar

To identify the larvicidal activity of the seagrass extracts. Seagrass extracts, Syringodium isoetifolium (S. isoetifolium), Cymodocea serrulata and Halophila beccarii, were dissolved in DMSO to prepare a graded series of concentration. Batches of 25 early 4th instars larvae of Aedes aegypti (Ae. aegypti) were transferred to 250 mL enamel bowl containing 199 mL of distilled water and 1 mL of plant extracts (0.01 mg - 0.1 mg). After 24 h the mortality rate was identified with the formulae [(% of test mortality - % of control mortality)/(100 - % of control mortality)] × 100. Each experiment was conducted with three replicates and a concurrent control group. A control group consisted of 1 mL of DMSO and 199 mL of distilled water only. : The root extract of S. isoetifolium showed maximum larvicidal activity with minimum concentration of extract of LC50= 0.0 604 ± 0.0 040)µg/mL with lower confidence limit (LCL) - upper confidence limit (UCL) = (0.051-0.071) and LC90=0.0 972µg/mL followed...

RELATED PAPERS

Cimel Ciencia E Investigacion Medica Estudiantil Latinoamericana

Horacio Deza

Heri Ramdhani

Rabia Batool

Nilufar Abdukhalilova

questia.com

Robert Gable

2008 International Conference on Autonomic Computing

Selim Kalayci

Journal of Education and Community Health

farzaneh jarvandi

Religionsvidenskabeligt Tidsskrift

Troels Engberg-Pedersen

Nature Communications

Brendan Daisley

Business Management Journal

Oktafalia Marisa Muzammil

DEWI KARTIKA

Journal of Phycology

Bernardo Cicchi

Jurnal Revenue : Jurnal Ilmiah Akuntansi

2015 ASEE Annual Conference and Exposition Proceedings

DOAJ (DOAJ: Directory of Open Access Journals)

Mohammad Javan-nikkhah

QMiP Bulletin

Frederick Wertz Sr.

2013 IEEE Photonics Conference

Colin Humphreys

Michael Quirke

Palastren: Jurnal Studi Gender

Juwita Hayyuning Prastiwi

Arthroscopy techniques

Dr. Mauricio González Sánchez

Laurence E. Fritsch

Laurence FRITSCH

Acta crystallographica. Section F, Structural biology communications

Nadine Candoni

RELATED TOPICS

  •   We're Hiring!
  •   Help Center
  • Find new research papers in:
  • Health Sciences
  • Earth Sciences
  • Cognitive Science
  • Mathematics
  • Computer Science
  • Academia ©2024

Our approach

  • Responsibility
  • Infrastructure
  • Try Meta AI

RECOMMENDED READS

  • 5 Steps to Getting Started with Llama 2
  • The Llama Ecosystem: Past, Present, and Future
  • Introducing Code Llama, a state-of-the-art large language model for coding
  • Meta and Microsoft Introduce the Next Generation of Llama
  • Today, we’re introducing Meta Llama 3, the next generation of our state-of-the-art open source large language model.
  • Llama 3 models will soon be available on AWS, Databricks, Google Cloud, Hugging Face, Kaggle, IBM WatsonX, Microsoft Azure, NVIDIA NIM, and Snowflake, and with support from hardware platforms offered by AMD, AWS, Dell, Intel, NVIDIA, and Qualcomm.
  • We’re dedicated to developing Llama 3 in a responsible way, and we’re offering various resources to help others use it responsibly as well. This includes introducing new trust and safety tools with Llama Guard 2, Code Shield, and CyberSec Eval 2.
  • In the coming months, we expect to introduce new capabilities, longer context windows, additional model sizes, and enhanced performance, and we’ll share the Llama 3 research paper.
  • Meta AI, built with Llama 3 technology, is now one of the world’s leading AI assistants that can boost your intelligence and lighten your load—helping you learn, get things done, create content, and connect to make the most out of every moment. You can try Meta AI here .

Today, we’re excited to share the first two models of the next generation of Llama, Meta Llama 3, available for broad use. This release features pretrained and instruction-fine-tuned language models with 8B and 70B parameters that can support a broad range of use cases. This next generation of Llama demonstrates state-of-the-art performance on a wide range of industry benchmarks and offers new capabilities, including improved reasoning. We believe these are the best open source models of their class, period. In support of our longstanding open approach, we’re putting Llama 3 in the hands of the community. We want to kickstart the next wave of innovation in AI across the stack—from applications to developer tools to evals to inference optimizations and more. We can’t wait to see what you build and look forward to your feedback.

Our goals for Llama 3

With Llama 3, we set out to build the best open models that are on par with the best proprietary models available today. We wanted to address developer feedback to increase the overall helpfulness of Llama 3 and are doing so while continuing to play a leading role on responsible use and deployment of LLMs. We are embracing the open source ethos of releasing early and often to enable the community to get access to these models while they are still in development. The text-based models we are releasing today are the first in the Llama 3 collection of models. Our goal in the near future is to make Llama 3 multilingual and multimodal, have longer context, and continue to improve overall performance across core LLM capabilities such as reasoning and coding.

State-of-the-art performance

Our new 8B and 70B parameter Llama 3 models are a major leap over Llama 2 and establish a new state-of-the-art for LLM models at those scales. Thanks to improvements in pretraining and post-training, our pretrained and instruction-fine-tuned models are the best models existing today at the 8B and 70B parameter scale. Improvements in our post-training procedures substantially reduced false refusal rates, improved alignment, and increased diversity in model responses. We also saw greatly improved capabilities like reasoning, code generation, and instruction following making Llama 3 more steerable.

research paper on share prices

*Please see evaluation details for setting and parameters with which these evaluations are calculated.

In the development of Llama 3, we looked at model performance on standard benchmarks and also sought to optimize for performance for real-world scenarios. To this end, we developed a new high-quality human evaluation set. This evaluation set contains 1,800 prompts that cover 12 key use cases: asking for advice, brainstorming, classification, closed question answering, coding, creative writing, extraction, inhabiting a character/persona, open question answering, reasoning, rewriting, and summarization. To prevent accidental overfitting of our models on this evaluation set, even our own modeling teams do not have access to it. The chart below shows aggregated results of our human evaluations across of these categories and prompts against Claude Sonnet, Mistral Medium, and GPT-3.5.

research paper on share prices

Preference rankings by human annotators based on this evaluation set highlight the strong performance of our 70B instruction-following model compared to competing models of comparable size in real-world scenarios.

Our pretrained model also establishes a new state-of-the-art for LLM models at those scales.

research paper on share prices

To develop a great language model, we believe it’s important to innovate, scale, and optimize for simplicity. We adopted this design philosophy throughout the Llama 3 project with a focus on four key ingredients: the model architecture, the pretraining data, scaling up pretraining, and instruction fine-tuning.

Model architecture

In line with our design philosophy, we opted for a relatively standard decoder-only transformer architecture in Llama 3. Compared to Llama 2, we made several key improvements. Llama 3 uses a tokenizer with a vocabulary of 128K tokens that encodes language much more efficiently, which leads to substantially improved model performance. To improve the inference efficiency of Llama 3 models, we’ve adopted grouped query attention (GQA) across both the 8B and 70B sizes. We trained the models on sequences of 8,192 tokens, using a mask to ensure self-attention does not cross document boundaries.

Training data

To train the best language model, the curation of a large, high-quality training dataset is paramount. In line with our design principles, we invested heavily in pretraining data. Llama 3 is pretrained on over 15T tokens that were all collected from publicly available sources. Our training dataset is seven times larger than that used for Llama 2, and it includes four times more code. To prepare for upcoming multilingual use cases, over 5% of the Llama 3 pretraining dataset consists of high-quality non-English data that covers over 30 languages. However, we do not expect the same level of performance in these languages as in English.

To ensure Llama 3 is trained on data of the highest quality, we developed a series of data-filtering pipelines. These pipelines include using heuristic filters, NSFW filters, semantic deduplication approaches, and text classifiers to predict data quality. We found that previous generations of Llama are surprisingly good at identifying high-quality data, hence we used Llama 2 to generate the training data for the text-quality classifiers that are powering Llama 3.

We also performed extensive experiments to evaluate the best ways of mixing data from different sources in our final pretraining dataset. These experiments enabled us to select a data mix that ensures that Llama 3 performs well across use cases including trivia questions, STEM, coding, historical knowledge, etc.

Scaling up pretraining

To effectively leverage our pretraining data in Llama 3 models, we put substantial effort into scaling up pretraining. Specifically, we have developed a series of detailed scaling laws for downstream benchmark evaluations. These scaling laws enable us to select an optimal data mix and to make informed decisions on how to best use our training compute. Importantly, scaling laws allow us to predict the performance of our largest models on key tasks (for example, code generation as evaluated on the HumanEval benchmark—see above) before we actually train the models. This helps us ensure strong performance of our final models across a variety of use cases and capabilities.

We made several new observations on scaling behavior during the development of Llama 3. For example, while the Chinchilla-optimal amount of training compute for an 8B parameter model corresponds to ~200B tokens, we found that model performance continues to improve even after the model is trained on two orders of magnitude more data. Both our 8B and 70B parameter models continued to improve log-linearly after we trained them on up to 15T tokens. Larger models can match the performance of these smaller models with less training compute, but smaller models are generally preferred because they are much more efficient during inference.

To train our largest Llama 3 models, we combined three types of parallelization: data parallelization, model parallelization, and pipeline parallelization. Our most efficient implementation achieves a compute utilization of over 400 TFLOPS per GPU when trained on 16K GPUs simultaneously. We performed training runs on two custom-built 24K GPU clusters . To maximize GPU uptime, we developed an advanced new training stack that automates error detection, handling, and maintenance. We also greatly improved our hardware reliability and detection mechanisms for silent data corruption, and we developed new scalable storage systems that reduce overheads of checkpointing and rollback. Those improvements resulted in an overall effective training time of more than 95%. Combined, these improvements increased the efficiency of Llama 3 training by ~three times compared to Llama 2.

Instruction fine-tuning

To fully unlock the potential of our pretrained models in chat use cases, we innovated on our approach to instruction-tuning as well. Our approach to post-training is a combination of supervised fine-tuning (SFT), rejection sampling, proximal policy optimization (PPO), and direct preference optimization (DPO). The quality of the prompts that are used in SFT and the preference rankings that are used in PPO and DPO has an outsized influence on the performance of aligned models. Some of our biggest improvements in model quality came from carefully curating this data and performing multiple rounds of quality assurance on annotations provided by human annotators.

Learning from preference rankings via PPO and DPO also greatly improved the performance of Llama 3 on reasoning and coding tasks. We found that if you ask a model a reasoning question that it struggles to answer, the model will sometimes produce the right reasoning trace: The model knows how to produce the right answer, but it does not know how to select it. Training on preference rankings enables the model to learn how to select it.

Building with Llama 3

Our vision is to enable developers to customize Llama 3 to support relevant use cases and to make it easier to adopt best practices and improve the open ecosystem. With this release, we’re providing new trust and safety tools including updated components with both Llama Guard 2 and Cybersec Eval 2, and the introduction of Code Shield—an inference time guardrail for filtering insecure code produced by LLMs.

We’ve also co-developed Llama 3 with torchtune , the new PyTorch-native library for easily authoring, fine-tuning, and experimenting with LLMs. torchtune provides memory efficient and hackable training recipes written entirely in PyTorch. The library is integrated with popular platforms such as Hugging Face, Weights & Biases, and EleutherAI and even supports Executorch for enabling efficient inference to be run on a wide variety of mobile and edge devices. For everything from prompt engineering to using Llama 3 with LangChain we have a comprehensive getting started guide and takes you from downloading Llama 3 all the way to deployment at scale within your generative AI application.

A system-level approach to responsibility

We have designed Llama 3 models to be maximally helpful while ensuring an industry leading approach to responsibly deploying them. To achieve this, we have adopted a new, system-level approach to the responsible development and deployment of Llama. We envision Llama models as part of a broader system that puts the developer in the driver’s seat. Llama models will serve as a foundational piece of a system that developers design with their unique end goals in mind.

research paper on share prices

Instruction fine-tuning also plays a major role in ensuring the safety of our models. Our instruction-fine-tuned models have been red-teamed (tested) for safety through internal and external efforts. ​​Our red teaming approach leverages human experts and automation methods to generate adversarial prompts that try to elicit problematic responses. For instance, we apply comprehensive testing to assess risks of misuse related to Chemical, Biological, Cyber Security, and other risk areas. All of these efforts are iterative and used to inform safety fine-tuning of the models being released. You can read more about our efforts in the model card .

Llama Guard models are meant to be a foundation for prompt and response safety and can easily be fine-tuned to create a new taxonomy depending on application needs. As a starting point, the new Llama Guard 2 uses the recently announced MLCommons taxonomy, in an effort to support the emergence of industry standards in this important area. Additionally, CyberSecEval 2 expands on its predecessor by adding measures of an LLM’s propensity to allow for abuse of its code interpreter, offensive cybersecurity capabilities, and susceptibility to prompt injection attacks (learn more in our technical paper ). Finally, we’re introducing Code Shield which adds support for inference-time filtering of insecure code produced by LLMs. This offers mitigation of risks around insecure code suggestions, code interpreter abuse prevention, and secure command execution.

With the speed at which the generative AI space is moving, we believe an open approach is an important way to bring the ecosystem together and mitigate these potential harms. As part of that, we’re updating our Responsible Use Guide (RUG) that provides a comprehensive guide to responsible development with LLMs. As we outlined in the RUG, we recommend that all inputs and outputs be checked and filtered in accordance with content guidelines appropriate to the application. Additionally, many cloud service providers offer content moderation APIs and other tools for responsible deployment, and we encourage developers to also consider using these options.

Deploying Llama 3 at scale

Llama 3 will soon be available on all major platforms including cloud providers, model API providers, and much more. Llama 3 will be everywhere .

Our benchmarks show the tokenizer offers improved token efficiency, yielding up to 15% fewer tokens compared to Llama 2. Also, Group Query Attention (GQA) now has been added to Llama 3 8B as well. As a result, we observed that despite the model having 1B more parameters compared to Llama 2 7B, the improved tokenizer efficiency and GQA contribute to maintaining the inference efficiency on par with Llama 2 7B.

For examples of how to leverage all of these capabilities, check out Llama Recipes which contains all of our open source code that can be leveraged for everything from fine-tuning to deployment to model evaluation.

What’s next for Llama 3?

The Llama 3 8B and 70B models mark the beginning of what we plan to release for Llama 3. And there’s a lot more to come.

Our largest models are over 400B parameters and, while these models are still training, our team is excited about how they’re trending. Over the coming months, we’ll release multiple models with new capabilities including multimodality, the ability to converse in multiple languages, a much longer context window, and stronger overall capabilities. We will also publish a detailed research paper once we are done training Llama 3.

To give you a sneak preview for where these models are today as they continue training, we thought we could share some snapshots of how our largest LLM model is trending. Please note that this data is based on an early checkpoint of Llama 3 that is still training and these capabilities are not supported as part of the models released today.

research paper on share prices

We’re committed to the continued growth and development of an open AI ecosystem for releasing our models responsibly. We have long believed that openness leads to better, safer products, faster innovation, and a healthier overall market. This is good for Meta, and it is good for society. We’re taking a community-first approach with Llama 3, and starting today, these models are available on the leading cloud, hosting, and hardware platforms with many more to come.

Try Meta Llama 3 today

We’ve integrated our latest models into Meta AI, which we believe is the world’s leading AI assistant. It’s now built with Llama 3 technology and it’s available in more countries across our apps.

You can use Meta AI on Facebook, Instagram, WhatsApp, Messenger, and the web to get things done, learn, create, and connect with the things that matter to you. You can read more about the Meta AI experience here .

Visit the Llama 3 website to download the models and reference the Getting Started Guide for the latest list of all available platforms.

You’ll also soon be able to test multimodal Meta AI on our Ray-Ban Meta smart glasses.

As always, we look forward to seeing all the amazing products and experiences you will build with Meta Llama 3.

Our latest updates delivered to your inbox

Subscribe to our newsletter to keep up with Meta AI news, events, research breakthroughs, and more.

Join us in the pursuit of what’s possible with AI.

research paper on share prices

Product experiences

Foundational models

Latest news

Meta © 2024

Do prestigious underwriters shape IPO pricing? A meta-analytic review

  • Review Paper
  • Published: 15 October 2019
  • Volume 15 , pages 573–609, ( 2021 )

Cite this article

research paper on share prices

  • Tiziana La Rocca   ORCID: orcid.org/0000-0001-5064-0100 1  

945 Accesses

6 Citations

Explore all metrics

This paper investigates the controversial effect of underwriter reputation on underpricing by applying a meta-analysis methodology. The aim is to verify whether and to what extent the magnitude of this relationship is conditioned by institutional-specific moderators. Companies that go public with the support of prestigious underwriters report a changing effect on underpricing. The relation between underwriter reputation and underpricing was negative in the seventies, while it shifted to be positive in the nineties and two thousands. The positive effect between underwriter reputation and underpricing is higher in common law countries compared to civil law countries. In particular, there was a strong positive effect in the USA starting in the two thousands. This paper also provides a methodological contribution by the application, for the first time in management topic, of the robust variance estimation method to address the issue of within-paper heterogeneity.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price includes VAT (Russian Federation)

Instant access to the full article PDF.

Rent this article via DeepDyve

Institutional subscriptions

research paper on share prices

Similar content being viewed by others

research paper on share prices

Underpricing of initial public offerings (IPOs) and the credibility of underwriters’ pricing services

research paper on share prices

Does the appointment of the three musketeers reduce IPO underpricing? global evidence

Affiliation ties and underwriter selection.

In addition to the standard measure of underpricing, as suggested by Ritter ( 1984 ), it can be calculated as the difference between the “simple” underpricing above and the market index return measured between the day of the admission to the trading and the beginning of the public offering (Rock 1986 ; McGuinness 1992 ; Carter et al. 1998 ; Ljungqvist 1999 ; Certo et al. 2001a , b ; Loughran and Ritter 2004 ; Chahine and Filatotchev 2008a , b ).

Brau et al. ( 2004 ) support this view by submitting a questionnaire about the decision to go public to 336 chief financial officers (CFOs) of non-financial U.S. companies. The questionnaire asked about positive and negative signals to investors in the context of an IPO. Ninety-one percent of the respondents indicated that profitability is the most positive signal to investors and it consolidates a company’s reputation even more than continuous positive operating results, because investors view past successes as indicating future profits. The second-most-popular opinion of the respondents was that the quality of an investment is further guaranteed by a partnership with a major investment bank (74% of CFOs), and the third-most popular indicator of quality was a venture capitalist’s ownership of an equity stake in a company (40%). According to the majority of the respondents, signals perceived by outsiders to be very negative include the sale of a large part of the capital or the sale of a significant proportion of shares by internal shareholders. The willingness of insiders to quit the company en masse will surely cause investors to believe that the organisation has financial problems.

“… when an underwriter is faced with an IPO that exhibits an unusually high demand among investors, s/he adjusts the offer price to a point above the high end of the initial filing range but significantly below the anticipated market price for the IPO shares. Therefore, investors that disclose their high reservation prices for these high-demand IPOs not only receive a larger allocation of IPO shares but also a higher than normal initial return …” (Cooney et al. 2001 : 17).

The term “study” refers to a single regression inside a paper. To deal with the fact that in each paper there can be many regressions (studies) that are not independent correlations, the robumeta approach will be used.

I also checked that the results were not affected by posing away observations that were more than 3.0 standard deviation far from the mean effect size, instead of the winsorizing approach.

The diversity among the studies that included a meta-analysis necessarily leads to statistical heterogeneity.

Doidge et al. ( 2004 ) show that for a given level of legal protection for investors, the incentive to adopt better governance mechanisms depends on a country’s financial development.

(*Studies included in the meta-analysis, respecting the selection criteria and, in particular, having in the same model empirical measures of underpricing and underwriter reputation)

*Aggarwal RK, Krigman L, Womack KL (2002) Strategic IPO underpricing, information momentum, and lockup expiration selling. J Financ Econ 66(1):105–137

Google Scholar  

*Aggarwal RK, Purnanandam AK, Wu G (2006) Underwriter manipulation in initial public offerings. Unpublished Working Paper, version: January 26, 2006

*AIDEA (2013) Bank reputation and IPO underpricing: evidence from the Istanbul Stock Exchange. Working Paper, Conference AIDEA

Allen F, Faulhaber GR (1989) Signalling by underpricing in the IPO market. Financ Econ 23(2):303–323

Anderson CW, Huang J (2017) Institutional investment in IPOs and post-IPO M&A activity. J Empirical Finance 41(C) 1–18

*Arrfelt M, Mannor M, Nahrgang JD, Christensen AL (2018) All risk-taking is not the same: examining the competing effects of firm risk-taking with meta-analysis. Rev Manag Sci 12(3):621–660

Arthurs JD, Hoskisson RE, Busenitz LW, Johnson RA (2008) Managerial agents watching other agents: multiple agency conflicts regarding underpricing in IPO firms. Acad Manag J 51(2):277–294

*Arthurs JD, Busenitz LW, Johnson RA, Hoskisson RE (2009) Signaling and initial public offerings: the use and impact of lockup period. J Bus Ventur 24(4):360–372

Autore DM, Boulton TJ, Smart SB, Zutter CJ (2014) The impact of institutional quality on initial public offerings. J Econ Bus 73(2):65–96

*Bade M (2018) Determinants of IPO-firms’ merger appetite. Rev Manag Sci. https://doi.org/10.1007/s11846-018-0291-2

Article   Google Scholar  

*Bairagi RK, Dimovski W (2011) The underpricing of REIT IPOs: 1996–2010. J Prop Res 28(3):233–248

Beatty RP (1989) Auditor reputation and the pricing of initial public offerings. Account Rev 64(4):693–709

Beatty RP, Ritter JR (1986) Investment banking, reputation, and the underpricing of initial public offerings. J Financ Econ 15(1–2):213–232

*Beatty R, Welch I (1996) Issuer expenses and legal liability in initial public offerings. J Law Econ 39(3):545–602

*Bédard J, Coulombe D, Courteau L (2008) Audit committee, underpricing of IPOs and accuracy of management earnings forecasts. Corp Gov Int Rev 16(6):519–535

Bergh DD, Connelly BL, Ketchen DJ Jr, Shannon LM (2014) Signalling theory and equilibrium in strategic management research: an assessment and a research agenda. J Manag Stud 51(8):1334–1360

Booth JR, Smith RL (1986) Capital raising, underwriting and the certification hypothesis. J Financ Econ 15(1–2):261–281

Booth JR, Chua L (1996) Ownership dispersion, costly information and IPO underpricing. J Financ Econ 41(2):291–310

Boulton TJ, Smart SB, Zutter CJ (2010) IPO underpricing and international corporate governance. J Int Bus Study 41(2):206–222

Boulton TJ, Smart SB, Zutter CJ (2017) IPO underpricing around the world. In: Cumming D, Johan S (eds) Oxford handbook of IPOs. Oxford University Press, Oxford

*Bozzolan S, Ipino E (2007) Information asymmetries and IPO underpricing: the role of voluntary disclosure of forward-looking information in the prospectus. Working Paper, SSRN 999518

*Bradley DJ, Jordan BD (2002) Partial adjustment to public information and IPO underpricing. J Financ Quant Anal 37(4):595–616

*Brau JC, Brown RA, Osteryoung JS (2004) Do venture capitalists add value to small manufacturing firms? An empirical analysis of venture and non venture capital-backed initial public offerings. J Small Bus Manag 42(1):78–92

*Carter R, Manaster S (1990) initial public offerings and underwriter reputation. J Finance 45(4):1045–1067

*Carter RB, Dark FH, Singh AK (1998) Underwriter reputation, initial returns, and the long-run performance of IPO stocks. J Finance 53(1):285–311

*Certo ST, Covin J, Daily CM, Dalton DR (2001a) Wealth and the effects of founder management among IPO-stage new ventures. Strateg Manag J 22(6–7):641–658

*Certo ST, Daily CM, Dalton DR (2001b) Signaling firm value through board structure: an investigation of initial public offerings. Entrep Theory Pract 26(2):33–50

Certo ST, Holcomb TR, Holmes RM Jr (2009) IPO research in management and entrepreneurship: moving the agenda forward. J Manag 35(6):1340–1378

Chahine S, Filatotchev I (2008a) The effects of information disclosure and board independence on IPO discount. J Small Bus Manag 46(2):219–241

Chahine S, Filatotchev I (2008b) The effects of venture capitalist affiliation to underwriters on short- and long-term performance in French IPOs. Glob Finance J 18(3):351–372

*Chahine S, Goergen M (2011) VC board representation and performance of US IPOs. J Bus Finance Account 38(3–4):413–445

*Chahine S, Saade S (2011) Shareholders’ rights and the effect of the origin of venture capital firms on the underpricing of US IPOs. Corpor Gov Int Rev 19(6):601–621

*Chahine S, Filatotchev I, Wright M (2007) Venture capitalists, business angels, and performance of entrepreneurial IPOs in the UK and France. J Bus Finance Account 34(3/4):505–528

*Chahine S, Filatotchev I, Zahra S (2011) Building perceived quality of founder-involved IPO firms: founders’ effects on board selection and stock market performance. Entrep Theory Pract 35(2):319–335

*Chahine S, Arthurs JD, Filatotchev I, Hoskisson RE (2012) The effects of venture capital syndicate diversity on earnings management and performance of IPOs in the US and UK: an institutional perspective. J Corp Finance 18(1):179–192

Chemmanur TJ, Fulghieri P (1994) Investment bank reputation, information production, and financial intermediation. J Finance 49(1):57–79

*Chen CR, Mohan NJ (2002) Underwriter spread, underwriter reputation, and IPO underpricing: a simultaneous equation analysis. J Bus Finance Account 29(3/4):521–540

*Cheng H-P, Chan C-H (2009) A meta-analytic review of ownership structure, board composition, and IPO underpricing. SSRN Working Paper, SSRN ID = 1343679

*Cliff MT, Denis DJ (2004) Do initial public offering firms purchase analyst coverage with underpricing? J Finance 59(6):2871–2901

*Coakley J, Hadass L, Wood A (2009) UK IPO underpricing and venture capitalists. Eur J Finance 15(4):421–435

Cohen J, Cohen P (1983) Applied multiple regression/correlation analysis for the behavioral sciences, 2nd edn. Erlbaum, Hillsdale

Connelly BL, Certo T, Reutzel CR (2011) Signaling theory: a review and assessment. J Manag 37(1):39–67

*Cook DO, Kieschnick R, Van Ness RA (2006) On the marketing of IPOs. J Financ Econ 82(1):35–61

*Cooney JW, Singh AK, Carter RB, Dark FH (2001) IPO initial returns and underwriter reputation: has the inverse relation flipped in the 1990s? Working Paper, University of Kentucky, Case Western Reserve University and Iowa State University

Daily CM, Certo ST, Dalton DR, Roengpitya R (2003) IPO underpricing: a meta-analysis and research synthesis. Entrep Theory Pract 27(3):271–295

*Dell’Acqua A, Guardasole A, Bonini S (2013) Grandstanding and spinning in VC-backed IPOs on AIM UK. SSRN Working Paper, SSRN-ID = 2280411

*Dimovski W, Philavanh S, Brooks R (2011) Underwriter reputation and underpricing: evidence from the Australian IPO market. Rev Quant Finance Account 37(4):409–426

Djankov S, La Porta R, Lopez-de-Silanes F, Shleifer A (2008) The law and economics of self-dealing. J Financ Econ 88(3):430–465

Doidge C, Karolyi GA, Stulz RM (2004) Why are foreign firms listed in the US worth more? J Financ Econ 71(2):205–238

Dunbar CG (2000) Factors affecting investment banks initial public offering market share. J Financ Econ 55(1):3–41

Dyck A, Zingales L (2004) Private benefits of control: an international comparison. J Finance 59(2):537–600

Easterbrook FH (1984) Two agency-cost explanation of dividends. Am Econ Rev 74(4):650–659

Edelen RM, Kadlec GB (2005) Issuer surplus and the partial adjustment of IPO prices to public information. J Financ Econ 77(2):347–373

Ekkayokkaya M, Pengniti T (2012) Governance reform and IPO underpricing. J Corp Finance 18(2):238–253

Engelen P-J, van Essen M (2010) Underpricing of IPOs: firm-, issue- and country-specific characteristics. J Bank Finance 34(8):1958–1969

*Fang LK (2005) Investment bank reputation and the price and quality of underwriting services. J Finance 60(6):2729–2761

*Fernando CS, Krishnamurthy S, Spindt PA (2004) Are share price levels informative? Evidence from the ownership, pricing, turnover and performance of IPO firms. J Financ Mark 7(4):377–403

*Filatotchev I, Bishop K (2002) Board composition, share ownership, and underpricing of UK IPO firms. Strateg Manag J 23(10):941–955

*Fohlin C (2000) IPO underpricing in two universes: Berlin, 1882–1892, and New York, 1998–2000. Working Paper 1088, California Institute of Technology

*Francis B, Hasan I, Lothian JR, Sun X (2010) The signaling hypothesis revisited: evidence from foreign IPOs. J Financ Quant Anal 45(1):81–106

*Francis B, Hasan I, Zhou M (2012) Strategic conservative earnings management of technology firms: evidence from IPO market. Financ Mark Inst Instrum 21(5):261–293

*Franzke SA (2003) Underpricing of venture-backed and non venture-backed IPOs: Germany’s Neuer Markt. RICAFE working paper (Risk Capital and the Financing of European Innovative Firms), No. 003

Fulghieri P, Spiegel J (1991) Underpriced initial public offers as a multi-product investment house strategy. Paper fb-21, Columbia—Graduate School of Business

*Geranio M, Mazzoli C, Palmucci F (2013) Conflicts of interest in IPO pricing. Working Paper, University of Bocconi

Geyskens I, Krishnan R, Steenkamp JEM, Cunha PV (2009) A review and evaluation of meta-analysis practices in management research. J Manag 35(2):393–419

*Gompers PA (1996) Grandstanding in the venture capital industry. J Financ Econ 42(1):133–156

*Gounopoulos D (2003) The initial and aftermarket performance of IPOs: evidence from Athens Stock Exchange. Working Paper, Manchester School of Management

*Gounopoulos D, Nounis C, Stylianides P (2008) The short and long term performance of initial public offerings in the Cyprus Stock Exchange. J Financ Decis Mak 4(1):1–28

Grinblatt M, Hwang CY (1989) Signalling and the pricing of new issues. J Finance 44(2):393–420

*Guenther DA, Willenborg M (1999) Capital gains tax rates and the cost of capital for small business: evidence from the IPO market. J Financ Econ 53(3):385–408

*Guo RJ, Lev B, Shi C (2006) Explaining the short- and long-term IPO anomalies in the US by R&D. J Bus Finance Account 33(3–4):550–579

*Habib M, Ljungqvist A (2001) Underpricing and entrepreneurial wealth losses in IPOs: theory and evidence. Rev Financ Stud 14(2):433–458

Hanley KW (1993) The underpricing of initial public offerings and the partial adjustment phenomenon. J Financ Econ 34(2):231–250

Hansen RS, Torregrosa P (1992) Underwriter compensation and corporate monitoring. J Finance 47(4):1537–1555

Hao GQ (2007) Laddering in initial public offerings. J Financ Econ 85(1):102–122

Hardy MA (1993) Regression with dummy variables. quantitative applications in the social sciences, SAGE University Paper, 93

Hedges LV, Olkin I (1985) Statistical methods for meta-analysis. Academic Press, Orlando

Hedges LV, Vevea JL (1998) Fixed- and random-effects models in meta-analysis. Psychol Methods 3(4):486–504

Hedges LV, Tipton E, Johnson MC (2010) Robust variance estimation in meta-regression with dependent effect size estimates. Res Synth Methods 1(1):39–65

Heugens PPMAR, van Essen M, van Oosterhout J (2009) Meta-analyzing ownership concentration and firm performance in Asia. Towards a more fine-grained understanding. Asia Pac J Manag 26(3):481–512

*Hoberg G (2007) The underwriter persistence phenomenon. J Finance 62(3):1169–1206

Hopp C, Dreher A (2013) Do differences in institutional and legal environments explain cross-country variations in IPO underpricing? Appl Econ 45(4):435–454

*Howton SD, Howton SW, Olson GT (2001) Board ownership and IPO return. J Econ Finance 25(1):100–114

Hunter JE, Schmidt FL (2004) Methods of meta-analysis: correcting error and bias in research findings, 2nd edn. Sage, Thousand Oaks

Ibbotson RG (1975) Price performance of common stock new issues. J Financ Econ 2(3):235–272

Jackson D, Riley R, White IR (2011) Multivariate meta-analysis: potential and promise. Stat Med 30(20):2481–2498

Jagannathan R, Sherman AE (2005) Reforming the bookbuilding process for IPOs. J Appl Corp Financ 17(1):67–72

Jenkinson T, Ljungqvist A (2001) Going public: the theory and evidence on how companies raise equity finance, 2nd edn. Oxford University Press, Oxford

Johnson J, Miller R (1988) Investment banker prestige and the underpricing of initial public offerings. Financ Manag 17(2):19–29

*Karlis PL (2000) IPO underpricing. Park Place Econ VIII:81–89

Kaufmann D, Kraay A, Mastruzzi M (2004) Governance matters III: governance indicators for 1996, 1998, 2000, and 2002. World Bank Econ Rev 18(2):253–287

*Kirkulak B, Davis C (2005) Underwriter reputation and underpricing: evidence from the Japanese IPO market. Pac Basin Finance J 13(4):451–470

*Kutsuna K, Dimovski W, Brooks R (2008) The pricing and underwriting costs of Japanese REIT IPOs. J Prop Res 25(3):221–239

La Porta R, Lopez-de-Silanes F, Shleifer A, Vishny RW (1997) Legal determinants of external finance. J Finance 52(3):1131–1150

La Porta R, Lopez-de-Silanes F, Shleifer A, Vishny RW (1998) Law and finance. J Political Econ 106(6):1113–1155

La Porta R, Lopez-de-Silanes F, Shleifer A, Vishny RW (1999) The quality of government. J Law Econ Organ 15(1):222–279

Leland HE, Pyle DH (1977) Information asymmetries, financial structure, and financial intermediation. J Finance 32(2):371–387

*Leone AJ, Rock S, Willenborg M (2007) Disclosure of intended use of proceeds and underpricing in initial public offerings. J Account Res 45(1):111–153

*Lin TH (1996) The certification role of large block shareholders in initial public offerings: the case of venture capitalists. Q J Bus Econ 35(2):55–65

*Liu X, Ritter JR (2011) Local underwriter oligopolies and IPO underpricing. J Financ Econ 102(3):579–601

*Ljungqvist A (1999) IPO underpricing, wealth losses and the curious role of venture capitalists in the creation of public companies. Unpublished working paper

*Ljungqvist A, Wilhelm WJ (2003) IPO pricing in the dot-com bubble. J Finance 58(2):723–752

Logue DE (1973) On the pricing of unseasoned equity issues: 1965–1969. J Financ Quant Anal 8(1):91–103

Logue DE, Rogalski RJ, Seward JK, Foster-Johnson L (2002) What is special about the roles of underwriter reputation and market activities in initial public offerings? J Bus 75(2):213–243

Loughran T, Ritter JR (2002) Why don’t issuers get upset about leaving money on the table in IPOs? Rev Financ Stud 15(2):413–443

*Loughran T, Ritter J (2004) Why has IPO underpricing changed over time? Financ Manag 33(3):5–37

*Loureiro G (2010) The reputation of underwriters: a test of the bonding hypothesis. J Corp Finance 16(4):516–532

*Lowry M, Murphy KJ (2007) Executive stock options and IPO underpricing. J Financ Econ 85(1):339–365

Lowry M, Schwert GW (2004) Is the IPO pricing process efficient? J Financ Econ 71(1):3–26

Lypsey MW, Wilson DB (2001) Practical meta-analysis. Sage Publications, Thousand Oaks

*McGuinness P (1992) An examination of the underpricing of initial public offerings in Hong Kong: 1980–1990. J Bus Finance Account 19(2):165–186

*Megginson W, Weiss KA (1991) Venture capital certification in initial public offerings. J Finance 46(3):879–903

*Michaely R, Shaw WH (1994) The pricing of initial public offerings: tests of adverse-selection and signaling theories. Rev Financ Stud 7(2):279–319

Mulherin JH (2007) Measuring the costs and benefits of regulation: conceptual issues in securities markets. J Corp Finance 13(2–3):421–437

Pagano M, Panetta F, Zingales L (1998) Why do companies go public? An empirical analysis. J Finance 53(1):27–64

Pham PK, Kalev PS, Steen AB (2003) Underpricing, stock allocation, ownership structure and post-listing liquidity of newly listed firms. J Bank Finance 27(5):919–947

*Pollock TG (2004) The benefits and costs of underwriters’ social capital in the US Initial Public Offerings market. Strateg Organ 2(4):357–388

Pollock TG, Fund B, Baker T (2009) Dance with the one that brought you? Venture capital firms and the retention of founder-CEOs. Strateg Entrep J 3(3):199–217

*Rasheed AMA, Datta DK, Chinta RR (1997) Determinants of price premiums: a study of initial public offerings in the medical diagnostics and devices industry. J Small Bus Manag 35(4):11–23

*Reber B, Fong C (2006) Explaining mispricing of initial public offerings in Singapore. Appl Financ Econ 16:1339–1353

Ritter JR (1984) Signaling and the valuation of unseasoned new issues: a comment. J Finance 39(4):1231–1237

Ritter JR (1987) The costs of going public. J Financ Econ 19(2):269–281

*Robinson RM, Robinson MA, Peng C-C (2004) Underpricing and IPO ownership retention. J Econ Finance 28(1):132–146

Rock K (1986) Why new issues are Underpriced? J Financ Econ 15(1–2):187–212

Rosenthal R (1991) Meta-analytic procedures for social research, Revised edn. Sage, Newbury Park

Sapienza H, Manigart S, Vermeir W (1996) Venture capitalist governance and value added in four countries. J Bus Ventur 11(6):439–469

*Schenone C (2004) The effect of banking relationships on the firm’s IPO underpricing. J Finance 59(6):2903–2958

*Schertler A (2002) The determinant of underpricing: initial public offerings on the Neuer Markt and the Nouveau Marché. EIFC Working Paper, 02–12

*Schrand C, Verrecchia RE (2005) Information disclosure and adverse selection explanations for IPO underpricing. Working Paper SSRN No. 316824

*Smart SB, Zutter CJ (2003) Control as motivation for underpricing: a comparison of dual and single class IPOs. J Financ Econ 69(1):85–110

*Song K, Choi Y-S, Lee JE (2012) Revisiting the certifying role of financial intermediaries on IPOs. J Appl Bus Res 28(5):1017–1034

Spamann H (2010) The “antidirector rights index” revisited. Rev of Financ Stud 23(2):467–486

Stanley TD (2001) Wheat from chaff: meta-analysis as quantitative literature review. J Econ Perspect 15(3):131–150

Stanley TD, Doucouliagos H, Jost MG, Heckemeyer H, Johnston RJ, Laroche P, Nelson JP, Paldam M, Poot J, Pugh G, Rosenberger RS, Rost K (2013) Meta-analysis of economics research reporting guidelines. J Econ Surv 27(2):390–394

Stoll H, Curley AJ (1970) small business and the new issues markets for equities. J Financ Quant Anal 5(3):309–322

Stübben J, Berkholz R (2014) Determinants of IPO underpricing—a comparison between the UK and Germany. Master thesis. Lund University

Su C (2015) Does institutional reform improve the impact of investment bank reputation on the long-term stock performance of initial public offerings? Br Account Rev 47(4):445–470

*Su C, Bangassa K (2011) The impact of underwriter reputation on initial returns and long-run performance of Chinese IPOs. J Int Financ Mark Inst Money 21(5):760–791

Tanner-Smith EE, Tipton E (2013) Robust variance estimation with dependent effect sizes: practical considerations including a software tutorial in Stata and SPSS. Res Synth Methods 5(1):13–30

Tanner-Smith EE, Tipton E, Polanin JR (2016) Handling complex meta-analytic data structures using robust variance estimates: a tutorial in R. J Dev Life-Course Criminol 2(1):85–112

Thompson SG, Higgins JPT (2002) How should meta-regression analyses be undertaken and interpreted? Stat Med 21(11):1559–1573

Tinic SM (1988) Anatomy of initial public offerings of common stock. J Finance 43(4):789–822

Tipton E (2013) Robust variance estimation in meta-regression for binary dependent outcomes. Res Synth Methods 4(2):169–187

*Vismara S (2014) Patents, R&D investments and post-IPO strategies. Rev Manag Sci 8(3):419–435

Vong API, Trigueiros D (2010) The short-run price performance of initial public offerings in Hong Kong: new evidence. Glob Finance J 21(3):253–261

*Vong API, Zhao N (2008) An examination of IPO underpricing in the growth enterprise market of Hong Kong. Appl Financ Econ 18(19):1539–1547

Welch I (1989) Seasoned offerings, imitation costs and the underpricing of initial public offerings. J Finance 44(2):421–449

*Wen W (2005) What has explained IPO underpricing? Working Paper Simon Fraser University

*Wolf FM (1986) Meta-analysis: quantitative methods for research synthesis. Quantitative applications in the social sciences. SAGE University Paper, 59

World Bank (2016) The worldwide governance indicators. http://www.worldbank.org/ . Accessed Feb 2016

Wurgler J (2000) Financial markets and the allocation of capital. J Financ Econ 58(1–2):187–214

*Yang L (2012) An analysis of the IPO underpricing for high-tech companies in Japan. Working Paper Saint Mary’s University, Halifax N.S

*Yeh Y-H, Shu P-G, Guo R-J (2008) Ownership structure and IPO valuation—evidence from Taiwan. Financ Manag 37(1):141–161

Zhang F (2012) Information precision and IPO pricing. J Corp Finance 18(2):331–348

Download references

Author information

Authors and affiliations.

Department of Economics, University of Messina, Via dei Verdi, 75, 98122, Messina, Italy

Tiziana La Rocca

You can also search for this author in PubMed   Google Scholar

Corresponding author

Correspondence to Tiziana La Rocca .

Additional information

Publisher's note.

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

List of the management, entrepreneurship and finance journals used for manually search of papers, which were the journals that I found in the first set of papers that I obtained as output using the first research-criteria.

Rights and permissions

Reprints and permissions

About this article

La Rocca, T. Do prestigious underwriters shape IPO pricing? A meta-analytic review. Rev Manag Sci 15 , 573–609 (2021). https://doi.org/10.1007/s11846-019-00356-1

Download citation

Received : 07 June 2018

Accepted : 01 October 2019

Published : 15 October 2019

Issue Date : April 2021

DOI : https://doi.org/10.1007/s11846-019-00356-1

Share this article

Anyone you share the following link with will be able to read this content:

Sorry, a shareable link is not currently available for this article.

Provided by the Springer Nature SharedIt content-sharing initiative

  • Corporate governance
  • Underpricing
  • Underwriter reputation
  • Prestigious underwriter
  • Meta-analytic moderator analysis
  • Robumeta approach

JEL Classification

  • Find a journal
  • Publish with us
  • Track your research

TSMC expects Q2 sales to jump on 'insatiable' AI demand

  • Medium Text
  • Sees Q2 revenue at $19.6-$20.4 bln vs $15.68 bln year earlier
  • Forecasts 2024 revenue to rise in low- to mid-20% range in USD
  • TSMC Q1 net profit rises 9%, beats forecasts
  • Stock price has surged on chip demand for AI applications

Logo of Taiwan Semiconductor Manufacturing Co (TSMC), in Hsinchu

NET PROFIT RISES

Sign up here.

Reporting by Yimou Lee and Faith Hung; additional reporting by Ben Blanchard; Editing by Anne Marie Roantree, Shri Navaratnam and Christian Schmollinger

Our Standards: The Thomson Reuters Trust Principles. New Tab , opens new tab

research paper on share prices

Thomson Reuters

Yimou Lee is a Senior Correspondent for Reuters covering everything from Taiwan, including sensitive Taiwan-China relations, China's military aggression and Taiwan's key role as a global semiconductor powerhouse. A three-time SOPA award winner, his reporting from Hong Kong, China, Myanmar and Taiwan over the past decade includes Myanmar's crackdown on Rohingya Muslims, Hong Kong protests and Taiwan's battle against China's multifront campaigns to absorb the island.

FILE PHOTO: The logo of Google LLC is seen at the Google Store Chelsea in New York City

Technology Chevron

U.s. chip bans not meant to hobble china's growth, blinken says.

U.S. export controls on sending advanced computing chips to China are not meant to hold back China's economy or technological development, Secretary of State Antony Blinken said during an interview with National Public Radio on Friday.

Apple logo is seen on the MacBook in this illustration taken

IMAGES

  1. (PDF) Survey on Factors Determining the Share Price Movement in Indian

    research paper on share prices

  2. (PDF) The Impact of Dividend Policy on Share Price Volatility

    research paper on share prices

  3. (PDF) The Effect of Qualified Auditors' Report on Share Prices

    research paper on share prices

  4. Sample essay on share prices

    research paper on share prices

  5. (PDF) Study On Dividend Policy: Antecedent and Its Impact On Share Price

    research paper on share prices

  6. Charted: The Rise of Stock Buybacks Over 20 Years

    research paper on share prices

VIDEO

  1. jk paper limited share price #stock #market #live #intraday #bank #nifty

  2. Genus Paper share

  3. Best swing trading stock| j.k paper ltd| entry| target| इस stock जाने मत देना| return 10% confirmed

  4. Lecture 14: Types of issue for SEBI Grade A

  5. JK PAPER SHARE LATEST NEWS|JK PAPER DIVIDEND NEWS|JK PAPER SHARE PRICE|JK PAPER LTD|multibaggerstock

  6. Historical Stock Price Data in Google Sheets

COMMENTS

  1. Stock market movement forecast: A Systematic review

    This paper presents an updated systematic review of the state of the art in the stock market forecast considering fundamental and technical analysis from 2014 to 2018. ... Research methodology and related works ... and closing prices. This oscillator is used in conjunction with the share price and can predict an uptrend or downtrend. Lambert ...

  2. A systematic review of fundamental and technical analysis of stock

    A stock market is a place for trading stocks (equity) and other financial instruments of public listed companies, where the price of shares is termed "share" or "stock price" (Wanjawa and Muchemi 2014). In reality, the stock price level of a firm, to a large extent reflects how it "cuts its pie" (Chan et al. 2017).

  3. Factors affecting share prices: A literature revisit

    The aim of the paper is to present key factors they are determinants of share prices in the stock markets globally. To achieve the aim of this study many of previous studies have been reviewed.

  4. The Effect of Dividend Policy on Share Price: A Conceptual Review

    dividend changes o n share prices: An empirical anal ysis of China. Chinese Economy. 2009;4 2(1):62- 85. [ 76 ] Hamid K, Khurram MU, Ghaffar W. Juxtaposition of Micro and

  5. ESG performance and stock prices: evidence from the COVID-19 ...

    Summary statistics. In response to the COVID-19 pandemic, this paper selects Chinese A-share listed companies as a study sample. This study selects the sample based on the following considerations.

  6. Corporate Social Responsibility and Stock Prices After the ...

    This paper examines the impact of Corporate Social Responsibility (CSR) on companies' stock market performance, measured in terms of share price, in the period after the global financial crisis. We further distinguish between strategic socially responsible activities directly related to a company's business core, and the remaining CSR ...

  7. Factors affecting share prices: A literature revisit

    This paper revisits previous studies by various researches on factors affecting the share price in both developed and developing economies. These researchers looked at the relationships between stock prices and selected internal and external factors affecting a firm's share price. "Some of the factors affecting share prices can be ...

  8. On the effects of Covid-19 pandemic on stock prices: an ...

    Examining the factors influencing stock prices has been thoroughly researched in the literature of financial economics, factors such as money supply, inflation rate, interest rates, exchange rate, etc. as being the main determinants of stock prices in almost every country (Bahmani-Oskooee and Saha 2019; Fasanya et al. 2021a).Some examples of studies that have included these variables are, Niel ...

  9. Empirical Relationship Between Macroeconomic Variables and Stock Market

    In a bid to find gaps in research, some of the existing literature has been studied. Mookerjee and Yu studied informational efficiency in the Singapore market by applying cointegration and causality tests.Narrow and broad money supply, nominal exchange rates and foreign currency reserves are studied in connection with stock prices.

  10. An Exploratory Study of Stock Price Movements from Earnings Calls

    Here, we study the statistical relationship between earnings calls, company sales, stock perfor-mance, and analysts' recommendations. Our study covers a decade of observations with approximately 100,000 transcripts of earnings calls from 6,300 public companies from January 2010 to December 2019. In this study, we report three novel findings.

  11. PDF Relationships Between Stock Price, Trading Volume, and Bid-ask Spread

    Stock Exchange. The authors used a log-linear regression model where spread per share was the dependent variable and price per share, trading volume, number of market makers, and risk were the independent variables. All four independent variables were significant, and the model explained 91% of the variation in the spread.

  12. The Impact of Green Investors on Stock Prices

    Abstract. We study the impact of green investors on stock prices in a dynamic equilibrium model where investors are green, passive or active. Green investors track an index that progressively excludes the stocks of the brownest firms; passive investors hold a value-weighted index of all stocks; and active investors hold a mean-variance efficient portfolio of all stocks.

  13. The Effect of Dividend Policy on Stock Price: Evidence from the Indian

    To examine the impact of the dividend policy as well as DPR of the share price. Descriptive statistics and least square regression method: The model is able to explain only 43.43% of the variation in the share price wherein DY, investment growth and earnings volatility insignificantly influence the changes in the company's share price. 2013

  14. Short-term stock market price trend prediction using a ...

    In the era of big data, deep learning for predicting stock market prices and trends has become even more popular than before. We collected 2 years of data from Chinese stock market and proposed a comprehensive customization of feature engineering and deep learning-based model for predicting price trend of stock markets. The proposed solution is comprehensive as it includes pre-processing of ...

  15. Time-varying influence of interest rates on stock returns: evidence

    The rest of this paper proceeds in the following structure. Part 2 reviews the literature that explores the mechanism of interest rate on the stock price. ... Interest rates move in the opposite direction from share prices, and money supply is aligned with share prices. However, this effect is statistically asymmetrical, and the influence of ...

  16. A critical study on relationship of EPS and stock price

    EPS is one of the measure to check the financial position of the firm [2]. Core et al. [7] concluded that in presence of ESP (Employee Stock Option) EPS will positively influence stock price of the company. Forecasting of earnings also influences the stock price. The study of Capstaff et al. [6] is first study in this regard.

  17. How Earning Per Share (EPS) Affects on Share Price and Firm Value

    This paper attempts to provide empirical evidence on how EPS affect the share price movement. We have collected and analyzed 22 scheduled banks 110 firm year data and found that share price does ...

  18. (PDF) Determinants of Stock Prices: Empirical Evidence from NSE 100

    It indicates the extent to which the earnings of each share are covered by its price.It can be calculated as: P/E ratio DPS = Dividendt,n No. of outstanding shares (4) (6) = MP t,n EPS t, n (7) 89 IRACST- International Journal of Research in Management & Technology (IJRMT), ISSN: 2249-9563 Vol. 3, No.3,June 2013 Where MPt,n is the market price ...

  19. A Study on the Impact of Dividend Announcements on Stock Prices in

    This research highlights the impact of dividend payment on the behavior of stock prices and their abnormal returns. To understand this behavior, 120 stocks have been randomly picked, which have paid cash dividends in 2016 and used the popular event methodology.

  20. PDF Analysis on Factors that Affect Stock Prices: A Study on Listed ...

    Research Journal of Finance and Accounting www.iiste.org ISSN 2222-1697 (Paper) ISSN 2222-2847 (Online) Vol.7, No.18, 2016 93 ... This study investigates on identifying the main forces that affect share prices in the capital market of Bangladesh. For this, the study considers a panel data set of 7 companies of cement industry listed in the Dhaka

  21. Carbon prices on the rise? Shedding light on the emerging EU ETS2

    how high prices could rise and related drivers is substantial. In light of this, we explore ETS2 price paths using the energy system model PRIMES. We focus on the effect of complementary efficiency policies (EPs), as earlier research suggests they could have profound impact. Indeed, analysing three scenarios with different

  22. Truth Social: Donald Trump is on the verge of another $1 billion ...

    According to SEC filings, Trump Media can issue additional shares to pre-merger shareholders such as the former president if the dollar volume-weighted average price equals or exceeds $12.50 for ...

  23. Introducing Meta Llama 3: The most capable openly available LLM to date

    Today, we're excited to share the first two models of the next generation of Llama, Meta Llama 3, available for broad use. This release features pretrained and instruction-fine-tuned language models with 8B and 70B parameters that can support a broad range of use cases.

  24. Do prestigious underwriters shape IPO pricing? A meta-analytic review

    The management, finance and entrepreneurship literature has demonstrated increasing interest in the causes and consequences of initial public offerings, IPOs (Certo et al. 2009).Underpricing, which involves the difference between the issue price of a company's shares and the official market price that is recorded on the first day of trading, is an anomaly in the capital market that has ...

  25. A Study on Analysis of Stock Prices of Selected Industries

    This study made will help the investors know the behaviour of share prices and thus can succeed. Simple Moving Average for IDFC First Bank Ltd. Moving Average of Bajaj Finance Ltd . from 1 st ...

  26. iPhone sales are plunging. Here's why

    Apple's smartphone sales tumbled 10% last quarter, according to market research firm IDC, as sales in China fell sharply. ... Samsung occupied about 20.8% of market share (or 60.1 million ...

  27. (PDF) The Effect of Market Capitalization, Trading ...

    This research looks at how market capitalization, trading volume, BV, and DER are used to analyze share price variations. The research aims to understand investor behavior in fluctuating market ...

  28. UF DBA's research born from personal tragedy awarded best paper

    University of Florida Bryan Hall 100 PO Box 117150 Gainesville, FL 32611-7150

  29. International Paper tops quarterly sales on higher prices, demand

    Packaging company International Paper beat Wall Street's expectations for first-quarter sales on Thursday, capitalizing on higher prices, improved production and a rebound in demand for its products.

  30. TSMC expects Q2 sales to jump on 'insatiable' AI demand

    TSMC , the world's largest chipmaker and a major Apple and Nvidia supplier, forecast second-quarter sales may rise as much as 30% as it rides a wave of demand for semiconductors used in artificial ...