Research
design
This
study adopted an explanatory research design. The explanatory design was
considered appropriate because it facilitated the identification and assessment
of causal relationships among the study variables. Panel data were used because
they combined both cross-sectional and time-series dimensions, thereby
increasing the number of observations and improving the efficiency and
robustness of the empirical estimations. Consistent with Wooldridge, the use of
panel data allowed the study to control for unobserved heterogeneity across
banks that could otherwise bias the results [44]. The dataset consisted of an
unbalanced panel, as some sampled banks did not have complete observations for
all years due to data attrition. Consequently, the empirical analysis focused
on unbalanced panel data. The study estimated pooled ordinary least squares
(OLS), fixed-effects and random-effects regression models. These estimation
techniques enabled the analysis to account for both individual-specific and
time-invariant characteristics while ensuring the selection of the most
appropriate model for inference.
Population
of the study
The
population of the study comprised all licensed rural and community banks
operating in Ghana. These were selected as the population of interest because
of their central role in extending credit to underserved communities and their
relatively high exposure to loan default risk.
Sample
size and sampling technique
The
study employed a purposive sampling technique to select rural and community
banks with available and consistent financial data over the study period.
Sampling was necessary to reduce the volume of data required while ensuring
that the selected sample adequately represented the population under
investigation. Accordingly, a total of twenty (20) rural and community banks
were selected for the period 2014–2019. The selected time frame was considered
appropriate because it provided sufficient observations to support robust panel
data analysis and captured recent developments in the rural banking sector.
Rural and community banks that did not have complete annual financial reports
for the study period were excluded, resulting in an unbalanced panel dataset.
Data
collection procedure
The
study relied exclusively on secondary data. Financial data for both the
dependent and independent variables were obtained from the audited annual
financial statements of the sampled rural and community banks for the period
2014–2019. These statements provided consistent and reliable information on key
financial indicators, including non-performing loans, asset quality and return
on equity. In addition, macroeconomic variables were sourced from the World
Development Indicators (WDI) database to capture relevant external factors that
influenced loan default and financial performance. The use of secondary data
enhanced the reliability and objectivity of the study and facilitated
longitudinal analysis.
Data
analysis techniques
The
data were analyzed using econometric techniques implemented in EViews software.
Prior to estimation, several diagnostic tests were conducted to assess the
suitability of the data for regression analysis. These included unit root
(stationarity) tests to ensure the stability of the variables over time and
multicollinearity tests to examine the degree of correlation among the
explanatory variables. Furthermore, the Hausman specification test was
conducted to determine the most appropriate estimation technique among the
pooled OLS, fixed-effects and random-effects models. The Hausman test enabled
the selection of a consistent and efficient model by assessing whether
individual-specific effects were correlated with the explanatory variables.
Based on the results of this test, the final regression estimates were
selected, interpreted and discussed.
Description
of variables
In
this study, bank financial performance was conceptualized as the dependent
variable and proxied by return on equity (ROE). The empirical banking
literature has consistently employed profitability indicators such as return on
assets (ROA) and return on equity (ROE) to evaluate bank performance and
financial sustainability [45,46]. While ROA primarily captures managerial
efficiency in the utilization of total assets, ROE more directly reflects the
returns generated on shareholders invested capital and therefore aligns more
closely with the objective of shareholders’ wealth maximization. As such, ROE
provides a more comprehensive assessment of how effectively bank management
translates asset deployment and risk-taking decisions into returns for equity
holders. The relevance of ROE is particularly pronounced in the context of
rural and community banks, where capital bases are relatively limited and
profitability is highly sensitive to asset quality and credit risk. Empirical
evidence suggests that rising non-performing loans erode banks’ equity through
higher provisioning costs and reduced retained earnings, thereby exerting a
direct negative effect on ROE. Moreover, studies across both developed and
emerging economies have shown that deterioration in loan performance weakens
investor confidence and constrains capital growth, reinforcing the centrality
of ROE as an indicator of financial health. Within emerging and developing
banking systems, ROE has been widely adopted as a robust measure of
profitability because it captures both operational efficiency and the
consequences of risk exposure, particularly credit risk. This is especially
relevant for rural and community banks in Ghana, whose lending activities are
heavily concentrated in microcredit and small-scale loans that are inherently
more vulnerable to default. As such, fluctuations in non-performing loans and
asset quality are more likely to be transmitted directly into equity returns,
making ROE an appropriate and policy-relevant performance indicator.
Accordingly,
ROE was used in this study to assess the effectiveness of management in
generating profits from shareholders’ funds within rural and community banks in
Ghana. It was measured as the ratio of net profit after tax to total
shareholders’ equity and expressed as a percentage. Shareholders’ equity
comprised paid-up capital, statutory reserves, income surpluses, capital
surpluses and revaluation reserves. This measurement approach is consistent
with prior empirical studies that emphasized ROE as a reliable indicator of
profitability, capital efficiency and long-term financial sustainability in the
banking sector, particularly under conditions of heightened credit risk [47].
Accordingly, the selection of ROE as the dependent variable was theoretically
and empirically justified, as it captures the combined effects of asset
quality, loan default and risk management practices on shareholder value. By
focusing on ROE, this study provides a direct assessment of how non-performing
loans and related credit risk factors influence the financial performance and
sustainability of rural and community banks in Ghana (Table 1).
Panel
data analysis
The
study adopted a panel data methodology in recognition of its superior
econometric advantages for analyzing bank-level behavior over time. Panel data
combine cross-sectional observations of the same institutions with their
time-series dynamics, thereby offering a more comprehensive and reliable
representation of economic relationships than single-period cross-sectional
analyses. This framework improves estimation efficiency by increasing the
number of observations, enhancing variability and mitigating potential
multicollinearity among regressors. More importantly, panel data techniques
allow for the control of unobserved heterogeneity across banks, which may
otherwise bias parameter estimates if ignored. Accordingly, the analysis
employed pooled ordinary least squares (OLS), fixed effects and random effects
models to account for both time-specific and bank-specific effects. This
approach ensured that the estimated relationships between loan default and its
internal and macroeconomic determinants reflected not only cross-sectional
differences among rural banks but also their evolution over time.
The basic panel data model is of the
form:
…………………….(1)
Where
is constant, i represents the firm and t is the time dimension.
Represents the explanatory variable and
is the error term.
Where
is the firm’s specific effect and
is a random term.
Pooled
regression model
The
pooled regression model combines cross-sectional and time-series observations
into a single estimation framework, treating the data as a homogeneous pool and
ignoring individual-specific and time-specific effects. Under this approach,
the explanatory variables are assumed to be uncorrelated with the error term
and all observational units are presumed to share a common intercept and slope
coefficients. While pooled ordinary least squares (OLS) provides consistent and
efficient estimates under these assumptions, its principal limitation lies in
its inability to account for unobserved heterogeneity across individual
entities and over time. Consequently, the model does not distinguish between
cross-sectional differences among banks or temporal variations, which may lead
to biased estimates if such effects are present. Despite this limitation,
pooled OLS serves as a useful benchmark model and is commonly employed as a
baseline specification in panel data analysis.
…………………...……………………………………………..(2)
Where; Y=Dependent Variable, X=Explanatory Variable, i=Cross Section Unit,
t=Time Duration and =Error Termit is assumed that the X's are non-stochastic
and that the error term fits the classical assumptions.
Fixed effect regression model
The
fixed-effect model requires the individual ?1t results to be compared with the
explanatory variables X. The fixed effect model is shown below:
Yit=?1i + ?2X2it +?3X3it + ?it……………….……………………………….….….(3)
Where;
Y =Dependent Variable, X=Explanatory Variable, i =Cross section unit, t
=The time period. While intercept can be different between companies in the
Fixed Effect Model, each intercept does not vary over time. In other words,
it's time. Fixed Effect Model assumes that the pitch of regression equations
are different for individuals or over time.
Random effects regression Model
Unlike
the Fixed Effect model, the random effect implies that the error term of the
entity is not associated with the describing variables. The model of fixed
effect is of the form:
Yit=?1i + ?2X2it +?3X3it + ?it
……………….…………………………………….(3)
Where;
Y =Dependent Variable, X =Explanatory Variable, I =
Cross-section Unit, t = Time period.
Instead of considering ?1i as a set, we presume that it is a random variable
with a mean value of ?1i (no subscript i). In other words, the individual error
components are not associated with each other and are not correlated over the
cross-section and time-series units. It is not immediately measurable; it is
regarded as an unobservable or latent element. Y =Dependent Variable, X
=Explanatory Variable, I = Cross-section Unit, t = Time period. Instead of
considering ?1i as a set, we presume that it is a random variable with a mean
value of ?1i (no subscript i).
Model Specification
Model 1: LoanDeftit= ?0 + ?1Bszit+
?2AssetQyit + ?3Unmplyit + ?4LQttit+
?5GDPit + ?6Inflit + ?7 InRatit+
?8ROEit + ?it Model 2: ROEit=
?0 + ?1Bszit+ ?2AssetQyit
+ ?3Unmplyit + ?4LQttit+ ?5GDPit
+ ?6Inflit + ?7 InRatit+ ?8LoanDeft
+ ?it
Where
LoanDeft =
loan default, ROE= return on equity, Bsz= bank size, AssetQy= asset quality,
Umploy= unemployment rate, LQtt= Liquidity, GDP = Gross Domestic Product, Infl
= Inflation, InRat = Interest Rate
? = error
term, i & t represent cross-section unit and at time t respectively and ?
represents coefficient of the variables.