Using eight years of student loan and academic performance data from the College of Management at Lunghwa University of Science and Technology, this study conducts clustering, ANOVA, and visualization analysis to reveal key underlying patterns.(LHU Thesis)

kancheng/rsloan

rsloan

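Uses eight years of student loan and student grade data from a university's College of Management to perform clustering, ANOVA, and visualization analysis.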

Master's Thesis, Lunghwa University of Science and Technology

Institution: College of Management, Lunghwa University of Science and Technology

Degree: Master’s Thesis

This project analyzes eight years of institutional data, including student loan application records and academic performance, collected by the College of Management at Lunghwa University of Science and Technology.

Using Python and statistical modeling, the study performs:

  • Clustering Analysis to identify hidden patterns among student groups

  • ANOVA (Analysis of Variance) to examine differences across loan usage and academic outcomes

  • Data Visualization to interpret trends and support decision-making

The research aims to provide insights into financial aid effectiveness, academic performance prediction, and institutional policy planning.

RSLoan - R Shiny

Developed with R Shiny.

Home

rsloanhome

Work

rsloanimport

R Function

Functions for cluster analysis, which form the foundation of rsloan. dataipt and dataopt are the data input and data output directories, respectively.

Import

set.seed(929)
getwd()

# Load the main file of the author's custom R functions
source("C:/dataipt/rfunc/main-rfunc.R")

# Data input path (CSV)
dataipth = "C:/dataipt/lhucmdt/lhumlndcw70"

# Data output path (CSV)
dataopth = "C:/dataopt/lhucmdt"

# Load the input data as data frames
rcsvdf(dataipth)

# Load the input data as a list
lhudata = rcsvlt(dataipth)
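
The internals of rcsvdf and rcsvlt are not shown here. As a rough illustration of the general idea of loading a directory of CSV files into a named list, a minimal sketch might look like the following. The helper name read_csv_dir, the temporary directory, and the sample values are all hypothetical and are not part of rsloan.

```r
# Hypothetical helper: read every CSV file in a directory into a named list.
# This is NOT the real rcsvlt(); it only illustrates the general idea.
read_csv_dir <- function(path) {
  files <- list.files(path, pattern = "\\.csv$", full.names = TRUE)
  data_list <- lapply(files, read.csv, stringsAsFactors = FALSE)
  # Name each element after its file, without the .csv extension
  names(data_list) <- tools::file_path_sans_ext(basename(files))
  data_list
}

# Demonstration with a temporary directory and made-up data
demo_dir <- file.path(tempdir(), "rsloan_demo")
dir.create(demo_dir, showWarnings = FALSE)
write.csv(data.frame(sid = 1:3, cala = c(80, 65, 72)),
          file.path(demo_dir, "im13.csv"), row.names = FALSE)

demo_data <- read_csv_dir(demo_dir)
str(demo_data$im13)
```

Each list element can then be accessed by its file name, for example demo_data$im13.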

Setting

# Column names used as the basis (variables) for cluster analysis
hacbdt = c("cala", "loam", "ec", "cppg")

# Primary key column name(s) for cluster analysis
pkb = c("sid")

Hierarchical clustering

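Generates datasets such as the hierarchical clustering results, ANOVA results, descriptive statistics for each cluster, descriptive statistics for each cluster by student loan status, and the proportion of students in each cluster.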

# Hierarchical clustering
hcaon(im13,  hacbdt, pkb, hck = 6, dtname = "im13")
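
The implementation of hcaon is not shown in this README. As a hedged illustration of the kind of pipeline described above (hierarchical clustering into 6 groups, cluster proportions, per-cluster statistics, and ANOVA), the following sketch uses only base R. The data are randomly generated, and the choice of standardized variables with Ward linkage is an assumption, not necessarily what hcaon does.

```r
set.seed(929)

# Made-up data: the column names mirror hacbdt and pkb above,
# but the values are random and carry no real meaning.
vars <- c("cala", "loam", "ec", "cppg")
demo <- data.frame(sid = 1:60)
for (v in vars) demo[[v]] <- round(runif(60, 0, 100))

# Hierarchical clustering on standardized variables
# (Ward linkage is an assumed choice for this sketch)
d  <- dist(scale(demo[, vars]))
hc <- hclust(d, method = "ward.D2")
demo$cluster <- factor(cutree(hc, k = 6))

# Proportion of records in each cluster
prop.table(table(demo$cluster))

# Per-cluster means of the clustering variables
aggregate(demo[, vars], by = list(cluster = demo$cluster), FUN = mean)

# One-way ANOVA: does the mean of cala differ across clusters?
summary(aov(cala ~ cluster, data = demo))
```

Here k = 6 matches the hck = 6 argument in the call above.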

Plot

Generates single-subject and multi-subject scatter plots.

# HCA Multiple ggplot proc

mainindex = c( "cala" )
courindex = c( "itdc", "cppg", "pcpg", "oopg", "itdcn", "cala", "calb", "ec", 
"dtst", "nwkpm", "sadm", "idbs", "st", "mana", "inkpg", "dbms", "mis")

sg2proc("im11avt", mainindex, courindex)
mg2proc("im11avt", mainindex, courindex)
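
The internals of sg2proc and mg2proc are not shown here. The sketch below illustrates one plausible reading of "single-subject" and "multi-subject" scatter plots with ggplot2: the main subject plotted against one other subject, and against several subjects in separate panels. The data, the cluster column, and the plot layout are assumptions made for illustration, and the ggplot2 package is assumed to be installed.

```r
library(ggplot2)  # assumed to be installed

set.seed(929)
# Made-up scores and cluster labels
demo <- data.frame(
  cala    = round(runif(40, 30, 100)),
  cppg    = round(runif(40, 30, 100)),
  ec      = round(runif(40, 30, 100)),
  cluster = factor(sample(1:3, 40, replace = TRUE))
)

# Single-subject plot: main subject against one other subject
p_single <- ggplot(demo, aes(x = cala, y = cppg, colour = cluster)) +
  geom_point()

# Multi-subject plot: stack the other subjects into long form, one panel each
others <- c("cppg", "ec")
long <- do.call(rbind, lapply(others, function(v) {
  data.frame(cala = demo$cala, score = demo[[v]],
             course = v, cluster = demo$cluster)
}))
p_multi <- ggplot(long, aes(x = cala, y = score, colour = cluster)) +
  geom_point() +
  facet_wrap(~ course)

print(p_single)
print(p_multi)
```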

Details

raw

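Contains the College of Management student data analyzed so far.

  • dtcna70 is the dataset after NA cleaning.

  • origdt is the original dataset, with NA values unprocessed.

  • sql is the NA-cleaned data in SQL form.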

rfunc

For the Windows R Console.

To use the prepared custom functions directly in the console, run the R files in the rfunc directory.

shiny-server

For Ubuntu Linux.

shiny-server/rsloan/...

Copy the entire rsloan directory found under the shiny-server directory into /srv/shiny-server/, as follows:

Deployment path

/srv/shiny-server/rsloan

Browser

127.0.0.1:3838/rsloan

rprofile

Loaded automatically by default when R starts.

Environment

See rsloan-environment for environment deployment details.

R Version

The packages correspond to R version 3.3.2.

Data Cleaning

Handling of records with NA values: rsloan-dcna
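
The actual cleaning procedure is documented in rsloan-dcna and is not reproduced here. As a minimal sketch of counting and dropping records that contain NA values in base R, using made-up data:

```r
# Made-up records; NA marks a missing grade
demo <- data.frame(
  sid  = 1:5,
  cala = c(80, NA, 72, 65, NA),
  ec   = c(90, 70, NA, 88, 60)
)

# Number of records that contain at least one NA
n_with_na <- sum(!complete.cases(demo))

# Keep only records without any NA
demo_clean <- demo[complete.cases(demo), ]

n_with_na         # 3
nrow(demo_clean)  # 2
```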

nca

Subject columns with actual grades / all subject columns
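
A ratio of this form can be computed per record in base R. The sketch below assumes that nca is calculated per student and that a missing grade is stored as NA. Both are assumptions for illustration, and the data are made up.

```r
# Made-up grades; NA means the student has no grade for that subject
demo <- data.frame(
  sid  = 1:3,
  cala = c(80, NA, 72),
  ec   = c(90, NA, NA),
  cppg = c(75, 60, NA)
)
subject_cols <- c("cala", "ec", "cppg")

# Subject columns with an actual grade, divided by all subject columns
demo$nca <- rowSums(!is.na(demo[, subject_cols])) / length(subject_cols)

demo$nca  # 1, 1/3, 1/3
```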

Paper

Kan, Hao-Cheng. (2017). "A Study of Academic Achievement Analysis System: Using Cluster Analysis and Visualization" (Lunghwa University of Science and Technology). Retrieved from https://hdl.handle.net/11296/45ru69

LICENSE

MIT License
