DIGITAL LIBRARY ARCHIVE
HOME > DIGITAL LIBRARY ARCHIVE
< Previous   List   Next >  
Development of New Variables Affecting Movie Success and Prediction of Weekly Box Office Using Them Based on Machine Learning
Full-text Download
Junga Song (Hanbat National University)
Keunho Choi (Hanbat National University)
Gunwoo Kim (Hanbat National University)
Vol. 24, No. 4, Page: 67 ~ 83
10.13088/jiis.2018.24.4.067
Keywords
Movie, Box Office, Box Office Revenue, Box Office Factors, Prediction of Box Office, Predicting Number of Audience, Machine Learning
Abstract
The Korean film industry with significant increase every year exceeded the number of cumulative audiences of 200 million people in 2013 finally. However, starting from 2015 the Korean film industry entered a period of low growth and experienced a negative growth after all in 2016. To overcome such difficulty, stakeholders like production company, distribution company, multiplex have attempted to maximize the market returns using strategies of predicting change of market and of responding to such market change immediately. Since a film is classified as one of experiential products, it is not easy to predict a box office record and the initial number of audiences before the film is released. And also, the number of audiences fluctuates with a variety of factors after the film is released. So, the production company and distribution company try to be guaranteed the number of screens at the opining time of a newly released by multiplex chains. However, the multiplex chains tend to open the screening schedule during only a week and then determine the number of screening of the forthcoming week based on the box office record and the evaluation of audiences. Many previous researches have conducted to deal with the prediction of box office records of films. In the early stage, the researches attempted to identify factors affecting the box office record. And nowadays, many studies have tried to apply various analytic techniques to the factors identified previously in order to improve the accuracy of prediction and to explain the effect of each factor instead of identifying new factors affecting the box office record. However, most of previous researches have limitations in that they used the total number of audiences from the opening to the end as a target variable, and this makes it difficult to predict and respond to the demand of market which changes dynamically. Therefore, the purpose of this study is to predict the weekly number of audiences of a newly released film so that the stakeholder can flexibly and elastically respond to the change of the number of audiences in the film. To that end, we considered the factors used in the previous studies affecting box office and developed new factors not used in previous studies such as the order of opening of movies, dynamics of sales. Along with the comprehensive factors, we used the machine learning method such as Random Forest, Multi Layer Perception, Support Vector Machine, and Naive Bays, to predict the number of cumulative visitors from the first week after a film release to the third week. At the point of the first and the second week, we predicted the cumulative number of visitors of the forthcoming week for a released film. And at the point of the third week, we predict the total number of visitors of the film.
In addition, we predicted the total number of cumulative visitors also at the point of the both first week and second week using the same factors. As a result, we found the accuracy of predicting the number of visitors at the forthcoming week was higher than that of predicting the total number of them in all of three weeks, and also the accuracy of the Random Forest was the highest among the machine learning methods we used. This study has implications in that this study 1) considered various factors comprehensively which affect the box office record and merely addressed by other previous researches such as the weekly rating of audiences after release, the weekly rank of the film after release, and the weekly sales share after release, and 2) tried to predict and respond to the demand of market which changes dynamically by suggesting models which predicts the weekly number of audiences of newly released films so that the stakeholders can flexibly and elastically respond to the change of the number of audiences in the film.
Show/Hide Detailed Information in Korean
영화 흥행에 영향을 미치는 새로운 변수 개발과이를 이용한 머신러닝 기반의 주간 박스오피스 예측
송정아 (한밭대학교)
최근호 (한밭대학교)
김건우 (한밭대학교)
Keywords
영화 흥행 예측, 영화 관람객 수 예측, 박스오피스 예측, 기계학습
Abstract
2013년 누적인원 2억명을 돌파한 한국의 영화 산업은 매년 괄목할만한 성장을 거듭하여 왔다. 하지만 2015 년을 기점으로 한국의 영화 산업은 저성장 시대로 접어들어, 2016년에는 마이너스 성장을 기록하였다. 영화산업을 이루고 있는 각 이해당사자(제작사, 배급사, 극장주 등)들은 개봉 영화에 대한 시장의 반응을 예측하고 탄력적으로 대응하는 전략을 수립해 시장의 이익을 극대화하려고 한다. 이에 본 연구는 개봉 후 역동적으로 변화하는 관람객 수요 변화에 대한 탄력적인 대응을 할 수 있도록 주차 별 관람객 수를 예측하는데 목적을 두고 있다. 분석을 위해 선행연구에서 사용되었던 요인 뿐 아니라 개봉 후 역동적으로 변화하는 영화의 흥행순위, 매출점유율, 흥행순위 변동 폭 등 선행연구에서 사용되지 않았던 데이터들을 새로운 요인으로 사용하고 Naive Bays, Random Forest, Support Vector Machine, Multi Layer Perception등의 기계학습 기법을 이용하여 개봉 일 후, 개봉1주 후, 개봉 2주 후 시점에는 차주 누적 관람객 수를 예측하고 개봉 3주 후 시점에는 총 관람객 수를 예측하였다. 새롭게 제시한 변수들을 포함한 모델과 포함하지 않은 모델을 구성하여 실험하였고 비교를 위해 매 예측시점마다 동일한 예측 요인을 사용하여 총 관람객 수도 예측해보았다. 분석결과 동일한 시점에 총 관람객 수를예측했을 경우 보다 차주 누적 관람객 수를 예측하는 것이 더 높은 정확도를 보였으며. 새롭게 제시한 변수들을포함한 모델의 정확도가 대부분 높았으며 통계적으로 그 차이가 유의함으로써 정확도에 기여했음을 확인할 수있었다. 기계학습 기법 중에는 Random Forest가 가장 높은 정확도를 보였다.
Cite this article
JIIS Style
Song, J., K. Choi, and G. Kim, "Development of New Variables Affecting Movie Success and Prediction of Weekly Box Office Using Them Based on Machine Learning", Journal of Intelligence and Information Systems, Vol. 24, No. 4 (2018), 67~83.

IEEE Style
Junga Song, Keunho Choi, and Gunwoo Kim, "Development of New Variables Affecting Movie Success and Prediction of Weekly Box Office Using Them Based on Machine Learning", Journal of Intelligence and Information Systems, vol. 24, no. 4, pp. 67~83, 2018.

ACM Style
Song, J., Choi, K., and Kim, G., 2018. Development of New Variables Affecting Movie Success and Prediction of Weekly Box Office Using Them Based on Machine Learning. Journal of Intelligence and Information Systems. 24, 4, 67--83.
Export Formats : BiBTeX, EndNote

Warning: include(/home/hosting_users/ev_jiisonline/www/admin/archive/advancedSearch.php) [function.include]: failed to open stream: No such file or directory in /home/hosting_users/ev_jiisonline/www/archive/detail.php on line 429

Warning: include() [function.include]: Failed opening '/home/hosting_users/ev_jiisonline/www/admin/archive/advancedSearch.php' for inclusion (include_path='.:/usr/local/php/lib/php') in /home/hosting_users/ev_jiisonline/www/archive/detail.php on line 429
@article{Song:JIIS:2018:750,
author = {Song, Junga and Choi, Keunho and Kim, Gunwoo},
title = {Development of New Variables Affecting Movie Success and Prediction of Weekly Box Office Using Them Based on Machine Learning},
journal = {Journal of Intelligence and Information Systems},
issue_date = {December 2018},
volume = {24},
number = {4},
month = Dec,
year = {2018},
issn = {2288-4866},
pages = {67--83},
url = {http://dx.doi.org/10.13088/jiis.2018.24.4.067 },
doi = {10.13088/jiis.2018.24.4.067},
publisher = {Korea Intelligent Information System Society},
address = {Seoul, Republic of Korea},
keywords = { Movie, Box Office, Box Office Revenue, Box Office Factors, Prediction of Box Office, Predicting Number of Audience and Machine Learning
},
}
%0 Journal Article
%1 750
%A Junga Song
%A Keunho Choi
%A Gunwoo Kim
%T Development of New Variables Affecting Movie Success and Prediction of Weekly Box Office Using Them Based on Machine Learning
%J Journal of Intelligence and Information Systems
%@ 2288-4866
%V 24
%N 4
%P 67-83
%D 2018
%R 10.13088/jiis.2018.24.4.067
%I Korea Intelligent Information System Society