> df_test.info()
Int64Index: 15000 entries, 4985 to 1561
Data columns (total 16 columns):
video_id 15000 non-null object
trending_date 15000 non-null object
title 15000 non-null object
channel_title 15000 non-null object
category_id 15000 non-null int64
publish_time 15000 non-null object
tags 15000 non-null object
views 15000 non-null int64
likes 15000 non-null int64
dislikes 15000 non-null int64
comment_count 15000 non-null int64
thumbnail_link 15000 non-null object
comments_disabled 15000 non-null bool
ratings_disabled 15000 non-null bool
video_error_or_removed 15000 non-null bool
description 14717 non-null object
dtypes: bool(3), int64(5), object(8)
memory usage: 1.6+ MB
> df_validation.info()
nt64Index: 6965 entries, 0 to 21954
Data columns (total 16 columns):
video_id 6965 non-null object
trending_date 6965 non-null object
title 6965 non-null object
channel_title 6965 non-null object
category_id 6965 non-null int64
publish_time 6965 non-null object
tags 6965 non-null object
views 6965 non-null int64
likes 6965 non-null int64
dislikes 6965 non-null int64
comment_count 6965 non-null int64
thumbnail_link 6965 non-null object
comments_disabled 6965 non-null bool
ratings_disabled 6965 non-null bool
video_error_or_removed 6965 non-null bool
description 6829 non-null object
dtypes: bool(3), int64(5), object(8)
memory usage: 782.2+ KB