Play Kaggle with R, Facebook V: Predicting Check Ins
@Mia (R-Ladies)
library(dplyr)
# r_ladies_global: hypothetical data frame with columns `from` and `travel_to`
r_ladies_global %>%
  filter(from == 'Taipei', travel_to == 'Lisbon')
The Agenda
First: Brief Intro
# About R-Ladies
# About me
Second: Hey, Kaggle
# with R-Ladies
# with Masters
Third: Play R
# Warm Up
# EDA, Shiny Apps
# Azure Jupyter Notebook
Last: Q&A
# Recap
# Resource Sharing
1.
Brief Intro
1.
Brief Intro
Intro about R-Ladies and me!
https://rladiestaipei.github.io/R-Ladies-Taipei/
R-Ladies Taipei
[Figure: labels "Berlin" and "Taiwan"]
● Founded in 2014
● Meetups every month in Taipei
● Our history: goo.gl/HbHNeP
Hello!
I am Mia Chang (張懷文).
▪ Data Scientist, Lecturer
▪ Member of R-Ladies Taipei
▪ Co-founder of Azure Taiwan Community
▪ Microsoft Most Valuable Professional (MVP) 2017
2.
Hey, Kaggle
With R-Ladies and other masters!
3.
Play R
# Warm Up - Problem background and problem definition
# Azure Jupyter Notebook
# Use Jupyter to access the data
# Conclusion
Warm Up -
For those who have no modeling experience yet
Warm Up -
Problem background and problem definition
With 8.6 million test records there are about a trillion (10^12)
place-observation combinations.
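The order of magnitude can be checked with quick arithmetic. The slide gives the number of test records but not the number of distinct places, so the figure of roughly 100,000 places used below is an assumption made only for illustration.

```r
# Back-of-the-envelope check of the "about a trillion" figure.
n_test_records <- 8.6e6  # from the slide
n_places       <- 1e5    # assumption: roughly 100,000 candidate places

# Every test record could, in principle, be paired with every place.
n_combinations <- n_test_records * n_places

format(n_combinations, scientific = TRUE)  # "8.6e+11"
round(log10(n_combinations), 1)            # 11.9, i.e. close to 10^12
```

This is why scoring every place for every observation is impractical, and why a candidate selection step is usually needed first.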
Warm Up -
Problem background and problem definition
Schema
row_id
x
y
accuracy
time
place_id
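As a rough sketch, the schema can be mocked up as a small data frame to get a feel for the columns. All values below are made up, the value ranges are assumptions, and the real place_id values are not single letters.

```r
# Toy data frame with the same column names as the schema on the slide.
# Every value is invented for illustration; none of it is real check-in data.
set.seed(42)
n <- 10
train <- data.frame(
  row_id   = seq_len(n) - 1L,
  x        = runif(n, 0, 10),                  # assumed coordinate range
  y        = runif(n, 0, 10),                  # assumed coordinate range
  accuracy = sample(1:200, n, replace = TRUE), # assumed range
  time     = sort(sample(1:100000, n)),        # assumed range
  place_id = sample(c("A", "B", "C"), n, replace = TRUE),
  stringsAsFactors = FALSE
)
str(train)

# place_id is the column to predict, so a test set would carry
# every column except that one.
test_cols <- setdiff(names(train), "place_id")
test_cols
```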
EDA
Warm Up - Problem background and problem definition
"Three weeks into the eight-week competition, I climbed to the top of the public leaderboard with about 50 features:
1. Summary data, such as the number of historical check-ins.
2. The historical density of a place candidate, one year prior to the observation.
3. All features are rescaled if needed so that the train and test features have similar interpretations."
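The first and third points in the quote can be illustrated with a minimal sketch: count historical check-ins per place, then rescale the count to a share. The data is made up, and dividing by the total is only one possible rescaling, assumed here for illustration; it is not necessarily what the quoted solution did.

```r
# Toy check-in history (made-up values).
train <- data.frame(
  place_id = c("A", "A", "A", "B", "B", "C"),
  time     = c(10, 20, 30, 15, 40, 25)
)

# Summary feature: number of historical check-ins per place.
counts <- as.data.frame(
  table(place_id = train$place_id),
  responseName = "n_checkins",
  stringsAsFactors = FALSE
)

# Rescale the raw count to a share of all check-ins, so the value keeps
# the same meaning in data sets of different sizes (assumed rescaling).
counts$checkin_share <- counts$n_checkins / sum(counts$n_checkins)
counts
```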
EDA -
Missing data
EDA -
People check in via GPS, Wi-Fi, or cellular
EDA -
Peak check-in days
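Two of the EDA checks named above, missing data and peak check-in days, could be sketched as follows. The data is made up, and treating the time column as minutes is an assumption, since the slides do not state its unit.

```r
# Toy data (made-up values); time is ASSUMED to be measured in minutes.
train <- data.frame(
  x        = c(1.2, 3.4, NA, 5.6, 7.8, 2.1),
  y        = c(0.5, 0.7, 0.9, NA, 1.1, 1.3),
  accuracy = c(65, 10, 70, 65, 5, 160),
  time     = c(30, 900, 1500, 1600, 2000, 2950)
)

# Missing data: count NA values in each column.
missing_per_column <- colSums(is.na(train))
missing_per_column

# Peak check-in days: turn minutes into a day index, then count rows per day.
minutes_per_day <- 24 * 60
train$day <- train$time %/% minutes_per_day
checkins_per_day <- table(train$day)
peak_day <- names(checkins_per_day)[which.max(checkins_per_day)]
peak_day
```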
Play with the data - Shiny App
Play with the data - GitHub
Play with the data - Azure Jupyter Notebook
# Algorithm and conclusion
# Rcpp
# It was expected that it would be clearly correlated with the variation in x and y, but the pattern is not as obvious. Halfway through the competition I cracked the code ...
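One way to look for the kind of relationship the quote describes is to compare the spread of x and y across bins of another column. The sketch below assumes that the "it" in the quote refers to the accuracy column; the data is made up and the bin boundaries are arbitrary.

```r
# Toy check-ins for a single hypothetical place (made-up values).
set.seed(1)
n <- 300
checkins <- data.frame(
  accuracy = sample(c(10, 65, 200), n, replace = TRUE),
  x        = rnorm(n, mean = 5, sd = 0.3),
  y        = rnorm(n, mean = 5, sd = 0.02)
)

# Bin accuracy (arbitrary boundaries), then compare the standard deviation
# of x and y within each bin.
checkins$accuracy_bin <- cut(
  checkins$accuracy,
  breaks = c(0, 50, 100, Inf),
  labels = c("low", "medium", "high")
)
spread_by_bin <- aggregate(cbind(x, y) ~ accuracy_bin, data = checkins, FUN = sd)
spread_by_bin
```

In this toy data accuracy is generated independently of x and y, so the spread is similar in every bin. On real data, a clear trend across bins would point to a direct relationship.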
4.
Q&A
# Recap
# Action Item
Recap
First: Brief Intro
# About R-Ladies
# About me
Second: Hey, Kaggle
# with R-Ladies
# with Masters
Third: Play R
# Warm Up
# EDA, Shiny Apps
# Azure Jupyter Notebook
Last: Q&A
# Recap
# Resource Sharing
Action Item
First Second Third
Hi, Kaggle Play R
Get your
partners
Visit R-Ladies
R-Basic too!
Then
...
Thanks for listening!
We look forward to your visit to R-Ladies Taipei, and to Azure Taiwan too!
Bye!
I am Mia Chang (張懷文)
▪ mia5419@gmail.com
▪ facebook.com/mia5419
Take Away & Reference
Take Away
1. Use EDA to help you find more features.
2. Go to the Kaggle website for more resources: forum, kernels.
3. Whether you are learning R or traveling to visit other R-Ladies chapters, call us for more resources :)
Reference
1. R-Ladies Meetup Page
2. R-Ladies Facebook Group
3. Blog Post by Tom Van de Wiele - Details about the implementation
4. GitHub Repository
5. Shiny App by Tom Van de Wiele - EDA that you can learn more from
6. Kaggle Event Page
7. Microsoft Azure Notebooks
