Professional Documents
Culture Documents
Session 7 - Osorio
Session 7 - Osorio
Javier Osorio
University of Arizona
School of Government and Public Policy
Strategy effectiveness?
• Align activities, capabilities, and tactics to an end
• We need high-quality, valid, and verifiable data
BERT
• Google’s pre-trained language model
• Learn from billions of documents
• Generic documents limited performance
ConfliBERT
• Domain specific: conflict + violence + crime
• English sources
Osorio - UA Tracking Organized Crime Using ML & NLP UNODC 2022
NLP and ML applications
Colombia
• NLP tools to track the violent presence of armed actors
Mexico
• ML and NLP to track violent presence of criminal groups
Latin America
• Organized crime activity
• Criminal Organizations
• 10 main groups
• 200 subgroups and gangs
• Actor geo-location
• Eventus ID
• Municipality-day
• Dictionary: 7,900 actors
• Dynamic GIS interface
Osorio - UA Tracking Organized Crime Using ML & NLP UNODC 2022
Organized Crime activities in LA
• Source:
• Insight Crime
• July 2004 to March 2020
• 13,000 news articles in English
• ML Task:
• Multi-label classification