<?xml version="1.0" encoding="utf-8"?><feed xmlns="http://www.w3.org/2005/Atom" ><generator uri="https://jekyllrb.com/" version="3.10.0">Jekyll</generator><link href="https://linusdietz.com/feed.xml" rel="self" type="application/atom+xml" /><link href="https://linusdietz.com/" rel="alternate" type="text/html" /><updated>2026-03-09T07:35:24+00:00</updated><id>https://linusdietz.com/feed.xml</id><title type="html">Dr. Linus Dietz</title><subtitle>About computer science, clean code, and open source.</subtitle><author><name>Linus Dietz</name><email>mail@linusdietz.com</email></author><entry><title type="html">Understanding the Potential of Urban Parks to Promote Well-being</title><link href="https://linusdietz.com/parks-for-wellbeing/" rel="alternate" type="text/html" title="Understanding the Potential of Urban Parks to Promote Well-being" /><published>2025-11-21T00:00:00+00:00</published><updated>2025-11-21T00:00:00+00:00</updated><id>https://linusdietz.com/parks-for-wellbeing</id><content type="html" xml:base="https://linusdietz.com/parks-for-wellbeing/"><![CDATA[<p>Urban parks are vital public health assets, providing citizens to recreate, socialize, and perform their hobbies.  However, their health benefits are rarely understood through the range of activities they support. A King’s College London and Nokia Bell Labs study published in <a href="https://doi.org/10.1038/s44284-025-00345-4">Nature Cities</a>, investigates parks’ qualities in a global analysis of more than 23,000 parks in 35 cities on five continents. The study assesses how well parks support health-promoting activities and where they fall short.
The researchers identified six types of health-related activities that parks can support:</p>

<ul>
  <li><strong>Physical</strong> (e.g., walking, biking, sports)</li>
  <li><strong>Nature appreciation</strong> (e.g., birdwatching, picnicking)</li>
  <li><strong>Environmental</strong> (e.g., gardening, conservation)</li>
  <li><strong>Social</strong> (e.g., festivals, volunteering)</li>
  <li><strong>Cultural</strong> (e.g., performances, exhibitions)</li>
  <li><strong>Mindfulness</strong> (e.g., yoga, meditation)</li>
</ul>

<p><img src="/assets/images/map-parks.png" alt="image" /> Figure: Visualization of the park scores in London for <strong>(a)</strong> nature appreciation and <strong>(b)</strong> physical activities. Explore all parks in 35 cities using the <a href="https://linusdietz.com/research/healthy_parks/">interactive visualization</a>. CC-BY Linus Dietz, map data from <a href="https://www.openstreetmap.org/copyright">OpenStreetMap</a>.</p>

<h3 id="how-to-measure-parks-support-for-well-being">How to Measure Parks’ Support for Well-being</h3>

<p>The researchers collected all features found in parks (such as benches, ponds, or sports courts) from OpenStreetMap and linked them to activity types. Because the number of tags is large, they used a validated Large Language Model approach  to compile a <a href="https://github.com/LinusDietz/Health-Promoting-Parks-Replication/blob/main/data/lexicon.csv">lexicon</a> of 1432 park features and their corresponding health-promoting activities.
To score each park, they built statistical models that measure how well a park supports specific health-related activities. These scores show how many health-promoting elements a park has compared with an average park in the same city. To reflect cultural and climatic differences, each model is separate for each city.
The validation of the scores was done by comparing them with a dataset of 10 million geolocated Flickr photos taken in parks. Parks with higher scores had more photos of people engaged in the relevant activities.</p>

<p>The approach is first to systematically assess the qualities of parks for wellbeing on a global scale.</p>

<h3 id="key-findings">Key Findings</h3>

<ol>
  <li><strong>Regional differences in park design priorities:</strong> European parks prioritize opportunities for nature appreciation, whereas North American parks are focused around physical activities.</li>
  <li><strong>Spatial gradient:</strong> Inner-city parks consistently achieve higher scores across all activity categories compared to peripheral parks.</li>
  <li><strong>Inequality patterns:</strong> The degree of inequality in park activity scores shows no consistent geographic trend across cities, with Copenhagen and Rio de Janeiro exhibiting the greatest disparities and Tokyo, Paris, and Auckland the most equitable offerings. Generally, physical activity infrastructure is fairly evenly distributed but features supporting social interaction tend to be concentrated in a few parks.</li>
</ol>

<p><img src="/assets/images/parks-disparity.png" alt="image" /> Figure: Distribution of park scores by <strong>(a)</strong> distance quartile to the city center, <strong>(b)</strong> activity category, and <strong>(c)</strong> continent. CC-BY Linus Dietz.</p>

<h3 id="implications">Implications</h3>

<p>The paper provides actionable insights into the current state of urban parks, enabling the assessment of existing shortcomings and supporting evidence-based decisions for future development. In interviews, park managers and urban designers highlighted that such quantitative evidence can inform decisions about which features to prioritize and can help justify and secure funding for targeted improvements. Furthermore, urban planners can use the park scoring system to identify underserved areas and to prioritize investments that promote more equitable access to high-quality, health-promoting green spaces.</p>

<h3 id="resources">Resources</h3>

<ul>
  <li>Paper: <a href="../assets/papers/NatureCities2025UnderstandingThePotentialOfUrban.pdf" target="_blank">Understanding the Potential of Urban Parks to Promote Well-being</a>.  Nature  Cities  2025</li>
  <li><a href="https://social-dynamics.net/healthy-parks/">Project page</a></li>
  <li><a href="/research/healthy_parks/">Interactive Visualization</a></li>
  <li><a href="https://github.com/LinusDietz/Health-Promoting-Parks-Replication">Replication package</a> with code and data</li>
</ul>]]></content><author><name>Linus Dietz</name><email>mail@linusdietz.com</email></author><summary type="html"><![CDATA[Urban parks are vital public health assets, providing citizens to recreate, socialize, and perform their hobbies. However, their health benefits are rarely understood through the range of activities they support. A King’s College London and Nokia Bell Labs study published in Nature Cities, investigates parks’ qualities in a global analysis of more than 23,000 parks in 35 cities on five continents. The study assesses how well parks support health-promoting activities and where they fall short. The researchers identified six types of health-related activities that parks can support:]]></summary></entry><entry><title type="html">Final Year Data Science Projects 2025/26</title><link href="https://linusdietz.com/final-year-projects-2025/" rel="alternate" type="text/html" title="Final Year Data Science Projects 2025/26" /><published>2025-11-02T00:00:00+00:00</published><updated>2025-11-02T00:00:00+00:00</updated><id>https://linusdietz.com/final-year-projects-2025</id><content type="html" xml:base="https://linusdietz.com/final-year-projects-2025/"><![CDATA[<p>This academic year, I am offering a range of final-year projects for MSc Data Science students at King’s College London. The topics are motivated by current research in urban computing and data governance.</p>

<h2 id="quantifying-urban-greenspace-globally">Quantifying Urban Greenspace Globally</h2>

<p>Recent research has highlighted the role of specific spaces within urban parks in supporting physical and mental wellbeing [1]. Using the Lexicon of Health-Related Park Features [2], a dataset of 23,477 parks across 35 cities was evaluated based on their potential to promote health-related activities.</p>

<p>This project offers students the opportunity to apply and extend their data science skills to a global-scale environmental challenge. Building upon an existing data processing pipeline for urban park analysis [3], the aim is to expand the scope to include all cities worldwide and incorporate a broader range of urban greenspaces—such as community gardens, green corridors, and other forms of urban vegetation. The analysis will be complemented with additional environmental indicators, including urban heat stress [4], air pollution, and vegetation indices such as NDVI, primarily derived from satellite observations.</p>

<p>An important aspect of this work is understanding the urban context of greenspaces; their spatial location, accessibility, and surrounding built environment characteristics such as building density and land-use type. Developing such contextual indices provides valuable insights into how greenspaces are used and helps explain their broader ecological and social roles within cities [5,6].</p>

<ul>
  <li>[1] <a href="https://linusdietz.com/assets/papers/NatureCities2025Health-promotingPotentialOfParksIn.pdf">Health-promoting Potential of Parks in 35 Cities Worldwide</a>. Nature Cities 2025</li>
  <li>[2] <a href="https://github.com/LinusDietz/Health-Promoting-Parks-Replication/blob/main/data/lexicon.csv">Lexicon of Health-Related Park Features (OpenStreetMap)</a></li>
  <li>[3] <a href="https://github.com/LinusDietz/Health-Promoting-Parks-Replication/">Park Analysis Pipeline</a></li>
  <li>[4] Parks, S. A., Dillon, G. K., &amp; Miller, C. (2014). A new metric for quantifying burn severity: the relativized burn ratio. Remote Sensing, 6(3), 1827-1844.</li>
  <li>[5] Larson, K. L., Brown, J. A., Lee, K. J., &amp; Pearsall, H. (2022). Park equity: why subjective measures matter. Urban Forestry &amp; Urban Greening, 76, 127733.</li>
  <li>[6] Kaczynski, A. T., Schipperijn, J., Hipp, J. A., Besenyi, G. M., Stanis, S. A. W., Hughey, S. M., &amp; Wilcox, S. (2016). ParkIndex: Development of a standardized metric of park access for research and planning. Preventive medicine, 87, 110-114.</li>
</ul>

<h2 id="analyzing-and-visualizing-the-inequalities-of-londons-built-environment-and-its-effect-on-health">Analyzing and Visualizing the Inequalities of London’s Built Environment and Its Effect on Health</h2>

<p>This project aims to develop a comprehensive analytical and visualization framework to quantify and understand inequalities across London’s built environment and their impact on public health. By integrating diverse spatial and socioeconomic datasets, the project will construct multi-dimensional indices of inequality that capture environmental, infrastructural, and health-related disparities across London boroughs. The framework will provide a quantifiable and scalable tool for assessing how variations in the built environment contribute to health inequities, supporting evidence-based urban policy and planning.</p>

<p>The built environment component investigates how the characteristics of places where people live influence their daily wellbeing. This includes examining air quality and pollution levels, noise exposure, access to green spaces and tree coverage, and neighbourhood resilience to extreme weather events such as flooding and heatwaves. Special attention is given to the vulnerabilities of specific population groups and the cumulative effects of environmental stressors. By aggregating and analysing these factors, the project explores how urban form and environmental conditions interact to shape comfort, safety, and overall quality of life across London.</p>

<p>The health equity component examines spatial patterns of health outcomes and access to healthcare. It analyses indicators such as obesity prevalence, GP prescription rates, and accessibility of healthcare services, alongside official measures of health deprivation. This analysis reveals where and why health inequalities persist and how they relate to environmental and infrastructural factors. Together, these insights aim to illuminate the links between the built environment and health outcomes, providing actionable intelligence for local authorities, urban planners, and public health agencies.</p>

<ul>
  <li>Sanja Šćepanović, Ivica Obadic, Sagar Joglekar, Laura Giustarini, Cristiano Nattero, Daniele Quercia, and Xiao Xiang Zhu. 2024. MEDSAT: a public health dataset for england featuring medical prescriptions and satellite imagery. In Proceedings of the 37th International Conference on Neural Information Processing Systems (NIPS ‘23)</li>
  <li>Schipperijn, J., Madsen, C.D., Toftager, M. et al. The role of playgrounds in promoting children’s health – a scoping review. Int J Behav Nutr Phys Act 21, 72 (2024). https://doi.org/10.1186/s12966-024-01618-2</li>
  <li>Bach, B., Freeman, E., Abdul-Rahman, A., Turkay, C., Khan, S., Fan, Y., &amp; Chen, M. (2022). Dashboard design patterns. IEEE Transactions on Visualization and Computer Graphics, 29(1), 342-352. https://dashboarddesignpatterns.github.io</li>
  <li>https://data.london.gov.uk/</li>
  <li>https://www.gov.uk/government/collections/english-indices-of-deprivation</li>
</ul>

<h2 id="ai-for-data-governance">AI for Data Governance</h2>

<p>The exchange of data within and between organizations is governed by company policies and data protection laws. As policies and data flows change over time, maintaining compliance in data exchange poses a complex challenge. Computational data governance refers to automating governance tasks using AI tools that support the organization and its employees to comply with policies and allow data sharing to foster the business needs. Currently, this is a disruptive field in enterprise software with many emerging solutions because all the current and future AI systems need to maintain access to data, which is only possible with highly automated and sophisticated data governance.</p>

<p>The project is about using popular open source software [1, 1b] and open standards [2,3] from the Linux Foundation to develop and evaluate solutions that facilitate data governance within organizations. Based on seminal work [4], further aspects of data governance will be explored, including data quality assurances, metadata tagging, query generation, query validation, AI explanations, and scaling to Big Data problems. An initial dataset of annotated data access requests is provided [5], which is to be adopted to other use cases.</p>

<p>Successful candidates will learn how to employ Large Language Models on a critical problem the IT industry currently faces. Gartner predicts: <em>“By 2027, for example, 60% of organizations will fail to realize the anticipated value of their AI use cases due to incohesive data governance frameworks [6].”</em> This is your chance to prove them wrong and become an expert in a highly-sought after field.</p>

<ul>
  <li>[1] Data Product MCP https://github.com/entropy-data/dataproduct-mcp</li>
  <li>[1b] Data Contract CLI https://github.com/datacontract/datacontract-cli</li>
  <li>[2] Bitol, Open Data Product Standard (ODPS). Linux Foundation AI &amp; Data, 2025. Version 1.0.0.</li>
  <li>[3] Bitol, Open Data Contract Standard (ODCS). Linux Foundation AI &amp; Data, 2025. Version 3.0.1.</li>
  <li>[4] Dietz, L. W., Wider, A., &amp; Harrer, S. (2025). Automating Data Governance with Generative AI. Proceedings of the AAAI/ACM Conference on AI, Ethics, and Society, 8(1), 760-771. https://doi.org/10.1609/aies.v8i1.36587</li>
  <li>[5] Supplementary Material for “Automating Data Governance with Generative AI” https://github.com/LinusDietz/Automating-Data-Governance</li>
  <li>[6] https://www.gartner.com/en/data-analytics/topics/data-governance</li>
</ul>]]></content><author><name>Linus Dietz</name><email>mail@linusdietz.com</email></author><summary type="html"><![CDATA[This academic year, I am offering a range of final-year projects for MSc Data Science students at King’s College London. The topics are motivated by current research in urban computing and data governance.]]></summary></entry><entry><title type="html">Automating Data Governance with Generative AI</title><link href="https://linusdietz.com/automating-data-governance/" rel="alternate" type="text/html" title="Automating Data Governance with Generative AI" /><published>2025-10-21T00:00:00+00:00</published><updated>2025-10-21T00:00:00+00:00</updated><id>https://linusdietz.com/automating-data-governance</id><content type="html" xml:base="https://linusdietz.com/automating-data-governance/"><![CDATA[<p>Organizations now manage complex data systems, and meeting privacy laws such as the GDPR has become both vital and expensive. We examined how large language models (LLMs) can support data governance by generating warnings about data access decisions in decentralized data systems.</p>

<p>In the past, data governance relied on checklists, spreadsheets, or domain-specific languages for access control. With advances in natural language processing, data marketplaces within decentralized systems can now help decision-makers manage data sharing more effectively.</p>

<p>Laws such as the EU General Data Protection Regulation, the California Consumer Privacy Act, and the EU AI Act require strict control over personal and sensitive data. Organizations must balance innovation with compliance under these rules.</p>

<h2 id="data-marketplaces-and-ai-assisted-data-governance">Data Marketplaces and AI-Assisted Data Governance</h2>

<p>In data mesh architectures, domain teams manage their own data products within a federated environment. Each product defines its usage rules and guarantees through a data contract, which states what data can be shared and under what conditions.
To coordinate these contracts, an enterprise data marketplace records, reviews, and approves or rejects access requests between data products.</p>

<p>At the center of this marketplace is Governance AI, an LLM-powered tool that checks whether a data access request complies with the provider’s data contract, company policies, and legal requirements such as the GDPR. Governance AI does not make final decisions. Instead, it issues structured warnings and suggestions for correction to guide human experts.</p>

<h2 id="generating-realistic-testing-datasets">Generating Realistic Testing Datasets</h2>

<p>Testing compliance systems is difficult. Real-world datasets contain confidential information, and manually creating synthetic test cases takes time.
To evaluate the system, we used LLMs to generate realistic data access requests, building on the work of <a href="https://ojs.aaai.org/index.php/AIES/article/view/31660" target="_blank">Herdel et al. (2024)</a>. Each test case included metadata about data products, privacy policies, and data contracts to create plausible access scenarios. For example, a marketing team requesting customer purchase data for campaign optimization.</p>

<p>Domain experts from the insurance and e-commerce sectors reviewed the generated requests. Most were judged realistic, and some closely matched real-world use cases; the rest were discarded.</p>

<h2 id="evaluating-computational-governance">Evaluating Computational Governance</h2>

<p>In the evaluation, Governance AI was compared with domain experts who assessed 110 access requests across both sectors.</p>

<p>Governance AI issued 3.6 times more warnings than human experts. It did not miss any case where experts raised a compliance concern. After a secondary review, experts judged 80% of the AI’s warnings to be correct.</p>

<p>This cautious approach may slow some data-sharing workflows, but it supports a key compliance principle: prevent breaches first, optimize later. The AI also offered actionable suggestions, which proved more useful in e-commerce than in highly regulated fields such as insurance.</p>

<h2 id="findings-and-implications">Findings and Implications</h2>

<p>The study highlights several insights for organizations pursuing AI-assisted data governance. Governance AI can <strong>handle complex policy reasoning without missing critical cases</strong>, showing that automated assistance is both feasible and effective. Synthetic access requests generated by LLMs can <strong>realistically simulate</strong> real-world governance scenarios, providing a practical way to test systems at scale. Despite the AI’s cautious approach, <strong>human oversight remains essential</strong> to ensure contextual and legal accuracy. In privacy-sensitive contexts, a stricter approach is often safer, as over-warning carries less risk than under-warning. Differences across sectors also matter: e-commerce scenarios allowed flexible mitigation, such as anonymization, while insurance required precise legal compliance. Beyond compliance, we explored the potential for <strong>continuous governance</strong>, where AI systems dynamically test and monitor policy adherence as data landscapes evolve.</p>

<p>This is a project in collaboration with Arif Wider from HTW Berlin as well as Simon Harrer, who co-founded Entropy Data and built the <a href="https://www.entropy-data.com/">data marketplace</a>.</p>

<p>Linus W. Dietz, Arif Wider, &amp; Simon Harrer. <a href="../assets/papers/AIES25AutomatingDataGovernanceWithGenerative.pdf" target="_blank">Automating  Data  Governance  with  Generative  AI</a>.  AAAI / ACM Conference on Artificial Intelligence, Ethics, and Society 2025.  <a href="https://doi.org/10.1609/aies.v8i1.36587"><i class="ai  ai-doi" style="color:  black;"></i></a></p>]]></content><author><name>Linus Dietz</name><email>mail@linusdietz.com</email></author><summary type="html"><![CDATA[Organizations now manage complex data systems, and meeting privacy laws such as the GDPR has become both vital and expensive. We examined how large language models (LLMs) can support data governance by generating warnings about data access decisions in decentralized data systems.]]></summary></entry><entry><title type="html">The Experience of Running: Recommending Routes Using Sensory Mapping in Urban Environments</title><link href="https://linusdietz.com/experience-of-running/" rel="alternate" type="text/html" title="The Experience of Running: Recommending Routes Using Sensory Mapping in Urban Environments" /><published>2025-05-16T00:00:00+00:00</published><updated>2025-05-16T00:00:00+00:00</updated><id>https://linusdietz.com/experience-of-running</id><content type="html" xml:base="https://linusdietz.com/experience-of-running/"><![CDATA[<p>Most apps that have a running route recommendation feature suggest routes based on distance, surface, elevation, and popularity, disregarding other factors that influence the psychological experience of running.</p>

<p>In a mixed-method study combining interviews with runners and a large-scale online survey, we captured what aspects influence the running experience. Analyzing the answers, we were able to group them into performance and achievement, environmental quality, and mental and social connectedness. These insights led to the creation of the Experience of Running Scale designed to measure how a run feels.
Clustering the responses also revealed two major groups of runners’ needs in terms of urban running routes. Sometimes, runners are drawn to scenic paths, which are lush, peaceful, and rich with natural beauty. Others thrive on the energy of urban routes, where the city pulses with life.</p>

<p>Interestingly, these preferences often line up with our personalities. Runners who score higher on neuroticism tend to seek the quiet solitude of nature, while extroverts are more likely to enjoy the buzz of city streets. Using this data, we built a personalized routing engine that accounts for environmental factors like sound, smell, surface type, and traffic to recommend routes that better match what we need.</p>

<p>In a routing engine prototype for London running routes, we mapped the perceivable environment using open data from platforms like Flickr, OpenStreetMap, and local crime reports to be able to recommend these two types of routes. While most runners we asked about their route preferences preferred scenic routes for their calm and beauty, situations like running in the dark flipped this tendency towards the more crowded urban routes as safety concerns trumped serenity.</p>

<p>We hope that by making it easier to plan routes tailored to individual needs and preferences, we can lower the barriers to going out, encouraging people to exercise with greater joy and confidence.</p>

<p>Hänsel, K., Aiello, L. M., Quercia, D., Schifanella, R., Varga, K. Z., Dietz, L. W., &amp; Constantinides, M. (2025). The experience of running: Recommending routes using sensory mapping in urban environments. International Journal of Human-Computer Studies.</p>]]></content><author><name>Linus Dietz</name><email>mail@linusdietz.com</email></author><summary type="html"><![CDATA[Most apps that have a running route recommendation feature suggest routes based on distance, surface, elevation, and popularity, disregarding other factors that influence the psychological experience of running]]></summary></entry><entry><title type="html">Understanding the Influence of Data Characteristics on the Performance of Point-of-Interest Recommendation Algorithms</title><link href="https://linusdietz.com/poi-rec-data-characteristics/" rel="alternate" type="text/html" title="Understanding the Influence of Data Characteristics on the Performance of Point-of-Interest Recommendation Algorithms" /><published>2025-04-22T00:00:00+00:00</published><updated>2025-04-22T00:00:00+00:00</updated><id>https://linusdietz.com/poi-rec-data-characteristics</id><content type="html" xml:base="https://linusdietz.com/poi-rec-data-characteristics/"><![CDATA[<p>This work is a joint project with Pablo Sánchez and Alejandro Bellogín, we investigated the effects of data characteristics on the performance of point-of-interest recommendation algorithms.</p>

<p>Point-of-interest (POI) recommenders have become ubiquitous tools for travelers, guiding them on where to eat, explore, or relax. But while researchers and developers often focus on which algorithm performs best there is little research on understanding why certain models perform better than others across datasets.
In this study, we set out to investigate that using data characteristics. Using Foursquare check-in data from New York City, we developed a framework that links the characteristics of a dataset, such as sparsity, popularity bias, and user mobility, to the performance of different recommendation algorithms analyzing accuracy, novelty and item exposure.
To uncover how data characteristics shape outcomes we curated 144 distinct data subsets, each reflecting real-world conditions: peak tourist seasons, domestic versus international users, and variations in user activity and mobility. We then trained a mix of classic and POI-specific recommendation models on each subset, analyzing the results using an explanatory framework powered by a multiple-regression model.</p>

<p>What we found confirmed our suspicions: widely studied data characteristics like density and popularity bias do play a role—but so do subtler dynamics, such as how far users tend to travel or how long they’ve been active on the platform. These hidden variables often determine whether a recommendation algorithm succeeds or falls short.
Our findings reinforce a key insight: there’s no universal best model. The optimal algorithm depends on the specific characteristics of the data and the goals of the recommendation system, trading-off accuracy, novelty, or equitable item exposure. By quantifying the characteristics of the data set, one can better select or fine-tune models to meet those goals.
Ultimately, we hope this work helps pave the way toward more relevant and serendipitous suggestions for users—and more balanced visibility for local businesses. Our framework represents a step toward more informed, intentional choices in recommendation system design to deliver recommendations that are relevant, but also fair, diverse, and genuinely helpful to both travelers and the communities they explore.</p>

<p>Linus W. Dietz, Pablo Sánchez, and Alejandro Bellogín. “Understanding the Influence of Data Characteristics on the Performance of Point-of-Interest Recommendation Algorithms”. In: Information Technology &amp; Tourism 27.1 (Jan. 2025), pp. 75–124. doi: <a href="https://doi.org/10.1007/s40558-024-00304-0">10.1007/s40558-024-00304-0</a>.</p>]]></content><author><name>Linus Dietz</name><email>mail@linusdietz.com</email></author><summary type="html"><![CDATA[Point-of-interest (POI) recommendations are essential for travelers and the e-tourism business. They assist in decision-making regarding what venues to visit and where to dine and stay. While it is known that traditional recommendation algorithms' performance depends on data characteristics like sparsity, popularity bias, and preference distributions, the impact of these data characteristics has not been systematically studied in the POI recommendation domain. To fill this gap, we extend a previously proposed explanatory framework by introducing new explanatory variables specifically relevant to POI recommendation. At its core, the framework relies on having subsamples with different data characteristics to compute a regression model, which reveals the dependencies between data characteristics and performance metrics of recommendation models. To obtain these subsamples, we subdivide a POI recommendation data set on New York City and measure the effect of these characteristics on different classical POI recommendation algorithms in terms of accuracy, novelty, and item exposure. Our findings confirm the crucial role of key data features like density, popularity bias, and the distribution of check-ins in POI recommendation. Additionally, we identify the significance of novel factors, such as user mobility and the duration of user activity. In summary, our work presents a generic method to quantify the influence of data characteristics on recommendation performance. The results not only show why certain POI recommendation algorithms excel in specific recommendation problems derived from a LBSN check-in data set in New York City, but also offer practical insights into which data characteristics need to be addressed to achieve better recommendation performance.]]></summary></entry><entry><title type="html">First Workshop on Urban Recommender Systems</title><link href="https://linusdietz.com/urbanrec2024/" rel="alternate" type="text/html" title="First Workshop on Urban Recommender Systems" /><published>2024-06-05T00:00:00+00:00</published><updated>2024-06-05T00:00:00+00:00</updated><id>https://linusdietz.com/urbanrec2024</id><content type="html" xml:base="https://linusdietz.com/urbanrec2024/"><![CDATA[<p>As we witness an ongoing centralization of the population towards urban landscapes, recommendation technology has an increasing role in people’s perception and decision-making in cities. People use recommender systems for choosing places to stay, which attractions and restaurants to visit, and route recommender systems for navigation. This workshop aims to bring together practitioners and academics working on improving recommendations in the urban space with an emphasis on both individual needs and urban health and well-being.</p>

<p>The ongoing urbanization leads to a concentration of population in cities, making it increasingly harder to navigate and exploit their potential. Recommendation technology has emerged as a pivotal tool shaping individuals’ attention and decision-making within these urban landscapes. People heavily rely on recommender systems to overcome the choice overload problem, from selecting accommodations to deciding on dining options and exploring attractions. Thus, recommendation algorithms significantly influence how individuals engage with and navigate cities.</p>

<p>The first Workshop on Urban Recommender Systems (UrbanRec) will bring together industry practitioners and academic scholars dedicated to advancing recommendation systems tailored to the unique dynamics of urban environments. We aim to highlight the advancements in recommender systems technology and also dedicate attention to the aspects of urban health and well-being, which can be positively impacted through the use of recommendation technology. By aligning technological advancements with societal goals, the workshop aims to lay the path towards more inclusive, vibrant, and livable cities where recommendation systems play a constructive role in enhancing the quality of life for all residents and visitors alike.</p>

<p>The workshop aims to identify pressing challenges and seize opportunities at the intersection of recommendation technology and urban sciences through interdisciplinary discussions. By integrating insights from diverse perspectives spanning urban planning, data science, psychology, and public health, participants will explore novel approaches to recommender systems that prioritize user autonomy, community well-being, and environmental sustainability. Ultimately, the workshop aims to foster a deeper understanding of opportunities in the complex interplay between technology and urban dynamics.</p>

<p>Workshop Topics</p>

<ul>
  <li>Mobile Urban Recommender Systems</li>
  <li>Point-of-Interest Recommendations</li>
  <li>Location-based Services and Route Recommendations</li>
  <li>Health-related Behavioral Change</li>
  <li>Supporting Policymaking and Urban Planning</li>
  <li>Responsible and Ethical Urban Recommender Systems</li>
  <li>Dataset Track</li>
</ul>

<p>Read the <a href="https://urbanrec.github.io/UrbanRec2024/call/">call for papers</a> on the <a href="https://urbanrec.github.io/UrbanRec2024">workshop website</a>.</p>

<p>The workshop is co-organized with Sanja Šćepanović, Rossano Schifanella, and Wolfgang Wörndl.</p>]]></content><author><name>Linus Dietz</name><email>mail@linusdietz.com</email></author><summary type="html"><![CDATA[As we witness an ongoing centralization of the population towards urban landscapes, recommendation technology has an increasing role in people’s perception and decision-making in cities. People use recommender systems for choosing places to stay, which attractions and restaurants to visit, and route recommender systems for navigation. This workshop aims to bring together practitioners and academics working on improving recommendations in the urban space with an emphasis on both individual needs and urban health and well-being.]]></summary></entry><entry><title type="html">Data-driven Destination Recommender Systems</title><link href="https://linusdietz.com/dissertation/" rel="alternate" type="text/html" title="Data-driven Destination Recommender Systems" /><published>2023-04-19T00:00:00+00:00</published><updated>2023-04-19T00:00:00+00:00</updated><id>https://linusdietz.com/dissertation</id><content type="html" xml:base="https://linusdietz.com/dissertation/"><![CDATA[<p>My <a href="https://mediatum.ub.tum.de/download/1685575/1685575.pdf">dissertation</a> is published as open access.</p>

<p>Deciding where to travel is a complex, emotionally involving, and financially relevant decision which people face relatively infrequently.
Some aspects of tourist recommendations, such as point-of-interest recommendation, hotel recommendations, or restaurant recommendations, are commercially well established, whereas there are few successful recommender systems for individual travel destinations.
In this thesis, we present several contributions in the context of destination recommendation covering traveler mobility analysis, destination characterization, and conversational recommender systems.</p>

<p>Understanding traveler mobility forms the basis for more personalized recommendations.
We propose methods to analyze global traveler mobility from location-based social networks to learn which data sources are suitable for analyses in this domain, how people travel around the world, and which types of travelers can be observed.
Our cluster analyses of trips and travelers reveal distinct groups, which can serve as an initial preference elicitation step, but we could also show that the common practice of evaluating point-of-interest recommendations without differentiating these groups leads to misleading results.
Furthermore, we use the mined trips to construct a specialized map of hierarchical travel regions and to recommend the personalized duration of stay at destinations.</p>

<p>To correctly match traveler preferences with destinations in the content-based recommendation domain, we investigate which data sources are suitable for characterizing destinations.
Constructing 18 data models and eliciting the concept of touristic experience in cities using an expert study, we determine that textual data sources, e.g., Wikipedia articles, do a good job of emulating the touristic experience using rank agreement metrics.
Additionally, we are able to optimize data sources with explicit features to be competitive by learning the importance of feature weights using black-box learning.</p>

<p>Finally, we present the CityRec conversational destination recommender system.
Since users often struggle to verbalize their true preferences and might have unrealistic expectations about destinations, we propose a novel conversational paradigm, “Navigation by Revealing Trade-offs”, to overcome the wishful-thinking problem and inform users of the trade-offs involved in choosing one destination over another.
The seamless integration of user interface and algorithms was evaluated using a large-scale user study with 600 participants.</p>

<p>Linus W. Dietz. “Data-Driven Destination Recommender Systems.” Technical University of Munich, 2023.</p>]]></content><author><name>Linus Dietz</name><email>mail@linusdietz.com</email></author><summary type="html"><![CDATA[This thesis explores various aspects of the destination recommendation domain, namely mobility analyses about which types of travelers can be observed in location-based social media data, how these groups perform in point-of-interest recommendation, how long travelers should stay at a destination, and which data sources are suitable to characterize destinations. Finally, we propose a city recommender system that supports users making the trade-offs involved in their travel choices.]]></summary></entry><entry><title type="html">Travelers vs. Locals: The Effect of Cluster Analysis in Point-of-Interest Recommendation</title><link href="https://linusdietz.com/travelers-vs-locals/" rel="alternate" type="text/html" title="Travelers vs. Locals: The Effect of Cluster Analysis in Point-of-Interest Recommendation" /><published>2022-07-04T00:00:00+00:00</published><updated>2022-07-04T00:00:00+00:00</updated><id>https://linusdietz.com/travelers-vs-locals</id><content type="html" xml:base="https://linusdietz.com/travelers-vs-locals/"><![CDATA[<p>This work is a joint project with Pablo Sánchez Pérez, which was very interesting as is brought together my previous work on traveler mobility analysis with the algorithmic analysis of point-of-interest recommendations.
The <a href="https://dl.acm.org/doi/10.1145/3503252.3531320">article is published</a> in the Proceedings of the 2022 ACM Conference on User Modeling, Adaptation and Personalization.</p>

<p>The involvement of geographic information differentiates point-of-interest recommendation from traditional product recommendation.
This geographic influence is usually manifested in the effect of users tending toward visiting nearby locations, but further mobility patterns can be used to model different groups of users.
In this study, we characterize the check-in behavior of local and traveling users in a global Foursquare check-in data set.
Based on the features that capture the mobility and preferences of the users, we obtain representative groups of travelers and locals through an independent cluster analysis.
Interestingly, for locals, the mobility features analyzed in this work seem to aggravate the cluster quality, whereas these signals are fundamental in defining the traveler clusters.
To measure the effect of such a cluster analysis when categorizing users, we compare the performance of a set of recommendation algorithms, first on all users and then on each user group separately in terms of ranking accuracy, novelty, and diversity.
Our results on the Foursquare data set of 139,270 users in five cities show that locals, despite being the most numerous groups of users, tend to obtain lower values than the travelers in terms of ranking accuracy while they also seem to receive more novel and diverse POI recommendations.
For travelers, we observe the advantages of popularity-based recommendation algorithms in terms of ranking accuracy by recommending venues related to transportation and large commercial establishments.
However, there are considerable differences in the respective traveler groups, especially between predominantly domestic and international travelers.
Due to the considerable influence of mobility on the recommendations, this article underlines the importance of analyzing user groups differently when making and evaluating personalized point-of-interest recommendations.</p>

<p>Pablo Sánchez and Linus W. Dietz. “Travelers vs. Locals: The Effect of Cluster Analysis in Point-of-Interest Recommendation.” In: 30th ACM Conference on User Modeling, Adaptation and Personalization. UMAP’22. New York, NY, USA: ACM, July 2022, pp. 132– 142. DOI: 10.1145/3503252.3531320</p>]]></content><author><name>Linus Dietz</name><email>mail@linusdietz.com</email></author><summary type="html"><![CDATA[The involvement of geographic information differentiates point-of-interest recommendation from traditional product recommendation. This geographic influence is usually manifested in the effect of users tending toward visiting nearby locations, but further mobility patterns can be used to model different groups of users.]]></summary></entry><entry><title type="html">A Comparative Study of Data-Driven Models for Travel Destination Characterization</title><link href="https://linusdietz.com/destination-characterization/" rel="alternate" type="text/html" title="A Comparative Study of Data-Driven Models for Travel Destination Characterization" /><published>2022-04-07T00:00:00+00:00</published><updated>2022-04-07T00:00:00+00:00</updated><id>https://linusdietz.com/destination-characterization</id><content type="html" xml:base="https://linusdietz.com/destination-characterization/"><![CDATA[<p>This publication is the result of a multi-year collaboration with our colleagues in Vienna, Mete Sertkan and Julia Neidhardt.
It tackles the problem of establishing a ground truth for the data models in content-based recommender algorithms in the domain of destination recommendation.
The article was <a href="https://www.frontiersin.org/articles/10.3389/fdata.2022.829939/full">published open access</a> in Frontiers in Big Data.</p>

<p>Characterizing items for content-based recommender systems is challenging in complex domains such as travel and tourism.
In the case of destination recommendation, no feature set can be readily used as a similarity ground truth, which makes it hard to evaluate the quality of destination characterization approaches.
Furthermore, the process should scale well for many items, be cost-efficient, and, most importantly, correct.
To evaluate which data sources are most suitable, we investigate 18 characterization methods that fall into the following categories: venue data, textual data, and factual data.
We make these data models comparable using rank agreement metrics and reveal which data sources capture similar underlying concepts.
To support choosing more suitable data models, we capture the desired concept using an expert survey and evaluate our characterization methods toward it.
We find that the textual models to characterize cities perform best overall, with data models based on factual and venue data being less competitive.
However, we show that data models with explicit features can be optimized by learning weights for their features.</p>

<p>Linus W. Dietz, Mete Sertkan, Saadi Myftija, Sameera Thimbiri Palage, Julia Neidhardt, and Wolfgang Wörndl. “A Comparative Study of Data-driven Models for Travel Destination Characterization.” In: Frontiers in Big Data 5 (Apr. 2022). ISSN: 2624-909X. DOI: 10.3389/fdata.2022.829939</p>]]></content><author><name>Linus Dietz</name><email>mail@linusdietz.com</email></author><summary type="html"><![CDATA[To evaluate which data sources are most suitable to characterize destinations, we investigate 18 characterization methods that fall into the following categories: venue data, textual data, and factual data.]]></summary></entry><entry><title type="html">Navigation by Revealing Trade-offs for Content-based Recommendations</title><link href="https://linusdietz.com/navigation-by-revealing-tradeoffs/" rel="alternate" type="text/html" title="Navigation by Revealing Trade-offs for Content-based Recommendations" /><published>2022-01-07T00:00:00+00:00</published><updated>2022-01-07T00:00:00+00:00</updated><id>https://linusdietz.com/navigation-by-revealing-tradeoffs</id><content type="html" xml:base="https://linusdietz.com/navigation-by-revealing-tradeoffs/"><![CDATA[<p>The continuation of the work on the initial <a href="/cityrec">CityRec System</a> led me to investigate how prospective travelers can make informed decisions about the trade-offs involved in choosing one destination over another.
This <a href="https://link.springer.com/chapter/10.1007/978-3-030-94751-4_14">paper was published</a> in the ENTER22 e-Tourism Conference and was a runner-up for the best paper award.</p>

<p>Tourism is a complex domain for recommender systems because of the high cost of recommending an unsuitable item and the absence of ratings to learn user preferences.
Conversational recommender systems have been introduced to provide users with an opportunity to give feedback on items in a turn-based dialog until a final recommendation is accepted.
In a scenario such as recommending a city to visit, conversational content-based recommendation may be well-suited since users often struggle to specify their preferences without concrete examples.
However, critiquing item features comes with challenges.
Users might request item characteristics during recommendation that do not exist in reality, for example, demanding very high item quality for a very low price.
To tackle this problem, we present a novel conversational user interface that focuses on revealing the trade-offs of choosing one item over another.
The recommendations are driven by a utility function that assesses the user’s preference toward item features while learning the importance of the features to the user.
This enables the system to guide the recommendation through the search space faster and accurately over prolonged interaction.
We evaluated the system in an online study with 600 participants and found that our proposed paradigm leads to improved perceived accuracy and fewer conversational cycles compared to unit critiquing.</p>

<p>Linus W. Dietz, Sameera Thimbiri Palage, and Wolfgang Wörndl. “Navigation by Revealing Trade-offs for Content-based Recommendations.” In: Information and Communication Technologies in Tourism. Ed. by Jason L. Stienmetz, Berta Ferrer-Rosell, and David Mas- simo. Cham: Springer, Jan. 2022, pp. 149–161. ISBN: 978-3-030-94751-4. DOI: 10.1007/978-3-030-94751-4_14</p>]]></content><author><name>Linus Dietz</name><email>mail@linusdietz.com</email></author><summary type="html"><![CDATA[Tourism is a complex domain for recommender systems because of the high cost of recommending an unsuitable item and the absence of ratings to learn user preferences.]]></summary></entry></feed>