In this section, we describe our research findings in three major themes: aligning data collection with case management goals, comprehensive representation of caseworkers’ work in the data labels, and usability of data labels and data labeling. Each section first describes the caseworkers’ issues with data labeling, then introduces the proposed ideas, and details the perspectives of the caseworkers, managers, and program analysts.
4.1 Aligning Data Collection with Case Management Goals
Caseworkers consider data labeling extra work because they do not perceive its connection to improving client services. Caseworkers understand that data labels serve performance and funding purposes, but consistently expressed that they do not perceive the value of labeling. For example, caseworkers explained, “It is extra work”(C1), “It is not useful or part of my case management”(C4), and “not sure how exactly it is useful, but we were told to do this”(C2). Consequently, it is perceived as a data-focused activity rather than a client-centered activity, unlike case note writing that directly informs caseworkers’ decisions for the clients. Caseworkers, who are driven by a sense of care (C2) for their clients, lack the motivation to engage in tasks that do not directly contribute to client service and rarely participate in labeling. As noted by a manager, caseworkers expressed, “Data input isn’t hard, it is just not fun. You know, so it is hard to motivate yourself to do that so you can justify it in your mind like, oh, that wasn’t a very major interaction, so I’m not going to, you know, spend time logging into the system and logging that interaction because it is going to take more time for me to log this information than the actual conversation” (M2).
Hence, we observed the perceived disconnect between labeling data and providing care to significantly diminish caseworker motivation, affecting data labeling. Drawing from intrinsic and extrinsic motivation, we generated design ideas. We aimed to enhance extrinsic motivation [
25,
28] by directly bringing the value of data labeling to caseworkers’ daily tasks(4.1.1) and improve intrinsic motivation[
19] by increasing awareness about the value of data labeling(4.1.2) as explained in the following subsections.
4.1.1 Find Ways to Make Labeling Useful to Caseworkers.
Our investigation aimed to explore if labels can be utilized by caseworkers in their daily activities to address client needs and, in turn, motivate labeling. Primarily, we ideated ways to enhance identifying specific information from case notes that characterize the clients’ case, such as previous housing options explored. This information could then inform the caseworkers’ subsequent actions, such as recommending potential housing options or following up on past ones.
Our focus on improving information retrieval from case notes is derived from our case note analysis (see section 3.1.2) and caseworker interviews that revealed inefficiencies in the current process. The inefficiencies can be attributed to various factors, including the nuances introduced by free-text case notes, such as significant variations in terminology and abbreviations to convey similar information. For example, terms like "Detox program" and "A****** house program" refer to the same service, while "BC," "Birth Certificate," and "Identity card" are used interchangeably. Furthermore, the absence of specific keywords in documenting certain services results in the loss of valuable information during the search. To cope with these challenges, caseworkers resort to manual and iterative keyword-based searches through free-text notes, which is time-consuming (C1, C4). Manual searching is also prone to errors, leading to instances where caseworkers fail to adequately retrieve previous options explored, hindering their ability to re-evaluate housing or treatment choices effectively.
Our team developed two ideas to align labeling with the caseworkers’ goal of efficient retrieval of past information on the client’s case. These involve providing mechanisms that enable caseworkers to analyze case notes more effectively using labels.
First, we proposed filtering case notes based on data labels to facilitate targeted search (Idea 1) to enable caseworkers to effectively find relevant information and make informed decisions. For instance, When deciding which housing options to explore next, caseworkers can filter case notes by choosing all data labels associated with housing applications to identify case notes with detailed insights on outcomes of past housing options. This aligns data labeling directly with the caseworkers’ goal of better serving their clients.
Second, we proposed a client analytics dashboard that leverages data labels to showcase trends (Idea 2), such as the number of requests and prior outcomes of attempts to receive various services such as housing and mental health treatment. The purpose of this dashboard is to enable caseworkers to identify unique client trends, which will aid in devising personalized strategies to help clients achieve their goals. For instance, a significant number of clients aim to register for mental health programs as they play a crucial role in enhancing overall living stability and the ability to cope with the challenges of housing search. However, the clients’ history, such as criminal background, intensity of substance abuse, or violent behaviors, may hinder their acceptance into these programs. By analyzing trends, such as repeated denials for specific programs, and frequent access to substance abuse treatments, caseworkers can gain valuable insights to reassess their strategies and tailor their efforts to better address individual client needs and circumstances.
Participants in the speed-dating session strongly preferred both of these ideas to address the current inefficiencies in retrieving client case information and trends. They were also perceived to benefit managers by facilitating easy and accurate identification of client case characteristics. A caseworker noted that "If these data labels function better, then there would be more of a push, I think, to record things while the client is there" (C3). As caseworkers discussed additional use cases, they highlighted automated information retrieval for efficient management of client records, such as ensuring meeting specific ID requirements for housing applications and other services. One caseworker highlighted this need, saying, "There’s a limit to how many birth certificates we can order for a client within a year if we could, and as of now, like, I’ll control F search for information. But if we could just like, see how many times we have gotten this birth certificate for this client. You can really see the pattern" (C2). Caseworkers also emphasized how visual representation through a dashboard could quickly highlight trends for walk-in services where there is less time to go through past interactions with the client. For example, identifying clients who repeatedly request the same vital documents due to misplacement allows caseworkers to devise tailored solutions, such as having multiple copies ready to save time and enhance the effectiveness of their assistance (C1, C2, C3). Both program analysts and managers underscored the dashboard’s potential for swiftly analyzing client behaviors and delivering valuable insights, particularly for new caseworkers who often face information gaps on the client in the initial stages.
Program analysts further highlighted the need to implement these solutions to benefit all stakeholders. They stated the dashboard features should facilitate the needs of both caseworkers and managers to avoid any double work, providing an example of IDs ordered attribute to be aggregated not just on the client level to know the trend but also on an organization level to know the funding needs. "I also want to make sure that a manager is able to say, ’How many birth certificates are we getting in total this year because that’s going to change our funding need to ask for money there. I want to make sure that it works for everyone, and they’re not doing double work’" (P2). Managers echoed this sentiment and emphasized the value of an advanced and adaptable search system that enables caseworkers and managers to identify relevant information based on their needs (M2).
4.1.2 Increase Awareness of the Value of Data Labeling:
From an organizational perspective, a constant demonstration of outcomes and service provision (outputs) is essential for securing resources to run efficient services for the client. Aggregated data labels demonstrate the specific outcomes and funding needs. However, caseworkers indicate a lack of this knowledge and emphasize the need to understand the purpose and rationale behind their efforts in labeling. As noted by caseworkers, “What we need to do and why we we’re doing", the answers to these questions. Knowing that it translates into something bigger can help motivate recording the data labels”(C1) and “Realizing that our data labeling has a direct influence on our capabilities, the two are linked”(C5).
To enable an understanding of how data labeling can translate to better services to the clients, we proposed a dashboard with information on the periodic impact of data labels (Idea 3). The dashboard portrays the connection of data labeling to the value provided by presenting a periodic aggregate of labels and the corresponding effects on the organization’s funding and recognition by other entities. This can be achieved by presenting number of clients referred to the organization over time. For instance, consider displaying the total count of state IDs the organization has successfully obtained over time, and the subsequent increase in the number of clients referred to the organization for State IDs. This demonstrates that caseworkers can showcase their proficiency in efficiently processing State IDs by consistently recording IDs issued, a task often challenging due to the need for collaboration with multiple departments. The recognition can prompt other organizations to refer more clients, expanding access to suitable services for a larger group of people, and in turn, enhances program efficiency as caseworkers can focus on services in which they excel.
All participants agreed with the need to demonstrate the value of data labeling. However, caseworkers had mixed opinions on using dashboards as a form of communication. While one expressed curiosity to learn about the outcomes through dashboards (C2), others either preferred dashboards for predicting trends and allocating resources effectively (C3) or were not sure if they would actively refer to a dashboard to verify impact (C4). However, they emphasized the need for transparency (C1, C4) and sought this information through staff training on data labels (Idea 8), “I know I personally value transparency. I want to know what you know. Why my work, like, why, these things are important, and you know, and of course, I have a general idea of why. When we’re actually able to tie real outcomes, too. Okay, this. This led to x amount of funding, or we got, you know, a,b,c,d,e, from this, and it also informs us honestly, like where we need to focus more and less as well. It is another way to figure out gaps in services” (C4). The same caseworker highlighted periodic training, “This is an opportunity for folks to see like, Oh, well, this is the impact we’re having, like when we’re doing this. And this is how it is directly tied. I think, the more tied to this data people are, or the more invested in it, the more likely they’re going to utilize it” (C4). Program analysts, however, highly believed the dashboard was the most efficient in conveying the impact to caseworkers.
Managers proposed that a dashboard could also be quite useful for them to gain insights into the client case progress and, in turn, the caseworkers’ current workload. For instance, a manager explained how knowing the trends and current client case status can help allocate caseloads - “You know how many clients are currently unhoused, because we know those who are unhoused typically are much more high need, particularly with the conditions that exist in our community around housing and affordable housing. There’s a lot of effort that has to be put in by their caseworkers to identify housing. You’re gonna get a lot of doors shut your face like no, no options available, no options available. And then, once you have someone that can help, the load reduces. So kind of, you know, gauge-like, where’s someone’s level of effort? - needs to be evaluated of how that can be adjusted to help create more equilibrium" (M2).
4.2 Comprehensive Representation of Caseworkers’ Work in the Data Labels
The caseworkers are dissatisfied with the current data labels as they consider them inadequate in representing their work. Since the data label aggregates are used by the City and other collaborators for the organization’s performance evaluation, it is crucial to accurately capture all casework for appropriate recognition. This recognition leads collaborators to refer more clients, improving access to services for individuals in need. Moreover, it allows caseworkers to concentrate on services they excel at, ultimately enhancing the program’s efficiency.
Current data labels encompass measures required by funding agencies, such as counts of IDs processed to accommodate application fees and in-office client visits to facilitate office utilities. Data labels also include measures for standard performance reports by the City, such as counts of clients successfully housed, clients contacted, and mental health treatments obtained. However, caseworkers and managers stated that they fail to efficiently capture other crucial aspects of casework that impact clients’ housing stability and well-being. Data labels only portray end results such as housed or income acquired, overlooking case complexities, leading to a perception of low organizational performance (M1). For instance, securing stability for clients with behavioral issues demands extra effort to ensure task completion and maintain progress.
Similarly, the scale of coordination required with multiple entities is not considered, “I might deal with four different agencies in 30 min and send referrals and continue care stuff, all of that work get lost as far as getting captured"(C4). Moreover, data labels do not capture other casework outputs such as obtaining identity documents, these are crucial for job applications or accessing public resources for stable living. "I asked why we only captured the two, birth certificates, and State IDs. I know we don’t pay for social security cards. But we’re spending a ton of time ordering those" (C1). By using these drawbacks identified in interviews regarding the current labels, we created design ideas. These ideas focused on capturing complexities(4.2.1) and sharing the changing relevance of labels(4.2.2) with management, as explained in the following subsections. We aimed to address the perceived dissatisfaction of caseworkers with labels representing their work, to improve data labeling.
4.2.1 Enable Capturing of Casework Complexities.
Caseworkers are concerned that data labels reduce their work to numbers that do not represent the complexities of client cases they manage (C1, C4). For instance, they explain how capturing just the total number of housed or unhoused clients fails to consider the varying amount of work completed for them. Clients with criminal histories often face limited housing options, while clients with severe substance abuse issues require treatments to qualify for housing applications. "One case took two years of work and 40 housing applications while the other took 3 months of work and two applications - if you didn’t have the steps caught accurately, then it will just be a number, this person is housed, and that person not" (C4).
We proposed redesigning the data labels with increased granularity (Idea 4) to provide a more accurate representation of the casework by capturing the caseworkers’ efforts that indicate the case complexities. For instance, adding a data label to record the counts of people or entities contacted for a service can capture the scale of coordination needed. "If they talk to three different agencies in one interaction with the client, then that should count as 3 collateral contact instances, not just one. A recent extremely medical fragile client I had, involved me all week communicating with his emergency room doctor, a nurse at the emergency room, his partner, his insurance company, and potential nursing homes” (C4). Additionally, data labels that record the number of attempts made for housing before success, and mental health treatments sought before acceptance, can also account for the complexities of the casework.
Overall all the participants agreed with this idea. Caseworkers strongly resonated with the need to capture their casework efforts. They also considered granular labels to enhance the utility of filtering through case notes (Idea 1) and client analytic dashboards (Idea 2) with more precision. For instance, with an additional filter to distinguish between successful and unsuccessful housing applications, caseworkers can quickly identify areas of improvement from past applications. Managers found granular labels valuable for conveying the organization’s challenges to funders, the City, and collaborators, by enabling them to build a narrative backed by quantitative evidence on the extent of denials and efforts made to succeed in client goals (M1, M2). However, there were concerns about misrepresenting certain efforts captured without evaluating the underlying reasons behind the numbers (P2, M2). For instance, directly interpreting the total number of denials or total time spent on housing applications as measures of higher complexity, "an application can also be denied for logistic errors like missing the submission of a required document" (P2).
In addition to granular labels to demonstrate complexities, managers and program analysts also asserted the need to reform certain labels into broader categories. This is to keep the labels concise and simplify the labeling process for caseworkers. "How granular, you know, do we go? Are we losing any efficiencies by having ten options... Where’s the sweet spot?" (M2). For instance, they suggested combining different food coupons provided into a single label, as only the aggregate amount of food coupon requests is sufficient to allocate the budget for food support (P2). Similarly, all non-funded IDs could be combined into a single category called "IDs" instead of a list (M2).
Besides investigating the quantification of caseworker efforts to capture case complexities, we also explored enhancing access to qualitative case note information for managers to identify case complexities. We proposed standardizing case notes to access contextual information underlying the labels (Idea 5), making detailed case information easily available to others. With better access to qualitative case note data, managers can incorporate the reasons behind the aggregated data label numbers to represent case complexities.
All participants desired case note content to be accessible for comprehensive communication of casework for performance and funding reports. However, caseworkers raised concerns that over-restricting case note writing, generally performed during a client interaction, could distract their engagement with the client making it more data-centered. They emphasized the importance of maximizing time and attention to clients during interactions and proposed implementing overarching guidelines with flexibility. Additionally, a program analyst considered training as a more effective method for standardization over guidelines (P1), while the other asserted the usefulness would depend on designed guidelines (P2). Caseworkers stressed that standardized case note guidelines would help new caseworkers grasp what information to include to effectively inform their next steps in serving the clients. Caseworkers from diverse backgrounds have varied note-writing approaches, resulting in differences in the level of detail and types of information in their notes. For instance, one caseworker initially focused heavily on noting specific services provided to the client but later recognized the importance of recording their assessment of the client’s state in their notes, such as the client’s attitude and mental state. "I leaned in really hard on the very specific services I had provided, and really just including that, and I realized I was actually not seeing folks completely as people for a period of time. Not that I didn’t care about them, and I wasn’t trying to help them, but I wasn’t looking beyond processing a referral to see that you know they didn’t really seem to be doing too well" (C3).
Furthermore, managers believed that having access to detailed casework information would help identify program inefficiencies by providing contexts, such as for delays in treatments or extended periods of homelessness. "It would be so much more efficient for those who are coming in and trying to get a snap of what’s going on with this call" (M1).
4.2.2 Provide Ways for Shared Creation of Data Labels by Management and Workers.
Periodic assessment of data labels is essential to ensure they comprehensively capture changes in casework. Caseworkers’ work may evolve over time due to new client needs, such as voter registration, or organizational changes, such as new collaborations on new treatment programs or housing services. Without specific labels representing these tasks, the work done would go unnoticed during performance evaluation. For example, caseworkers highlighted the need for additional data labels for obtaining social security cards, driver’s licenses, and voter registrations. These are frequently handled tasks demonstrating their workload and expertise. Caseworkers noted, "These data labels will show that we’re working on them often, and how skilled we are in it” (C5), "Oh, my God, I’m waiting on like a 1,000 social security cards to come in right now. Both walk-in clients and my own case managed clients. And yet there is not a social security card drop-down" (C3). Only some caseworkers approached management on adding a social security card label, while other labels were never brought to management’s attention, indicating a lack of communication on updates to data labels.
To ensure the data labels are capturing current casework, we proposed to streamline the addition and subtraction of data labels (Idea 6) through a digital channel. This allows caseworkers to propose new labels, which managers can review to update the labels list. Managers can also gather quick feedback on data label changes from caseworkers to aid their decisions.
All participants unanimously supported streamlining data label creation. As one caseworker noted, "When a new need comes up and caseworkers are putting so much effort into it, being able to add immediately would be awesome" (C2). Program analysts encouraged the idea but were uncertain if caseworkers would actively suggest labels, indicating the need for testing. Managers underscored streamlining to also help preserve knowledge on past data labels, such as their definitions, intended purpose, and reasons for removal. This information is considered essential to compare aggregated values and assess differences over time (M2).
4.3 Usability of Data Labels and Data Labeling
Caseworkers highlighted two main usability concerns with data labeling. Firstly, they found the process of identifying relevant labels from a long list time-consuming, hampering client interaction time. Secondly, label meanings were considered ambiguous resulting in uncertainty when choosing them for client interactions, leading to inconsistency and abandonment of labeling (C3, C5). For example, the "birth certificate" label was associated with multiple instances such as when it was ordered, denied, and successfully received. This caused multiple entries for a single order, leading to inaccurate funding calculations for their application fees. We aimed to address the usability concerns hindering caseworker’s data labeling and its accuracy. Drawing from the major pain points identified from interviews on labeling efficiency(4.3.1) and label clarity(4.3.2), we generated design ideas as explained in the following subsections.
4.3.1 Tools to Efficiently Identify Relevant Labels:
Manual omissions in labeling were found to be commonplace. These omissions are attributed to caseworkers lacking sufficient time to assess each label for relevance (C4, M1) and remember everything that should be recorded. One caseworker stated, "There are instances you feel like you have missed out on data labels that you want to record, but it is either too time-consuming to go through all of them, or you forget in that instant" (C1). When client interactions get lengthy or highly active, caseworkers often struggle to remember and select all the relevant data labels corresponding to the performed activities. To efficiently identify relevant labels, we proposed four ideas.
First, an AI tool that analyzes current case note content and suggests appropriate labels (Idea 12) to guide caseworkers on labeling. Caseworkers commended the utility of AI recommendations. However, they stressed the importance of autonomy in choosing the final labels. They also opposed interrupting features such as continuous reminders or pop-ups that could impede their interaction with the client. Managers and program analysts encouraged this idea but cautioned against caseworkers’ over-reliance on AI recommendations which are prone to inaccuracies (M1, P2).
Second, displaying clients’ most frequent and last interaction data labels (Idea 13). This aimed to facilitate caseworkers to quickly assign relevant labels from past interactions. Caseworkers acknowledged the utility and believed it could reveal important client case characteristics. For instance, a frequent "no show" label for a client indicates that the client frequently misses appointments, prompting the caseworker to provide additional reminders. Additionally, labels from the previous interactions could help review recently explored options and identify potential next steps, "Hey? Let’s, you know, go in this direction, or it looks like this has been tried. But how do you feel about going this other way with” (C4).
Third, displaying labels that have been rarely or never assigned(Idea 15). Caseworkers intuitively prioritize assigning some labels over others. Showing rarely assigned labels could encourage reviewing all the labels for relevance. Overall, this idea elicited mixed responses among caseworkers. Some appreciated that it increases awareness of underutilized labels (C1, C4), while there were concerns that exposing the low-assigned labels per caseworker to everyone could induce performance pressure for specific individuals. Managers recognized the idea’s benefit in addressing caseworkers’ lack of awareness of specific existing labels. "I think that solves the issue of staff members not being aware that certain items may exist, and I think over time staff develop a blind spot" (M2).
Fourth, providing reminders on data labeling objectives (Idea 14), with an option to edit assigned labels to promote accuracy and completeness. Caseworkers had a neutral stance on this idea in influencing their labeling but accepted it as long as it didn’t impede their work. They preferred having a clear understanding of data labels (4.3.2) and their utility (4.1.2).
In addition to efficiently identifying relevant labels, we evaluated ideas addressing specific navigation issues in their interface. Data labeling significantly reduced when caseworkers transitioned to a new system. This decline was attributed to the new interface that groups data labels into drop-downs requiring multiple clicks to browse through the labels, in contrast to the old system’s easier browsing through a single-page labels list. We presented two ideas to facilitate easier navigation, a search feature to find specific labels (Idea 10), reducing the need to browse through lists. And visual feedback to navigate through the data labels (Idea 11), such as presenting assigned and unassigned labels separately, to enable reviewing and ensure completeness.
4.3.2 Clarify Label Meanings:
Caseworkers expressed the need for clearer data label definitions to judge relevance. There are also numerous redundant data labels with overlapping meanings in the existing list, a consequence of a lack of cross-checking between new and old labels (C1, C4). For example, four labels in the current list represent a similar client-caseworker contact type during an interaction. The "client contact" label, which represents any contact with the client subsumes "direct contact," which is in-person contact with the client. Further, the "direct contact" includes "direct contact in office" and "direct contact out of office." Caseworkers stressed the need to remove redundant labels to improve usability. "I would like the system to be more streamlined, with redundant and useless options removed. I don’t want to go through a list of many labels trying to figure out which one is appropriate" (C4). Hence, we proposed three ideas to improve clarity in data labels.
First, an AI tool to identify redundant data labels (Idea 9) to enhance the process of identifying and refining the labels. The AI tool will regularly analyze all case notes and corresponding labels assigned to identify potential redundancies. It does so by analyzing patterns, such as different labels assigned for similar case note content.
The second is to provide one-click access to data label definitions with appropriate examples within the interface where the labels are assigned. This instant access to the data label definitions and examples of labeling (Idea 7) could foster a common understanding among the caseworkers.
The third idea is based on the caseworkers’ suggestion to provide periodic training sessions on data labeling (Idea 8) to caseworkers. These training sessions serve as opportunities to clarify ambiguous labels and ensure that caseworkers have a better understanding of labeling.
In general, all participants strongly preferred these ideas to enhance caseworkers’ understanding and accuracy in data labeling. A manager noted that removing redundancy is one of their current goals (M2). Discussion around using AI for this purpose was, however, limited, likely due to unfamiliarity with AI’s functionality to identify redundant labels. However, one caseworker with a technical background expressed enthusiasm citing the example of an AI tool that suggests combining music albums based on their content similarity (C3). Caseworkers considered definitions as valuable tools for newer caseworkers who may often feel uncertain about appropriate labels. As one caseworker stated, "I’m killing it on labeling. You know the right data labels. And then I just don’t know. I hit a slump, where I’m like say ’direct contact’, and that’s all I get. And then I start feeling confused about what I should be labeling" (C3). Managers considered access to definitions to empower caseworkers by reducing reliance on management for clarification (M2).