HOW TO SCHEDULE DATA CLEANING FUNCTIONS

Introduction to Scheduling Functions

Automate your data cleaning by scheduling Flookup's powerful functions to run on their own. This feature uses Google Apps Script triggers to keep your datasets consistently clean and up-to-date, saving you time and reducing errors with minimal manual effort.

You can set core operations like Fuzzy Match and Extract Unique Values to run at flexible intervals, from once an hour to once a month, perfectly tailoring the automation to your workflow. Before scheduling, it is best to run the function manually on a small sample of your data to find the optimal parameters.


How to Schedule a Function

  1. Open the scheduling sidebar
    Navigate to Extensions > Flookup Data Wrangler > Schedule data cleaning functions in your spreadsheet menu.
  2. Select the main function
    From the top dropdown menu, choose the primary operation you want to automate. The form will dynamically update to show relevant options. The available functions are:
    • Fuzzy match by percentage: Finds the best match in a table based on text similarity.
    • Fuzzy match by sound: Finds matches based on phonetic similarity i.e. how words sound.
    • Extract unique values by percentage: Creates a unique list from a column based on text similarity.
    • Extract unique values by sound: Creates a unique list based on phonetic similarity.
    • Standardize text entries: Cleans text by removing stop words, punctuation or diacritics.
    • Compare string similarity: Calculates the similarity score between two columns of text.
  3. Choose the processing mode
    • Process data to the end: The task will run until all rows in your input range are processed and then it will stop automatically.
    • Process data in a loop: The task will process the input range and then repeat the process on the next scheduled run. This is ideal for data that is constantly updated.
  4. Set the required ranges and parameters
    For range inputs e.g. Lookup_values, Table_values, Input range, first select the desired range directly in your Google Sheet. Then, click the corresponding Grab selected range button in the sidebar to populate the input field.
    Adjust other parameters that appear, such as:
    • Lookup_column: Specify the column index e.g. "1" in your Table_values range to search for a match.
    • Return_column: Specify the column index in your Table_values range from which to return a result.
    • Stop_array: Optionally provide a cell range containing custom words or punctuation that you want to remove during standardization.
    • Threshold: Set the minimum similarity score, from "0.0" to "1.0", required for a match between the lookup values and the table values.
  5. Set the schedule frequency and details
    • Click HOURLY or DAILY to set the basic interval.
    • For daily schedules, set the approximate Time of day and the number of days between runs e.g. "1" for every day.
    • For hourly schedules, set the number of hours between runs e.g. "6" for every 6 hours.
  6. Lock the output position
    Select the cell in your Google Sheet where you want the results to start. Then, click the Grab selected cell button in the sidebar to lock this output position for each run.
  7. Schedule the function
    Click Schedule to create the trigger and close the sidebar.

Notes About Scheduling Functions


Issues You Might Encounter


For the Visual Learners

Labels might differ slightly but the steps are the same.


You Might Also Like