Now that your datasets have been aligned with participant and time information, and cohort information has been extracted, you can start to take advantage of the integrated data. In this step you will create grids and visualizations based on the aligned data.

Visualize Cohort Performance

LabKey makes it easy to incorporate cohort categories into visualizations. To see this, we will create a visualization that compares blood cell counts across the cohort groups.

  • Click the Clinical and Assay Data tab and click the Lab Results dataset.
  • Click the header for the CD4+ column and select Quick Chart.
  • LabKey Server makes a "best guess" visualization of the column: in this case, a box plot showing the distribution of values, organized by cohort group.
  • Notice that you are now in the main plot editor and could change the chart type and layout as you like.
  • Optionally, you can click Save, name the plot "CD4 Counts by Cohort", and click Save.
  • If you save this chart, click the Clinical and Assay Data tab. Notice that the chart has been added to the list.
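
To make concrete what a box plot grouped by cohort summarizes, here is a minimal Python sketch. It is not LabKey code: the rows, field names, and the "Cohort B" label are hypothetical stand-ins for the Lab Results grid, and it reports only the minimum, median, and maximum per cohort (a full box plot also draws quartiles).

```python
from collections import defaultdict
from statistics import median

# Hypothetical rows standing in for the Lab Results grid; all values are invented.
rows = [
    {"participant": "P1", "cohort": "HIV negative", "cd4": 900},
    {"participant": "P2", "cohort": "HIV negative", "cd4": 1000},
    {"participant": "P3", "cohort": "HIV negative", "cd4": 1100},
    {"participant": "P4", "cohort": "Cohort B", "cd4": 400},
    {"participant": "P5", "cohort": "Cohort B", "cd4": 500},
    {"participant": "P6", "cohort": "Cohort B", "cd4": 700},
]


def summarize_by_cohort(rows):
    """Group CD4 values by cohort and return min, median, and max per cohort."""
    groups = defaultdict(list)
    for row in rows:
        groups[row["cohort"]].append(row["cd4"])
    return {
        cohort: {"min": min(vals), "median": median(vals), "max": max(vals)}
        for cohort, vals in groups.items()
    }


summary = summarize_by_cohort(rows)
for cohort, stats in summary.items():
    print(cohort, stats)
```

Each dictionary entry corresponds to one box in the chart: the cohort is the category on the X axis and the summary values describe the spread of CD4+ counts within it.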

Create Data Grids from Combined Datasets

What if you want to compare two measurements that are in different datasets? For example, is there a relationship between CD4 cell counts (from the "Lab Results" spreadsheet) and the blood pressure measurements (from the "Physical Exam" spreadsheet)? Now that the spreadsheets have been imported to the LabKey study as datasets, we can easily create a combined, or joined, view of the two.

  • Go to the Clinical and Assay Data tab.
  • Navigate to one of the datasets to be combined. In this case, click the Lab Results dataset.
  • Select (Grid Views) > Customize Grid.
  • In the Available Fields panel, scroll down and open the node DataSets by clicking the expand icon next to it. (Notice that DataSets is greyed out, as are the names of the datasets themselves. This only means that these are not "fields" or columns to select for display, but nodes you can open in order to select the columns they contain.)
  • Open the node Physical Exam.
  • Check the boxes for Systolic Blood Pressure and Diastolic Blood Pressure. This indicates that you wish to add these columns to the grid you are viewing. Notice they are added to the list in the Selected Fields panel.
  • Click Save.
  • In the Save Custom Grid View dialog, select Named and enter "Lab Results and Physical Exam Merged". Click Save.
  • You now have a joined data grid containing columns from both the Lab Results and Physical Exam datasets. Notice the view name in the header bar. Also notice that this new grid is added to the Clinical and Assay Data tab.
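
Conceptually, the joined grid behaves like a join between the two datasets. The Python sketch below illustrates the idea under the assumption that rows are matched on participant and visit, and that every Lab Results row is kept even when no Physical Exam row matches. The field names and values are hypothetical, and this is not how LabKey implements the join internally.

```python
# Hypothetical rows; field names and values are invented for illustration.
lab_results = [
    {"participant": "P1", "visit": 1, "cd4": 950},
    {"participant": "P1", "visit": 2, "cd4": 980},
    {"participant": "P2", "visit": 1, "cd4": 420},
]
physical_exam = [
    {"participant": "P1", "visit": 1, "systolic": 118, "diastolic": 76},
    {"participant": "P2", "visit": 1, "systolic": 131, "diastolic": 84},
]


def join_on_participant_visit(left, right):
    """Left join: keep every left row, adding right-hand columns when the key matches."""
    index = {(r["participant"], r["visit"]): r for r in right}
    joined = []
    for row in left:
        match = index.get((row["participant"], row["visit"]), {})
        merged_row = dict(row)
        # Missing measurements become None, like blank cells in a grid.
        merged_row["systolic"] = match.get("systolic")
        merged_row["diastolic"] = match.get("diastolic")
        joined.append(merged_row)
    return joined


merged = join_on_participant_visit(lab_results, physical_exam)
for row in merged:
    print(row)
```

In this sketch the second visit for P1 has no physical exam, so its blood pressure columns are empty while its CD4 value is still shown.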

Plot Combined Data

  • Confirm you are viewing the combined grid view named "Lab Results and Physical Exam Merged".
  • Select (Charts) > Create Chart. This opens the chart designer where you can create and customize many types of charts. The Bar plot type is selected by default.
  • In the left panel, click Scatter.
  • For the X Axis, select CD4+ (cells/mm3) by dragging it from the list of columns on the right into the box. This column came from the "Lab Results" dataset.
  • For the Y Axis, select Systolic Blood Pressure. This column came from the "Physical Exam" dataset.
  • Click Apply.
  • Click Save, name the report "CD4+ Counts vs. Blood Pressure", and click Save in the popup.
  • Your chart will now be added to the Clinical and Assay Data tab.
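
A scatter plot shows a relationship visually. If you also want a single number for it, one common choice (not part of this tutorial's steps) is the Pearson correlation coefficient. The sketch below computes it in plain Python from (x, y) pairs; the `pearson` helper and the sample pairs are hypothetical, chosen only to show the calculation.

```python
from math import sqrt


def pearson(pairs):
    """Pearson correlation coefficient for a list of (x, y) pairs."""
    n = len(pairs)
    mean_x = sum(x for x, _ in pairs) / n
    mean_y = sum(y for _, y in pairs) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in pairs)
    spread_x = sqrt(sum((x - mean_x) ** 2 for x, _ in pairs))
    spread_y = sqrt(sum((y - mean_y) ** 2 for _, y in pairs))
    return cov / (spread_x * spread_y)


# Invented (CD4, systolic) pairs; rows with a missing value are skipped first.
raw_pairs = [(400, 135), (600, 128), (800, None), (1000, 115)]
complete = [(x, y) for x, y in raw_pairs if x is not None and y is not None]
r = pearson(complete)
print(round(r, 3))
```

Values near 1 or -1 indicate a strong linear relationship and values near 0 indicate little or none; the function fails with a division by zero if either column is constant.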

Plot Trends Over Time

A key part of longitudinal study research is identifying trends in data over time. In this example, we will create a chart that shows cell counts for the cohorts over the course of the study.

  • Click the Clinical and Assay Data tab.
  • Click the Lab Results dataset.
  • Select (Charts) > Create Chart.
  • Click Time.
  • Notice there are no columns to plot.

Time charts can only plot columns that have been explicitly marked as "measures". To mark them, edit the dataset definition:

  • Click Cancel to leave the plot editor and return to the grid for Lab Results.
  • Click Manage in the header bar of the grid, then click Edit Definition.
  • Scroll down to the Dataset Fields, and select the CD4+ field by clicking inside its Name or Label text box.
  • The field properties editor (a set of tabs) will appear to the right.
  • Select the Reporting tab, and check the box next to Measure.
  • To increase your charting options, repeat for the other two fields (Lymphocytes and Hemoglobin). Notice that switching rows leaves the same tab open in the property editor. Notice also that a wrench icon marks any changed rows.
  • Scroll up and click Save.

Now we are ready to plot cell counts on a time chart.

  • Click View Data to return to the "Lab Results" data grid.
  • Select (Charts) > Create Chart.
  • Click Time again. Notice now that the columns you marked as measures are listed on the right.
  • Drag the CD4+ column to the Y Axis box.
  • Click Apply.

By default, the time chart plots the first five individual participants in the data grid. To compare trends in the cohorts instead:

  • Click Chart Layout, where you can customize the look and feel of the chart.
  • Change the Subject Selection to Participant Groups.
  • Click Apply.

The chart now shows four lines, one for each cohort. A clear trend emerges (in our fictional data): CD4+ levels in the HIV negative cohort generally stay flat over time, while the other cohorts vary over the 24 months studied.

  • Click Save.
  • Name your time chart "CD4 Trends" and click Save in the popup.
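
The cohort lines in the time chart can be thought of as one series per group. The Python sketch below assumes each point is the average CD4+ value across a cohort's members at a given time point; the aggregation LabKey actually applies is not stated here, and the rows, field names, and "Cohort B" label are hypothetical.

```python
from collections import defaultdict
from statistics import mean

# Hypothetical longitudinal rows; all values are invented.
rows = [
    {"participant": "P1", "cohort": "HIV negative", "month": 0, "cd4": 1000},
    {"participant": "P2", "cohort": "HIV negative", "month": 0, "cd4": 900},
    {"participant": "P1", "cohort": "HIV negative", "month": 12, "cd4": 990},
    {"participant": "P2", "cohort": "HIV negative", "month": 12, "cd4": 910},
    {"participant": "P3", "cohort": "Cohort B", "month": 0, "cd4": 500},
    {"participant": "P3", "cohort": "Cohort B", "month": 12, "cd4": 350},
]


def cohort_series(rows):
    """Return {cohort: [(month, mean CD4), ...]} with points ordered by month."""
    buckets = defaultdict(list)
    for row in rows:
        buckets[(row["cohort"], row["month"])].append(row["cd4"])
    series = defaultdict(list)
    # Sorting the (cohort, month) keys keeps each cohort's points in time order.
    for (cohort, month), values in sorted(buckets.items()):
        series[cohort].append((month, mean(values)))
    return dict(series)


series = cohort_series(rows)
for cohort, points in series.items():
    print(cohort, points)
```

Each list of (month, value) points corresponds to one line on the chart. In this invented data the "HIV negative" series stays flat while "Cohort B" declines.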
