Sequence Submission Service

Overview

The Sequence Submission service allows users to validate and submit virus sequences to NCBI GenBank. User-provided metadata and FASTA sequences are validated against the GenBank data submission standards to identify any sequence errors before submission. Sequences are also annotated using the VIGOR4 and FLAN annotation tools for internal use by users. The service provides a validation report that the user should review before submitting the sequences to GenBank.

Note: The current pipeline supports only Influenza A, B, and C viruses.

Using the Sequence Submission Service

The Sequence Submission submenu option under the “TOOLS & SERVICES” main menu (Genomics category) opens the Sequence Submission Service input form. Note: You must be logged into BV-BRC to use this service.

Sequence Submission menu option

Parameters

Below is a screenshot of the Sequence Submission input form, as well as a summary of customizable parameters.

Sequence Submission Input Form

Submitter Information

  • First Name: The first name of the submitter (Required)

  • Last Name: The last name of the submitter (Required)

  • Email: The email address of the submitter (Required)

  • Institution/Affiliation: The institution/affiliation information of the submitter (Required)

  • Country: The country information of the submitter (Required)

  • Phone Number: The phone number of the submitter

  • Consortium: The consortium information of the submitter

Sequence FASTA File

  • Enter Sequence: Paste one or more sequences in FASTA format.

  • Select FASTA File: Choose a FASTA file that has been uploaded to the Workspace.
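
Before pasting or uploading sequences, it can help to run a quick local sanity check on the FASTA text. The following is a minimal sketch only: the function names `parse_fasta` and `check_fasta` and the set of accepted characters are assumptions made for illustration, and the checks performed by the service itself may be stricter or different.

```python
from typing import List, Tuple

# Characters accepted by this sketch (IUPAC nucleotide codes plus gap).
# This set is an assumption; the service's own rules may differ.
ALLOWED = set("ACGTURYKMSWBDHVN-")


def parse_fasta(text: str) -> List[Tuple[str, str]]:
    """Return (identifier, sequence) pairs from FASTA-formatted text."""
    records: List[Tuple[str, str]] = []
    header = None
    chunks: List[str] = []
    for raw in text.splitlines():
        line = raw.strip()
        if not line:
            continue  # ignore blank lines
        if line.startswith(">"):
            if header is not None:
                records.append((header, "".join(chunks)))
            # Identifier = first whitespace-delimited token after '>'
            parts = line[1:].split()
            header = parts[0] if parts else ""
            chunks = []
        elif header is None:
            raise ValueError("sequence data found before the first '>' header")
        else:
            chunks.append(line.upper())
    if header is not None:
        records.append((header, "".join(chunks)))
    return records


def check_fasta(text: str) -> List[str]:
    """Return a list of human-readable problems (empty if none were found)."""
    problems: List[str] = []
    seen = set()
    for ident, seq in parse_fasta(text):
        if not ident:
            problems.append("record with empty identifier")
        if ident in seen:
            problems.append(f"duplicate identifier: {ident}")
        seen.add(ident)
        if not seq:
            problems.append(f"{ident}: empty sequence")
        bad = sorted(set(seq) - ALLOWED)
        if bad:
            problems.append(f"{ident}: unexpected characters {''.join(bad)}")
    return problems
```

For a file on disk, something like `check_fasta(open("my_sequences.fasta").read())` would return the list of problems; the file name here is a placeholder. An empty list only means that these basic checks passed, not that the sequences will pass GenBank validation.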

Metadata File

Choose a metadata file (CSV format) that has been uploaded to the Workspace.
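
A common source of errors is a mismatch between the sequences and the metadata rows. The sketch below compares the identifiers in a FASTA text with the values of one column of a metadata CSV. It rests on assumptions that this guide does not confirm: that the metadata file contains a column holding the same identifiers used in the FASTA headers, and that you know that column's name. The function names and the `id_column` parameter are hypothetical.

```python
import csv
import io
from typing import Dict, List


def fasta_ids(fasta_text: str) -> List[str]:
    """Collect the first token of every '>' header line."""
    ids = []
    for line in fasta_text.splitlines():
        if line.startswith(">"):
            parts = line[1:].split()
            ids.append(parts[0] if parts else "")
    return ids


def metadata_ids(csv_text: str, id_column: str) -> List[str]:
    """Collect the values of `id_column` from a CSV with a header row."""
    reader = csv.DictReader(io.StringIO(csv_text))
    if reader.fieldnames is None or id_column not in reader.fieldnames:
        raise KeyError(f"column {id_column!r} not found in metadata header")
    return [(row[id_column] or "").strip() for row in reader]


def compare_ids(fasta_text: str, csv_text: str, id_column: str) -> Dict[str, List[str]]:
    """Report identifiers present on one side but not the other."""
    in_fasta = set(fasta_ids(fasta_text))
    in_meta = set(metadata_ids(csv_text, id_column))
    return {
        "missing_metadata": sorted(in_fasta - in_meta),  # sequence without a row
        "missing_sequence": sorted(in_meta - in_fasta),  # row without a sequence
    }
```

Passing the column name explicitly keeps the sketch independent of the actual metadata template, which should be taken from the service itself rather than from this example.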

Output Folder

Folder in the Workspace where you want the results stored.

Output Name

Name you provide to identify the results in the Workspace.

Buttons

  • Reset: Resets the input form to default values

  • Annotate and Validate Sequence(s):

    • If validation of the input data finds no errors, the job is launched. A message will appear below the box to indicate that the job is now in the queue.

Success Message

    • If validation of the input data finds one or more errors, the user is warned. A message will appear below the box to indicate that there are errors.

Error Message

Output Results

Clicking on the Jobs indicator at the bottom of the BV-BRC page opens the Jobs Status page, which displays all current and previous service jobs and their status.

Job Indicator

Once the job has completed, select it by clicking on it, then click the “View” button on the green vertical Action Bar on the right-hand side of the page to display the results files (red box).

Job List

Results Page

The results page will consist of a header describing the job and a list of output files, as shown below.

Job Results

The Sequence Submission Service generates several folders and files that are deposited in the Private Workspace in the designated Output Folder. These include:

  • input.fasta - The FASTA file that was submitted

  • metadata.csv - A comma-separated values (CSV) file containing the metadata for the submitted sequences

  • Sequence_Validation_Report.csv - A CSV file of validation results for all sequences, allowing users to review the segment, serotype, status, and messages determined by VIGOR4 and FLAN for each sequence identifier

  • submission.xml - An XML file identifying the submission

  • submission.zip - A compressed file that includes all the required submission files

  • submit.ready - A submission-ready flag for GenBank

  • .aln - Alignment of predicted protein(s) to reference, and reference protein to genome

  • .cds - A FASTA file of predicted CDSs

  • .gff3 - Lists all the features of the genome in General Feature Format (GFF3 is the most recent version of GFF)

  • .pep - A FASTA file of predicted proteins

  • .rpt - A summary file of program results

  • .tbl - Predicted features in GenBank tbl format

  • .report - A report file generated by FLAN
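
After downloading Sequence_Validation_Report.csv, a short script can help with reviewing it before submission. The sketch below counts the values found in one column and pulls out the rows with a given value. The exact column headers of the report are not listed in this guide, so the column name is passed in as a parameter; the function names and any column names used with them are assumptions for illustration.

```python
import csv
import io
from collections import Counter
from typing import Dict, List


def summarize_report(csv_text: str, status_column: str) -> Counter:
    """Count how many rows carry each value in `status_column`."""
    reader = csv.DictReader(io.StringIO(csv_text))
    if reader.fieldnames is None or status_column not in reader.fieldnames:
        raise KeyError(f"column {status_column!r} not found in report header")
    return Counter((row[status_column] or "").strip() for row in reader)


def rows_with_status(csv_text: str, status_column: str, wanted: str) -> List[Dict[str, str]]:
    """Return the rows whose `status_column` equals `wanted`."""
    reader = csv.DictReader(io.StringIO(csv_text))
    return [
        dict(row)
        for row in reader
        if (row.get(status_column) or "").strip() == wanted
    ]
```

To use it, read the downloaded file into a string, for example with `open("Sequence_Validation_Report.csv", newline="").read()`, and pass the header name that the report actually uses for the status field.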

Submission Button

Click the Submit Sequence button in the top right of the page to notify the BV-BRC team that the submission is ready.

Submit Button

Action Buttons

After selecting one of the output files by clicking it, a set of options becomes available in the vertical green Action Bar on the right side of the table. These include:

  • Hide/Show: Toggles (hides or shows) the right-hand side Details Pane.

  • Guide Link: Opens the corresponding Quick Reference.

  • Download: Downloads the selected item.

  • View: Displays the content of the file, typically as plain text or rendered html, depending on filetype.

  • Delete: Deletes the file.

  • Rename: Allows renaming of the file.

  • Copy: Copies the selected items to the clipboard.

  • Move: Allows moving of the file to another folder.