GENSCAN 1.0	Date run: 4-Nov-116	Time: 10:04:55

Sequence gi568815578r:17842360_18068425 : 226066 bp : 46.46% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Init +   6525   6593   69  2  0   70   96    54 0.210   3.45
 1.02 Intr +  17705  17864  160  1  1  116  -12   134 0.017   5.76
 1.03 Intr +  27402  27523  122  1  2  102  100     0 0.110   2.81
 1.04 Intr +  27805  28039  235  0  1   91   98    55 0.249   4.06
 1.05 Intr +  43743  44015  273  1  0   76   84   158 0.513  11.71
 1.06 Term +  52841  53172  332  0  2   93   41    98 0.446   0.42
 1.07 PlyA +  54721  54726    6                               1.05

 2.00 Prom +  62391  62430   40                              -5.56
 2.01 Init +  69683  69774   92  1  2   64   77    76 0.422   4.06
 2.02 Intr +  86573  86656   84  2  0   79   93    48 0.064   3.34
 2.03 Intr +  96860  96956   97  2  1   82   91    76 0.463   7.31
 2.04 Intr +  97681  97831  151  0  1   69  103    14 0.372   0.74
 2.05 Term + 101136 101263  128  1  2   72   42    73 0.371  -0.46
 2.06 PlyA + 101713 101718    6                               1.05

 3.12 PlyA - 102198 102193    6                              -0.45
 3.11 Term - 105286 105122  165  1  0   77   53   181 0.967  11.42
 3.10 Intr - 106617 106531   87  0  0   89   97   127 0.999  13.87
 3.09 Intr - 106744 106705   40  0  1   75   86     6 0.927  -2.67
 3.08 Intr - 107848 107773   76  2  1   92  106    55 0.987   6.27
 3.07 Intr - 108037 107932  106  1  1  109   33    68 0.984   3.19
 3.06 Intr - 109236 109141   96  0  0   87  115    48 0.994   7.61
 3.05 Intr - 110351 110228  124  1  1   72   67    55 0.993   2.39
 3.04 Intr - 111758 111637  122  2  2   69   94   221 0.998  20.09
 3.03 Intr - 113116 113006  111  1  0  113   68    51 0.985   6.18
 3.02 Intr - 114678 114574  105  0  0   19  115   110 0.963   7.21
 3.01 Init - 120485 120429   57  2  0   65  111   -37 0.303  -2.11
 3.00 Prom - 122068 122029   40                              -7.06

 4.00 Prom + 125702 125741   40                              -6.26
 4.01 Init + 125910 126581  672  2  0   73   63   236 0.608  14.79
 4.02 Intr + 127615 127771  157  0  1    8  100   124 0.596   5.18
 4.03 Intr + 133325 133544  220  0  1  131   89   107 0.739  12.46
 4.04 Intr + 145807 145939  133  1  1   69   71    91 0.972   6.15
 4.05 Term + 147580 147750  171  0  0   73   50   188 0.999  11.33
 4.06 PlyA + 148303 148308    6                               1.05

 5.00 Prom + 148334 148373   40                              -6.26
 5.01 Init + 169597 169850  254  0  2   67   21   509 0.727  38.91
 5.02 Term + 171589 171778  190  1  1   96   51    90 0.933   3.02
 5.03 PlyA + 172626 172631    6                               1.05

 6.06 PlyA - 176042 176037    6                               1.05
 6.05 Term - 182593 182277  317  2  2  101   55   490 0.616  42.10
 6.04 Intr - 199364 199175  190  2  1  114   99   325 0.491  35.36
 6.03 Intr - 200769 200677   93  0  0   62   66    48 0.235   0.16
 6.02 Intr - 214518 214298  221  1  2   68   79   411 0.152  36.22
 6.01 Init - 215275 215176  100  1  1  104   99   121 0.977  15.24

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Intr + 155115 155239  125  0  2   92   90    70 0.822   7.83

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815578r:17842360_18068425|GENSCAN_predicted_peptide_1|396_aa
MVSNRKGWQVLCLQVRGQGHMNQAGPSVSLVGAWGCLRDCAFADVKTAALLQNAENASNS
SREATICHKAENENPFSTSSHWEKTTRPVCQKALHPTPDHPSPACGHTRVWDPRLPKLRT
VKGSQKQGFNFANRSSEKCASQLPTASHVGKTYLGRGVRCQGGWETARKPNAPREALFPI
SWWPFPPTRYHNQIWGNSTHSELFQTPDIQVNKGMEQACVGFESIASPTDIPSSNLLDDV
QARERTNSSPKENWRAAGRGKMHRKATECQTAIPRRHALTHKGHTIWPQASSQLDTSHRT
ALPWESWNPLLVGATTQVEECPQHRDCPLGQLALLPDKGGSFAAPRLWSLRAGLQPNADT
PLMPTAWPPYPVALRPQDAFFPRSSLPTPTWKASLV

>gi568815578r:17842360_18068425|GENSCAN_predicted_CDS_1|1191_bp
atggtctcaaacaggaagggatggcaggtgctatgcctgcaggtgcggggacagggtcac
atgaatcaggctggaccatctgtatccctggttggagcctggggctgcctacgtgactgc
gcttttgcagatgtcaaaacagcagctctcctgcaaaacgccgagaatgcttccaattcc
tccagggaagctacaatttgccataaagctgaaaatgaaaatccttttagcacctcttcc
cactgggagaagaccacgcgacctgtctgccagaaagccctccaccccaccccagaccac
cccagtcctgcctgtggccacacccgtgtgtgggaccctcgtctccccaagctgcgaact
gtgaagggttctcaaaagcagggtttcaattttgcaaacagatcttcagaaaagtgtgcc
tcccagcttcccacggccagccacgtcggcaaaacatacctcgggagaggagtccgctgc
caaggaggctgggaaacagcccggaagcccaatgcgcctcgggaggctctgtttccaatc
agctggtggccgttccccccaactcgttaccataaccagatttggggaaattctactcac
tcagagctgttccagaccccagacattcaggtaaataaaggtatggagcaagcttgtgtt
ggctttgagtcaattgcatctcccacagacatcccctccagcaacctccttgatgacgtt
caagccagggagaggacaaatagttccccaaaagaaaattggagggctgctggaagggga
aagatgcaccggaaggccacagaatgccagacagccattccacgaagacacgcattaact
cacaaaggtcataccatctggccacaggcctcaagccagctggacacctcacaccggaca
gctctgccctgggaatcctggaaccccctgctagtaggcgccacgacccaggtcgaggaa
tgccctcagcacagagactgtcccctggggcagctcgccctactgcctgataagggcggc
tccttcgcggctcccagactctggagcctgcgggctgggctgcagccaaacgcagacacc
cctctaatgccgacagcctggcccccttaccctgtagctctccgccctcaggatgcattc
ttccccagaagctccctgcccacccccacctggaaggcctccctcgtctga

>gi568815578r:17842360_18068425|GENSCAN_predicted_peptide_2|183_aa
MRKQPPMWNTTIGIASMEIVWALIRAISRQRCFGTLRAWTQAVRMRSLSTGSVAGVCQRQ
KADGARFPSKRKSPFTELSLGITTKKPHKEKQDSCGFPPDAVPPAAKCLPAFPGLSTPHR
SQSASKPIPPPQTTMNAPSHTGTSDWEQGWSMGGGEPHTSPMACENASQKKTYQVHANNV
QQH

>gi568815578r:17842360_18068425|GENSCAN_predicted_CDS_2|552_bp
atgaggaagcagccacctatgtggaacaccaccattggaattgccagcatggagattgtc
tgggccctgataagagcaatttcaaggcagcggtgcttcggcaccttgagggcttggact
caggctgtcaggatgaggagcctatccacgggatcagttgctggtgtttgccaaaggcag
aaagcggatggtgcaaggtttccttccaaacggaaatcgcccttcacagagctgagcctg
ggaattaccacaaagaaacctcacaaagaaaagcaagatagctgtggattcccacctgat
gctgttcctcctgctgccaagtgcctgcctgcctttccaggtctcagcacaccacacagg
tcgcagtcagcatctaaacccatccccccaccccagaccactatgaatgcaccctcacat
acaggtaccagtgactgggaacagggctggagtatgggtggtggtgagccacatacttca
ccaatggcttgtgaaaatgcttctcaaaagaagacataccaggttcatgcaaacaacgtt
cagcagcactag

>gi568815578r:17842360_18068425|GENSCAN_predicted_peptide_3|362_aa
MPCSLQCSTVNCLCVGRLPLRSVSVDLNVDPSLQIDIPDALSERDKVKFTVHTKTTLPTF
QSPEFSVTRQHEDFVWLHDTLIETTDYAGLIIPPAPTKPDFDGPREKMQKLGEGEGSMTK
EEFAKMKQELEAEYLAVFKKTVSSHEVFLQRLSSHPVLSKDRNFHVFLEYDQDLSVRRKN
TKEMFGGFFKSVVKSADEVLFTGVKEVDDFFEQEKNFLINYYNRIKDSCVKADKMTRSHK
NVADDYIHTAACLHSLALEEPTVIKKYLLKVAELFEKLRKVEGRVSSDEDLKLTELLRYY
MLNIEAAKDLLYRRTKALIDYENSNKALDKARLKSKDVKLAEAHQQECCQKFEQLSESAK
EG

>gi568815578r:17842360_18068425|GENSCAN_predicted_CDS_3|1089_bp
atgccctgcagtctgcagtgttctactgtgaactgcttgtgtgttggcaggctaccgctg
agatctgtatctgtggacctgaatgttgatccctcgcttcagattgacatacctgatgcg
ctcagtgagagagacaaagtcaaatttacagtgcacacaaagaccacactgcccacgttt
cagagcccagagttttctgttacaaggcaacatgaagactttgtgtggctacatgacact
cttattgaaacaacagactatgctgggcttattattccacctgctcctacgaagcccgac
tttgatggtcctcgagagaagatgcagaaactgggagaaggtgaagggtctatgaccaaa
gaagaatttgccaagatgaaacaagaactggaagctgagtatctcgctgtgtttaagaag
actgtgtcctcccatgaagtctttcttcagcggctttcttctcaccctgttctcagtaaa
gatcgcaactttcatgttttcctggaatatgatcaggatctaagtgttaggcggaaaaat
actaaagagatgtttggtggcttcttcaaaagtgtggtgaaaagtgctgatgaagtcctt
tttactggagttaaggaggtagatgacttctttgagcaagagaagaacttccttattaac
tattacaataggatcaaagattcttgtgtgaaagctgacaaaatgaccagatctcataaa
aatgttgccgatgactatatccacaccgcagcctgcttacatagcctggctttagaagag
cccacagtcatcaaaaagtacctattgaaggttgctgagctatttgaaaaactaaggaaa
gtagagggtcgagtttcatcagatgaagatttgaagctaacagagctcctccgatactac
atgctcaacattgaagctgctaaggatctcttatacagacgcaccaaagccctcattgac
tatgagaactcaaacaaagctctggataaggcccggttaaagagcaaagacgtcaagttg
gctgaggcacaccagcaggagtgctgccagaaatttgaacaactttccgaatctgcaaaa
gaaggttga

>gi568815578r:17842360_18068425|GENSCAN_predicted_peptide_4|450_aa
MAASSHGLWTRKGKNAATPELAGPPAQESEAGRTSPCCGPPPAAAATREPRPWRRGTRAG
AAWLCEERRSWAAAAAAWAPLGGGHGPASAGLPARRRQEASGLRHHPSCPGSRRAGRHVL
PQSSLPVPAAVHLGAGQRRHVGPTLAPLHEEAANRRAATRTGSNGHSPSKTRLEKDRVSV
RGEFITQPESKMAASADVTRSRESRRARFGGSRASETPALPLGEKKVNPYEEVDQEKYSN
LVQSVLSSRGVAQTPGSVEEDALLCGPVSKHKLPNQDVFLQGKRFHEALESILSPQETLK
ERDENLLKSGYIESVQHILKDVSGVRALESAVQHETLNYIGLLDCVAEYQGKLCVIDWKT
SEKPKPFIQSTFDNPLQVVAYMGAMNHDTNYSFQVQCGLIVVAYKDGSPAHPHFMDAELC
SQYWTKWLLRLEEYTEKKKNQNIQKPEYSE

>gi568815578r:17842360_18068425|GENSCAN_predicted_CDS_4|1353_bp
atggcggcgagcagccacggcctatggacgcgcaagggaaagaatgcagcgaccccggag
ctcgcagggccgcccgcccaggagtctgaggctgggaggacctcaccttgctgcggtcct
cctcctgctgctgcagcaactcgggaaccgcggccatggcgacgcgggactcgagcaggg
gccgcctggctgtgcgaggaaagaagaagctgggccgccgccgccgccgcctgggcgcct
ctcgggggcggccacggccccgcctccgccggcctccctgcccgacggcggcaggaggcc
tccggactccgccaccatcccagctgccccgggagcaggcgagcagggcgccacgtgctc
ccccagagcagcctcccagtccccgctgccgtccatcttggagccgggcaaagacgccac
gtggggcctacccttgctccgctccacgaggaggccgccaaccgcagggccgcgacacgg
acgggaagcaacggacactctcccagcaagacgcgtctagagaaagaccgcgtttcggtg
cggggggaatttattactcagcccgagtccaagatggcagcgagcgctgacgtcaccaga
tctcgtgagagcagaagggcgcgatttggaggctcccgcgcttcggagacgccggccctt
ccgctcggagagaaaaaagtgaacccatatgaagaagtggaccaagaaaaatactctaat
ttagttcagtctgtcttgtcatccagaggcgtcgcccagaccccgggatcggtggaggaa
gatgctttgctctgtggacccgtgagcaagcataagctgccaaaccaagacgtcttttta
caagggaaacggttccacgaagccttggaaagcatactttcaccccaggaaaccttaaaa
gagagagatgaaaatctcctcaagtctggttacattgaaagtgtccagcatattctgaaa
gatgtcagtggagtgcgagctcttgaaagtgctgttcaacatgaaaccttaaactatata
ggtctgctggactgtgtggctgagtatcagggcaagctctgtgtgattgattggaagaca
tcagagaaaccaaagccttttattcaaagtacatttgacaacccactgcaagttgtggca
tacatgggtgccatgaaccatgataccaactacagctttcaggttcaatgtggcttaatt
gtggtggcctacaaagatggatcacctgcccacccacatttcatggatgcagagctctgt
tcccagtactggaccaagtggcttcttcgactagaagaatatacggaaaagaaaaagaac
cagaatattcagaaaccagaatattcagaatag

>gi568815578r:17842360_18068425|GENSCAN_predicted_peptide_5|147_aa
MSDAAVDTSSEITTKDLKKKEAVEEAENGRDTPANGKANEENGEQEADNEVDEEEEEGGE
EDEEEEEGDGEEEDGDEDEEAESARTFSFGTYHVLGAQTSPHREETIRRGPCGEELRPLA
NHQHELPGSRENGPSDEAQAPAFESFS

>gi568815578r:17842360_18068425|GENSCAN_predicted_CDS_5|444_bp
atgtcagacgcagccgtagacaccagctccgaaatcaccaccaaggacttaaagaagaag
gaagctgtggaggaagcggaaaatggaagagacacccctgctaatgggaaggctaatgag
gaaaatggggagcaggaagctgacaatgaagtagatgaagaagaggaagaaggtggggag
gaagacgaggaggaagaagaaggcgatggtgaggaagaggatggtgatgaagacgaggaa
gctgagtccgctaggaccttctcctttggaacctaccacgtccttggagcccaaaccagc
ccacacagagaggagaccataaggagaggtccatgtggagaggaattgaggcccctagcc
aaccaccagcatgaactaccaggttcacgagagaatgggccttcagatgaagcccaggcc
ccagcctttgagtctttcagctga

>gi568815578r:17842360_18068425|GENSCAN_predicted_peptide_6|306_aa
MPKVFLVKRRSLGVSVRSWDELPDEKRADTYIPVGLGRLLHDPPEDCRSDGGSSSGSGSS
SAGEPGGAESSSSPHAPESETPEPGDAEGPDGHLATKQRPVARSKIKVTVGALVSRKEWR
RKKAPFAPLQGDFYFIGQFTTGTCSDSVVHSCDLCGKGFRLQRMLNRHLKCHNQVKRHLC
TFCGKGFNDTFDLKRHVRTHTGIRPYKCNVCNKAFTQRCSLESHLKKIHGVQQQYAYKQR
RDKLYVCEDCGYTGPTQEDLYLHVNSAHPGSSFLKKTSKKLAALLQGKLTSAHQENTSLS
EEEERK

>gi568815578r:17842360_18068425|GENSCAN_predicted_CDS_6|921_bp
atgcccaaagtcttcctggtgaagaggaggagcctgggggtctcggtccgcagctgggat
gagctcccggatgagaaaagggcagacacctacatcccagtgggcctaggccgcctgctc
cacgacccccccgaggactgccgcagcgacggcggcagcagcagcggcagcggcagcagc
agcgcgggggagcctggaggagcagagagcagctcgtccccgcacgcccccgagagcgaa
acccccgagcccggcgacgccgagggccccgatggacacctggcgaccaagcagcgcccg
gtcgccagatcgaaaatcaaggtgactgttggtgctctagtcagcaggaaggagtggagg
aggaagaaggcaccgtttgctccccttcaaggagatttctatttcattggccagttcacc
acaggcacgtgcagcgactcggtggttcacagctgtgacctgtgtggcaagggcttccgt
ctgcagcgcatgctgaaccgtcacctcaagtgccacaaccaggtgaaaagacacctgtgc
accttctgcggcaagggcttcaacgacaccttcgacctgaagaggcacgtccgcacacac
acaggcattcgtccctacaaatgcaacgtctgcaataaagccttcacccagcgctgctct
ctggagtcccacctgaagaaaatccatggggtgcagcagcagtatgcctataagcagcgg
cgggacaagctctacgtctgcgaggattgcggctacacgggccccacccaggaggacctg
tacctgcacgtgaacagtgcccatccgggcagctcgtttctcaaaaagacatctaaaaaa
ctggcagcccttctgcagggcaagctgacatccgcacaccaggagaataccagcctgagt
gaggaggaggagaggaagtga