GENSCAN 1.0	Date run: 3-Nov-116	Time: 02:39:00

Sequence gi568815595f:150309351_150558633 : 249283 bp : 38.36% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Term +   5445   5565  121  0  1   97   46   102 0.886   3.87
 1.02 PlyA +   5892   5897    6                               1.05

 2.06 PlyA -   6952   6947    6                               1.05
 2.05 Term -  10887  10707  181  2  1   72   42    83 0.162  -1.80
 2.04 Intr -  14174  13834  341  2  2   38   97   219 0.062  11.15
 2.03 Intr -  31913  31756  158  0  2   93  109    96 0.215  11.01
 2.02 Intr -  35856  35747  110  2  2  104   23    92 0.000   3.31
 2.01 Init -  58949  58909   41  2  2   95  106    28 0.131   5.24
 2.00 Prom -  90195  90156   40                              -3.35

 3.00 Prom +  90787  90826   40                              -3.35
 3.01 Init + 100001 101958 1958  1  2   76   80  1854 0.596 170.94
 3.02 Intr + 147726 147777   52  0  1   66  115    49 0.843   3.39
 3.03 Term + 149026 149286  261  0  0  110   42   178 0.994   9.94
 3.04 PlyA + 150141 150146    6                               1.05

 4.00 Prom + 160116 160155   40                              -8.25
 4.01 Init + 160470 160654  185  2  2   51   27   170 0.133   5.84
 4.02 Term + 186707 186827  121  2  1  100   43   113 0.089   4.97
 4.03 PlyA + 186864 186869    6                               1.05

 5.03 PlyA - 187558 187553    6                               1.05
 5.02 Term - 192021 191786  236  1  2   44   49   202 0.367   7.40
 5.01 Init - 192522 192084  439  0  1   48   28   277 0.358  14.02
 5.00 Prom - 197609 197570   40                              -3.85

 6.00 Prom + 200428 200467   40                              -5.75
 6.01 Init + 202244 202296   53  1  2  106   94    38 0.566   6.88
 6.02 Intr + 224801 225135  335  2  2  -18   52   265 0.006   6.59
 6.03 Term + 228411 228538  128  1  2   78   43    85 0.847   0.46
 6.04 PlyA + 230118 230123    6                               1.05

 7.07 PlyA - 230731 230726    6                               1.05
 7.06 Term - 230883 230849   35  1  2   76   54    34 0.006  -4.43
 7.05 Intr - 236428 236353   76  1  1   90  119    21 0.150   3.57
 7.04 Intr - 236802 236520  283  2  1   -3   78   268 0.584  13.20
 7.03 Intr - 237214 236994  221  1  2   76   69   108 0.587   3.88
 7.02 Intr - 237438 237244  195  0  0    8   57   159 0.537   3.49
 7.01 Intr - 237827 237556  272  0  2  101   35   178 0.603  10.14

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Term -  30971  30888   84  2  0  141   41    18 0.812  -0.93
S.002 Init -  35399  35221  179  2  2   75   94   111 0.887   9.18
S.003 Term -  99073  98876  198  1  0   10   45   311 0.970  15.42
S.004 Init -  99445  99137  309  1  0   68   28   177 0.931   5.06
S.005 Init + 224907 225135  229  2  1   34   52   222 0.950  11.78

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815595f:150309351_150558633|GENSCAN_predicted_peptide_1|40_aa
XLRSSFKPLAVGGADGSLTGPLPTMRKEKVPFILSLITGG

>gi568815595f:150309351_150558633|GENSCAN_predicted_CDS_1|123_bp
nnactcagaagttcctttaagcctctggccgttggaggagctgatggcagcttgactggg
cctctgccaaccatgaggaaagaaaaagtcccattcatattgtcccttatcactgggggc
tga

>gi568815595f:150309351_150558633|GENSCAN_predicted_peptide_2|276_aa
MAPDDSLLPGMHAGDSLVQEGSSFGNVLTPFENSAKVQGFDGLIKPYFRPKPRLLTLQIL
FAENGCLFAVYCDSSTRKEPNRFNLLNSTLLNKKECRIYSTNQSPVVCFDRALIGAFTIP
ELDIKVLHIPTRLRNPAGFTQWIQHRGCRWSCLPVPVPCAPHSSALGRSMGLGTLEQGVA
LIGEARAAQEPMEGVGGSGMAGCRSRALPRGKAVKARNTMGVPIGSDLCLSINHQACSHP
VFNQSGKEFSKAVIQQKLCGFVRLIQGFGMTHWGLG

>gi568815595f:150309351_150558633|GENSCAN_predicted_CDS_2|831_bp
atggcccctgatgactccctccttcctggtatgcatgctggtgattccttagttcaagag
ggttcatcctttgggaatgttttaacaccctttgaaaacagtgccaaagtccagggcttt
gacggcttgataaaaccttattttaggccaaaaccccgcttattgactctacagatactt
tttgcagaaaatggctgcctctttgctgtctactgtgactcttcaactaggaaagaacca
aatagatttaacctactcaactcgacacttctgaacaaaaaagaatgcagaatatatagc
accaaccagagcccagtggtctgtttcgacagggcgctgattggtgcgtttacaatccct
gagctagacataaaggttctccacatccccaccagactcaggaacccagctggcttcacc
cagtggatccagcaccggggttgcaggtggagctgcctgccagtcccggtgccgtgcgcc
ccccactcctcagcccttgggcggtcgatgggactgggcaccttggagcagggggtggcg
ctcatcggggaggctcgggccgcacaggagcccatggagggagtgggaggctcaggcatg
gcgggctgcaggtcccgagccctgccccgcgggaaggcagttaaggcccgaaatacaatg
ggtgtgcccattgggtcagacttgtgcctttctataaatcatcaggcttgcagtcatcca
gtgttcaaccagtcaggcaaagaattctccaaggcagtaattcaacagaagctttgtgga
tttgtcagactgattcaagggtttggaatgactcactggggcttgggttaa

>gi568815595f:150309351_150558633|GENSCAN_predicted_peptide_3|756_aa
MSKMPAKKKSCFQITSVTTAQVATSITEDTESLDDPDESRTEDVSSEIFDVSRATDYGPE
EVCERSSSEETLNNVGDAETPGTVSPNLLLDGQLAAAAAAPANGGGVVSARSVSGALAST
LAAAATSAPAPGAPGGPQLAGSSAGPVTAAPSQPPTTCSSRFRVIKLDHGSGEPYRRGRW
TCMEYYERDSDSSVLTRSGDCIRHSSTFDQTAERDSGLGATGGSVVVVVASMQGAHGPES
GTDSSLTAVSQLPPSEKMSQPTPAQPQSFSVGQPQPPPPPVGGAVAQSSAPLPPFPGAAT
GPQPMMAAAQPSQPQGAGPGGQTLPPTNVTLAQPAMSLPPQPGPAVGAPAAQQPQQFAYP
QPQIPPGHLLPVQPSGQSEYLQQHVAGLQPPSPAQPSSTGAAASPATAATLPVGTGQNAS
SVGAQLMGASSQPSEAMAPRTGPAQGGQVAPCQPTGVPPATVGGVVQPCLGPAGAGQPQS
VPPPQMGGSGPLSAVPGGPHAVVPGVPNVPAAVPAPSVPSVSTTSVTMPNVPAPLAQSQQ
LSSHTPVSRSSSIIQHVGLPLAPGTHSAPTSLPQSDLSQFQTQTQPLVGQVDDTRRKSEP
LPQPPLSLIAENKPVVKPPVADSLANPLQLTPMNSLATSVFSIAIPVDGDEDSASGGGVV
AIDNKIEQAMDLVKSHLMYAVREEVEVLKEQIKELVERNSLLERENALLKSLSSNDQLSQ
LPTQQANPGSTSQQQAVIAQPPQPTQPPQQPNVSSA

>gi568815595f:150309351_150558633|GENSCAN_predicted_CDS_3|2271_bp
atgtccaagatgccggccaagaagaagagctgcttccagatcaccagtgtcaccacggcc
caggtggccactagcatcaccgaggacaccgagagcttggacgacccggacgagtcacgc
acagaggacgtctcctccgagattttcgacgtctctcgggccacggattatggccctgag
gaggtctgcgagcgcagctcttccgaagagacgcttaacaatgttggggatgcggagact
cccgggaccgtctccccaaacctcctcctagatgggcagctggcagcggcggctgctgct
cccgccaacggaggaggagtcgtttcggcccggagcgtgtctggggcgctcgccagtacc
ctggcggcggctgccacttcggcccccgcccccggagcacccggcggcccccagctcgcg
ggctcatccgccgggccagtgactgcagccccatctcagcctcccaccacatgtagttcc
cgttttcgcgtgatcaagctggaccacgggagcggagagccctatagacgcggccgatgg
acgtgtatggaatactatgagagggattcagacagcagcgtcctgactagatccggggat
tgcattagacacagcagtacttttgaccagactgcggagcgggacagcggcctgggcgcc
accggagggtcggtggtggtagtagtggcctccatgcagggggcgcacgggcccgagtcg
ggaactgacagctccttgactgctgtgtcacagctacccccgtcggagaaaatgagccag
cccactccggcccagccgcagagttttagcgttgggcagccacagccgccgccgccaccc
gtaggtggggctgtggctcaaagctcggctccgctgccgccgttcccgggagccgcgacc
gggccgcagccaatgatggcagccgcgcagcccagccagccccagggagcggggcccggg
ggacagactctgccgccgacgaatgtaaccctggcgcagccggctatgtccctgcctccg
cagccgggccctgcagtgggcgcccccgcggcgcagcagccccagcagttcgcgtatcct
cagcctcagataccgcccggacatttgctgcccgtccagccctccggccagagtgagtac
ctgcagcagcacgtggccggcctgcagccgccaagccccgcgcagccctcgtccaccggc
gccgcagcgagccccgccacggcggccacccttcccgtgggcaccggccagaatgcttcc
tcggtgggcgcgcagctcatgggcgcgtcttcccagcccagcgaagccatggccccccgg
acgggaccagcgcaaggcgggcaggtcgcgccttgtcagccgactggagtgcccccggct
actgtgggaggcgtggtgcagccgtgcctcggtcctgccggggctgggcagccccagtcc
gtgcctccgccgcagatgggtggcagtggtccgctgtcagccgtacctggtggccctcac
gccgtggtgcccggagttccaaacgtgcctgcagccgtgcccgctccaagcgtgcctagt
gtgtctaccacttctgttactatgccaaatgtacccgcgcctctggcccagtcgcaacag
ctgagcagccatacgccagtcagcaggagcagcagcataatccagcatgttgggctgccc
ttagcgccaggcacacacagcgcaccaacaagtctaccacagtctgacctaagccagttt
caaactcagacccagcctttagtcgggcaagtcgacgatactagaagaaaatcagaaccc
ctacctcaaccaccactttctctcattgctgaaaataagcctgttgtgaagccgcctgtt
gcagattccctggcaaacccccttcagttaacacctatgaacagtctggccacctctgta
ttcagcatagctattcctgttgatggtgatgaagacagtgcatctgggggaggtgttgta
gccattgacaacaaaatagaacaagcaatggatctggtgaaaagccatttgatgtatgca
gtaagagaagaagtggaagttttaaaggaacaaataaaagaattagttgaaagaaactct
ttacttgaacgagaaaatgcactgttaaaatctctttcaagcaatgatcaattatcccaa
ctcccaacccaacaggccaatcctggtagcacttctcaacagcaagcagtgatagcacag
cctccgcagccaacgcaacctccacagcagccgaatgtctcctcagcataa

>gi568815595f:150309351_150558633|GENSCAN_predicted_peptide_4|101_aa
MAVMTADSSGNAAFLEASSAQIITLWEGKKVTTIFYPRILKNVIPQKGVLRGGGECQPQE
GSKKLPPSSPLAPPTPDAFTVFSVDSKFGTEKLTALKVLLL

>gi568815595f:150309351_150558633|GENSCAN_predicted_CDS_4|306_bp
atggctgtgatgacagctgactcatctggtaatgcagctttcctggaggcttcctctgct
cagataatcacactctgggagggaaaaaaggttacaacaatcttctaccccaggatactc
aagaatgtcatcccccagaagggagttctacgaggaggaggagaatgtcagccccaggag
gggagcaaaaagctgccgccctccagtcccctggctccccccactccagatgcattcaca
gtttttagtgtagattccaaatttggaactgaaaaactcacagccttgaaagtcctacta
ttgtag

>gi568815595f:150309351_150558633|GENSCAN_predicted_peptide_5|224_aa
MDNEVQAEVFSDGDEELVGNWSKGYSCYALAKRLVAFCPCLRDLWNFELEGDDLGYLVKE
ISKWQSVQKEAEDNSLEILQPNEAVEKENPFSGEKFKPAAEICISNEELNVNHQDNGENV
SGHVRELHNSPSHHRPRVPGGKDGFVAMAKRGQGTAWTTASQGASPKPGQLPHDIEPVGA
EMSRIEIWEPPPRFQRMYGNTWMSKQKFAGEGEALMENLLENLC

>gi568815595f:150309351_150558633|GENSCAN_predicted_CDS_5|675_bp
atggacaatgaagtccaggctgaggtgttctcagatggagatgaggaacttgttgggaac
tggagtaaaggttactcttgctatgctttagcaaagagactggtggcattttgcccctgc
cttagagatctgtggaactttgaacttgagggagatgatttagggtatctggtgaaagaa
atttctaagtggcaaagcgttcaaaaggaagcagaggacaatagtttggaaattttgcaa
cccaatgaagcagtagaaaaggaaaaccccttttctggggagaaattcaagccagctgca
gaaatttgcataagtaatgaggagctgaatgttaatcaccaagacaatggggaaaatgtc
tcagggcatgtcagagaacttcacaacagcccctcccatcacaggcccagagtcccagga
gggaaagatggttttgtggccatggctaaaagaggccaaggcacagcttggaccactgct
tcacagggtgcaagccccaagcctgggcagcttccacatgatattgagcctgtgggtgca
gagatgtcaagaattgagatttgggaacctccacctagatttcagaggatgtatggaaac
acctggatgtccaagcagaagtttgctggcgagggggaagccctcatggagaacctcttg
gagaacctctgctag

>gi568815595f:150309351_150558633|GENSCAN_predicted_peptide_6|171_aa
MDNPGGLYVKQNKSGTERWGKKEIEFVTKQLSTKKSLGPDGFIGEFYKTYKEFMPILHKL
FQETEENNSRLILRSQYYPDNKTYTTRNENYTKISYEHKCKNSTEILANQNLYLKKGIFK
KLTTNIILHDRSPEDPVKMQILIQWAFGGVSETLHFYQASRLLLVLNLHSE

>gi568815595f:150309351_150558633|GENSCAN_predicted_CDS_6|516_bp
atggataaccctggagggctttatgttaagcaaaataagtcaggcacagaaaggtgggga
aaaaaagaaattgaatttgtaacaaaacaactatccacaaagaaaagcctaggcccagat
ggctttattggtgaattctacaaaacatacaaagaattcatgccaattcttcacaagctt
ttccaggaaaccgaagagaacaactcccgacttattctacgaagccagtattaccctgat
aacaaaacctacacaacaagaaatgaaaactacaccaagatctcttacgaacataaatgt
aaaaattctacagagatactagcaaaccaaaatctctacctgaaaaagggcatcttcaaa
aaactcacaactaacatcatacttcatgatagatcacctgaggatcctgtgaaaatgcag
attctgattcagtgggcctttggtggggtgtctgagacactgcatttctatcaagcttcc
aggttattgctggtcctcaacctacactctgagtag

>gi568815595f:150309351_150558633|GENSCAN_predicted_peptide_7|360_aa
XLNDFTRARVTPVLQTHVSQIQLAHCFSPPSSISVEDESAGKNDGWVKGRGNTKAQRRRL
HFRLPRYIKKANSLAFPQATRKGRSDSPAPRRNRSGESSPGPNPAPTRLLMRGDSELVFP
EPSASGPIHFRLRSDAEKAPKRARANLGRRLTELGRREGRALPRGGVRLVLTLAAEPKVD
RGGGLHIPVVALRFLPLSLRAHGGGQSGGDGGARTTRRPVLFLLRTCPARWWRREDGRQA
KDPYGQREAQQEHHPARQRRQDLGKERAGAPLRSAAGAESGSLGPRRVFSRIRPTVGWCR
GGVGARVCEPPDCGGSGTEGETCERNAPEEKASVGPWLLALFIFVVCGSVCYGESLPNGE

>gi568815595f:150309351_150558633|GENSCAN_predicted_CDS_7|1083_bp
nacctaaatgacttcacaagggcccgtgtaacaccagtattgcagacccatgtgtcacaa
attcaactagcacattgtttttctcccccttcttctatttctgtagaagatgaatccgca
ggcaagaatgatggatgggtgaaagggagggggaatacgaaagcccagaggcggagactt
cacttccgattgccacgctatatcaagaaagccaacagtctggcctttccgcaagccaca
agaaaaggcagatcagactccccggctcctcggagaaaccgctcgggtgagagttcacca
gggccgaacccggcccctacacgtctcctcatgcgtggagacagcgaattggtgtttccg
gagccatcagcctcggggccaattcacttccggttacggagtgacgcggaaaaggcgccc
aaacgcgcacgcgcaaatctagggcgacgcttgacagagcttgggaggcgggaggggcgc
gcacttccgcggggcggagtccgtctagtgctgacgttggcagccgaacccaaagtagat
cgaggcggcgggctgcacattcccgttgttgcgttgcgtttccttcctctttcactccgc
gctcacggcggcggccaaagcggcggcgacggcggcgcgagaacgacccggcggccagtt
ctcttcctcctgcgcacctgccccgctcggtggtggcgccgcgaagatggtcgccaagca
aaggatccgtatggccaacgagaagcacagcaagaacatcacccagcgcggcaacgtcgc
caagacctcggtaaggaaagagcgggcgcccctctgcgttccgcagccggagctgagtcc
gggtccctggggccgcgtcgcgttttctcccggatccggccgaccgtgggctggtgccgt
ggaggggtaggggcccgagtctgcgaaccgccagactgtgggggctcgggaacggaaggg
gagacctgtgagagaaatgcccccgaagagaaggcgtctgtaggaccctggttattggct
ctcttcatttttgttgtctgtggttctgtgtgttatggagaaagcctgccaaacggagaa
tga