GENSCAN 1.0 Date run: 4-Nov-116 Time: 02:02:13 Sequence gi568815596r:61378823_61633897 : 255075 bp : 42.80% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 38097 38383 287 2 2 78 24 312 0.067 20.40 1.02 Intr + 46583 46672 90 2 0 77 39 94 0.036 1.59 1.03 Term + 48528 48801 274 0 1 25 32 235 0.819 5.56 1.04 PlyA + 49532 49537 6 1.05 2.00 Prom + 51598 51637 40 -6.65 2.01 Init + 54327 54457 131 2 2 60 80 66 0.312 2.67 2.02 Intr + 60925 61109 185 1 2 144 54 26 0.315 3.31 2.03 Intr + 65011 65131 121 2 1 17 42 139 0.009 0.83 2.04 Term + 91745 91919 175 2 1 81 42 159 0.527 6.85 2.05 PlyA + 95539 95544 6 1.05 3.19 PlyA - 96696 96691 6 1.05 3.18 Term - 100144 99998 147 1 0 93 38 161 0.966 8.52 3.17 Intr - 102116 102088 29 0 2 83 86 -15 0.314 -5.28 3.16 Intr - 104269 104135 135 2 0 89 99 26 0.705 3.42 3.15 Intr - 105283 105115 169 1 1 68 66 88 0.797 3.20 3.14 Intr - 107140 106946 195 1 0 119 93 90 0.993 11.29 3.13 Intr - 109449 109343 107 1 2 72 115 27 0.979 2.81 3.12 Intr - 109949 109766 184 2 1 61 78 143 0.999 9.04 3.11 Intr - 111954 111820 135 0 0 66 106 102 0.999 9.64 3.10 Intr - 113374 113213 162 1 0 9 91 124 0.735 4.05 3.09 Intr - 113926 113745 182 2 2 67 91 94 0.873 6.27 3.08 Intr - 114231 114093 139 0 1 2 106 97 0.960 2.02 3.07 Intr - 116791 116633 159 1 0 85 94 74 0.988 6.96 3.06 Intr - 118185 118057 129 0 0 55 90 118 0.985 8.67 3.05 Intr - 121072 120887 186 1 0 74 29 88 0.554 0.46 3.04 Intr - 123218 123174 45 2 0 109 115 46 0.985 7.09 3.03 Intr - 123488 123427 62 0 2 67 94 34 0.591 -0.57 3.02 Intr - 143861 143789 73 2 1 70 97 36 0.258 0.76 3.01 Init - 147693 147598 96 0 0 59 80 95 0.259 6.16 3.00 Prom - 148206 148167 40 -8.35 4.00 Prom + 148837 148876 40 -6.45 4.01 Init + 149569 149581 13 0 1 74 83 24 0.019 -0.92 4.02 Term + 159273 159904 632 1 2 114 32 346 0.898 25.09 4.03 PlyA + 160853 160858 6 1.05 5.00 Prom + 184774 184813 40 -6.35 5.01 Init + 211590 211716 127 2 1 71 -20 211 0.943 8.97 5.02 Term + 213343 213488 146 2 2 109 49 132 0.622 8.49 5.03 PlyA + 215819 215824 6 1.05 6.04 PlyA - 218848 218843 6 1.05 6.03 Term - 226817 226588 230 0 2 83 43 97 0.055 0.51 6.02 Intr - 249146 249040 107 1 2 93 55 91 0.323 5.24 6.01 Init - 251104 251097 8 1 2 96 105 0 0.394 3.05 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Sngl + 38097 38387 291 2 0 78 39 317 0.862 21.20 S.002 Init + 47831 47928 98 1 2 75 72 77 0.812 4.63 S.003 Intr + 158050 158306 257 1 2 37 16 205 0.826 4.36 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815596r:61378823_61633897|GENSCAN_predicted_peptide_1|216_aa MNGDILDPTAIWDEPLLSHHIFKFIGIELGEAPLLPDVDLLAAGELELGPEQDLNHMRLV LQLGMDGHNLANVNPGHSALGLSKGTSHACLDPTLGQQQKAWCLEPLGAEGGTTQQLQAT ELSAVLKPSQEGQLQIRPDSEDYNKYLTLQCHSRTSTSINAIQENMTSPNELNKASGTNP GETEICDLSDKEFKIAVLRKLKEIQDNTEGIQNSIR >gi568815596r:61378823_61633897|GENSCAN_predicted_CDS_1|651_bp atgaatggcgacatacttgaccccacagccatctgggatgagccgcttctcagccaccat atcttcaaattcatcggcattgaacttggtgaagccccacttcttccagatgtggatctt ctggcagccggggaacttgaacttggccctgagcaggacctcaatcacatgcgccttgtt ctgcagcttggtatggatggacataacttggccaatgtgaaccctggccacagtgccctg gggctttccaaaggcacctcacatgcctgtttggaccctacactgggccagcagcaaaag gcctggtgcctagagcctctcggtgctgagggagggacaacacagcaactgcaagccact gaactcagtgctgtcctaaagccttcccaagaaggacagcttcaaataaggccagacagt gaagactacaataaatacctaactcttcaatgccactcaagaacatctactagcatcaac gccatccaggaaaacatgacctcaccaaatgaactaaataaggcatcagggaccaatcct ggagaaacagagatatgtgacctttcagataaggaattcaaaatagccgtgttgaggaaa ctcaaagaaattcaagataacacagaaggaatacagaattctatcagataa >gi568815596r:61378823_61633897|GENSCAN_predicted_peptide_2|203_aa METHLLVTKSRKAVWKHPASAAPSPLTQMGFACSQQGLPDAGKKEAGGCGQRVGLETCSA IPCHYVGHTQLGGDLMMCGPERCLGVDSVDGCTVPLGQGLLLGQQDSPSVTNVPLTLVSD VDNGGGFVCGESGGIPEISAPYAQFWPRQPGEARELREDPKPCTPTGRYRGYLLSFNTST RSAQFSHIVRPPPPPSPASDHTD >gi568815596r:61378823_61633897|GENSCAN_predicted_CDS_2|612_bp atggagacacacctgttggtgacaaagtccaggaaggcagtatggaagcatccagcctct gcagccccatctcccctcacccagatgggatttgcctgcagtcagcagggacttcctgat gcagggaaaaaggaggcaggaggctgtgggcaaagggtgggcctggagacctgctcagcc ataccctgtcactatgtgggtcacacacagctgggtggagacctcatgatgtgtggccca gagaggtgtctgggtgtggattctgtggatggctgcactgtgccattaggacagggactg ctgctgggccaacaggattcaccaagtgtaacaaatgtaccactcactctggtgtcagat gttgataatggaggaggcttcgtgtgtggggagtcaggtggcataccggaaatatctgca ccttatgctcaattttggccgcggcagcccggggaggccagagagctgcgcgaggacccc aaaccgtgcaccccgacaggccgctaccgcggctacttactttcatttaacacctccacc aggtctgcgcagttctcgcacatcgttcggccgccgccccccccctcccccgcttcggat cacactgactga >gi568815596r:61378823_61633897|GENSCAN_predicted_peptide_3|777_aa MAQEVLTHLKEHPDAWTRVDTILEFSQNMNTKYYGLQILENVIKTRWKILPRNQCEGIKK YVVGLIIKTSSDPTCVEKEKVYIGKLNMILVQILKQEWPKHWPTFISDIVGASRTSESLC QNNMVILKLLSEEVFDFSSGQITQVKSKHLKDRQFLNVPMFRNVSLKCLTEIAGVSVSQY EEQFVTLFTLTMMQLKQMLPLNTNIRLAYSNGKDDEQNFIQNLSLFLCTFLKEHDQLIEK RLNLRETLMEVRLLMVSRMAKPEEVLVVENDQGEVVREFMKDTDSINLYKNMRETLVYLT HLDYVDTERIMTEKLHNQVNGTEWSWKNLNTLCWAIGSISGAMHEEDEKRFLVTVIKTHD GVQDMACDTFIKIAQKCRRHFVQVQVGEVMPFIDEILNNINTIICDLQPQQVHTFYEAVG YMIGAQTDQTVQEHLIEKYMLLPNQVWDSIIQQATKNVDILKDPETVKQLGSILKTNVRA CKAVGHPFVIQLGRIYLDMLNVYKCLSENISAAIQANGEMVTKQPLIRSMRTVKRETLKL ISGWVSRSNDPQMVAENFVPPLLDAVLIDYQRNVPAAREPEVLSTMAIIVNKLGGHITAE IPQIFDAVFECTLNMINKDFEEYPEHRTNFFLLLQAVNSHCFPAFLAIPPTQFKLVLDSI IWAFKHTMRNVADTGLQILFTLLQNVAQEEAAAQSFYQTYFCDILQHIFSVVTDTSHTAG SLKCIMEYWEFAGEDTSDLFLEEREIALRQADEEKHKRQMSVPGIFNPHEIPEEMCD >gi568815596r:61378823_61633897|GENSCAN_predicted_CDS_3|2334_bp atggctcaagaagtactgacacatttaaaggagcatcctgatgcttggacaagagtcgac acaattttggaattttctcagaatatgaatacgaaatactatggactacaaattttggaa aatgtgataaaaacaaggtggaagattcttccaaggaaccagtgcgaaggaataaaaaaa tacgttgttggcctcattatcaagacgtcatctgacccaacttgtgtagagaaagaaaag gtgtatatcggaaaattaaatatgatccttgttcagatactgaaacaagaatggcccaaa cattggccaacttttatcagtgatattgttggagcaagtaggaccagcgaaagtctctgt caaaataatatggtgattcttaaactcttgagtgaagaagtatttgatttctctagtgga cagataacccaagtcaaatctaagcatttaaaagacaggcaattcctgaatgttccaatg tttcgaaatgtctctctgaagtgcctcactgagattgctggtgtgagtgtaagccaatat gaagaacaatttgtaacactatttactctgacaatgatgcaactaaagcagatgcttcct ttaaataccaatattcgacttgcgtactcaaatggaaaagatgatgaacagaacttcatt caaaatctcagtttgtttctctgcacctttcttaaggaacatgatcaacttatagaaaaa agattaaatctcagggaaactcttatggaggtccgtttattaatggttagtcgaatggct aaaccagaggaagtattggttgtagagaatgatcaaggagaagttgtgagagaattcatg aaggatacagattccataaatttgtataagaatatgagggaaacattggtttatcttact catctggattatgtagatacagaaagaataatgacagagaagcttcacaatcaagtgaat ggtacagagtggtcatggaaaaatttgaatacattgtgttgggcaataggctccattagt ggagcaatgcatgaagaggacgaaaaacgatttcttgttactgttataaagacccatgat ggagtccaggatatggcttgtgatactttcattaaaatagcccaaaaatgccgcaggcat ttcgttcaggttcaggttggagaagtgatgccatttattgatgaaattttgaacaacatt aacactattatttgtgatcttcagcctcaacaggttcatacgttttatgaagctgtgggg tacatgattggtgcacaaacagatcaaacagtacaagaacacttgatagaaaagtacatg ttactccctaatcaagtgtgggatagtataatccagcaggcaaccaaaaatgtggatata ctgaaagatcctgaaacagtcaagcagcttggtagcattttgaaaacaaatgtgagagcc tgcaaagctgttggacacccctttgtaattcagcttggaagaatttatttagatatgctt aatgtatacaagtgcctcagtgaaaatatttctgcagctatccaagctaatggtgaaatg gttacaaagcaaccattgattagaagtatgcgaactgtaaaaagggaaactttaaagtta atatctggttgggtgagccgatccaatgatccacagatggtcgctgaaaattttgttccc cctctgttggatgcagttctcattgattatcagagaaatgtcccagctgctagagaacca gaagtgcttagtactatggccataattgtcaacaagttagggggacatataacagctgaa atacctcaaatatttgatgctgtttttgaatgcacattgaatatgataaataaggacttt gaagaatatcctgaacatagaacgaactttttcttactacttcaggctgtcaattctcat tgtttcccagcattccttgctattccacctacacagtttaaacttgttttggattccatc atttgggctttcaaacatactatgaggaatgtcgcagatacgggcttacagatacttttt acactcttacaaaatgttgcacaagaagaagctgcagctcagagtttttatcaaacttat ttttgtgatattctccagcatatcttttctgttgtgacagacacttcacatactgctggt agtcttaaatgtataatggaatattgggaatttgcaggtgaagacacttctgatttgttt ttggaagagagagaaatagccctacggcaggctgatgaagagaaacataaacgtcaaatg tctgtccctggcatctttaatccacatgagattccagaagaaatgtgtgattaa >gi568815596r:61378823_61633897|GENSCAN_predicted_peptide_4|214_aa MGGRATGGSPLPLWVVARTAAAKTGTGTAAGAVATVALADAVAPAAPAAAAAAASPPPPS ASNSEPVETGSDWEIPSRLSFQPMAPRSVWNLGLRPQPALLLVSSSYSRAMSLSLIGPSA SQSRLTAPHEPAASAPTSFQQRFARRVGEGKWGGSKGKRRGVEEEEVGRRAGMSRKESEG KPRGGFVRTSCQSRAARPRALDFPSPCGPLPPGA >gi568815596r:61378823_61633897|GENSCAN_predicted_CDS_4|645_bp atgggaggccgagcgactggtggttcccccctccccctctgggtggttgcacggactgca gcagcaaagactggaacaggcaccgccgccggggctgtagctactgttgctcttgctgat gctgtagctcccgctgctcctgccgcggccgctgcagccgcttctccccctcctcctagt gcctcaaactcggaaccggttgaaaccggttcagactgggagattccatctcggttgagt tttcagcccatggcgccgcggagcgtttggaacctgggcctccgcccccagcccgccctc ctattggtcagttccagctacagccgcgccatgagcctatctctcattggcccgagcgcc agtcaatcccgcctcacggctccacacgagcccgccgcttcggcccctacctcttttcag cagcgatttgcccgcagagtgggggaggggaagtggggagggtcgaaaggcaagaggaga ggggtggaggaggaggaggtgggacgaagagcagggatgagtagaaaagagagcgaagga aagccgcgtggaggattcgttcgaacgagctgtcaatcaagagcagcccgccctcgagct ctcgacttcccttccccctgcggccctctccctcccggtgcctag >gi568815596r:61378823_61633897|GENSCAN_predicted_peptide_5|90_aa MGIIKEEEEKEEKEERRRKKKEEEEEEEEEEEKLAEHDDVCSTRAQNWTENEIDKLTEVG FRRWVITNSSELKEHVLTQCKEAKNLDERL >gi568815596r:61378823_61633897|GENSCAN_predicted_CDS_5|273_bp atggggataataaaagaggaagaagaaaaggaagagaaagaggaaagaagaaggaagaag aaggaagaagaagaagaggaagaagaggaagaagaaaaattagctgagcatgatgatgta tgctctacaagggcacaaaactggaccgagaatgagattgacaaattgacagaagtaggc ttcagaaggtgggtaataacaaactcctctgagctaaaggagcatgttctaacccaatgc aaggaagctaagaaccttgatgaaaggttatag >gi568815596r:61378823_61633897|GENSCAN_predicted_peptide_6|114_aa MPWELLHSEVGLKALALVPAVFMEFQICAELPASGGVGVYVLPKQAPELMFFTSKRDPWI ANPPICGMGNDARIPSFNRTQRQRGERGTVAPRHWGMTQTNLVNSAEYSVNPYI >gi568815596r:61378823_61633897|GENSCAN_predicted_CDS_6|345_bp atgccctgggagttgctgcattccgaagtaggactgaaggcgcttgccctggtcccagct gttttcatggagttccaaatatgtgctgaacttcctgccagtggaggggtaggagtatat gtgctgccaaagcaagcaccagagctgatgttttttacgtcaaaaagagatccatggatt gcaaacccacccatctgtggcatggggaatgatgccagaataccctcctttaacagaacc caaaggcaaagaggtgagagaggaacggtggccccacgtcactggggaatgacacagacc aatttagtcaattcagcagaatattcagtcaatccatatatctga