GENSCAN 1.0    Date run: 5-Nov-116    Time: 05:12:55

Sequence gi568815597f:235235265_235442769 : 207505 bp : 42.40% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.07 Intr -   5187   5049  139  2  1  103   97   143 0.995  16.25
 1.06 Intr -  11247  11156   92  0  2   78   92    57 0.998   2.97
 1.05 Intr -  17545  17466   80  2  2   49  100   103 0.986   5.95
 1.04 Intr -  20486  20396   91  2  1   34   82   125 0.992   5.15
 1.03 Intr -  25488  25378  111  0  0   98   87   112 0.997  11.76
 1.02 Intr -  45402  45256  147  0  0   60   55   208 0.928  14.21
 1.01 Init -  45856  45809   48  1  0   54   63    45 0.592  -0.38
 1.00 Prom -  46577  46538   40                               -3.75

 2.00 Prom +  53386  53425   40                               -5.65
 2.01 Init +  54840  54852   13  2  1   37  111    20 0.512  -0.32
 2.02 Term +  60877  61073  197  2  2  -43   39   274 0.654   5.99
 2.03 PlyA +  61121  61126    6                                1.05

 3.00 Prom +  79513  79552   40                               -6.45
 3.01 Sngl +  91429  91959  531  0  0   49   48   449 0.558  32.81
 3.02 PlyA +  91977  91982    6                               -4.04

 4.04 PlyA -  92056  92051    6                              -10.56
 4.03 Term -  92902  92299  604  0  1   -9   44   431 0.642  22.00
 4.02 Intr -  94210  93986  225  0  0   68  -10   170 0.422   1.38
 4.01 Init -  97008  96971   38  0  2   86   76    37 0.440   1.83
 4.00 Prom -  97179  97140   40                               -6.85

 5.00 Prom +  98801  98840   40                               -5.35
 5.01 Init + 106768 107479  712  0  1   52   60   441 0.025  32.80
 5.02 Intr + 127156 127258  103  2  1   83   36    71 0.013  -0.29
 5.03 Intr + 132115 132240  126  1  0   84   99    89 0.059   8.47
 5.04 Intr + 144755 144885  131  2  2   83  115   181 0.201  19.72
 5.05 Term + 144960 144973   14  1  2   77   43    17 0.613  -6.51
 5.06 PlyA + 147002 147007    6                                1.05

 6.03 PlyA - 147799 147794    6                                1.05
 6.02 Term - 149378 148753  626  0  2   46   42   283 0.608  13.06
 6.01 Init - 150174 149715  460  0  1   53   20   179 0.567   3.56
 6.00 Prom - 150261 150222   40                               -6.15

 7.03 PlyA - 150430 150425    6                                1.05
 7.02 Term - 151309 150657  653  2  2  -65   43   321 0.288   5.51
 7.01 Init - 151655 151355  301  2  1   88   -8   283 0.298  16.16
 7.00 Prom - 152382 152343   40                               -8.95

 8.00 Prom + 161231 161270   40                               -4.95
 8.01 Init + 164567 164627   61  1  1   61   62    14 0.116  -2.44
 8.02 Intr + 166239 166323   85  1  1   97   93    43 0.251   3.66
 8.03 Intr + 179169 179354  186  0  0   93  115   213 0.978  22.38
 8.04 Term + 182205 182322  118  0  1  -22   39   168 0.411  -2.07
 8.05 PlyA + 183626 183631    6                                1.05

 9.03 PlyA - 184067 184062    6                                1.05
 9.02 Term - 188138 187921  218  0  2   28   47   168 0.517   3.02
 9.01 Init - 190485 190296  190  0  1   27   60   215 0.968  11.82
 9.00 Prom - 193185 193146   40                               -6.85

10.00 Prom + 195781 195820   40                               -6.05
10.01 Init + 197721 197907  187  2  1   85   92   253 0.988  24.67
10.02 Intr + 198795 198947  153  1  0   68   66    92 0.473   4.12
10.03 Intr + 201280 201344   65  2  2  101   71    52 0.382   2.32
10.04 Intr + 202058 202210  153  1  0   75   83   173 0.652  14.85
10.05 Intr + 203505 203687  183  2  0   -3   41   207 0.129   5.96

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Sngl + 106768 107508  741  0  0   52   35   477 0.869  34.75

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815597f:235235265_235442769|GENSCAN_predicted_peptide_1|236_aa
MLYRVVLVRSHLDGKLAPSKEGPLAVLRLGTGVLSAILATCPPEAAILGSGRSRLRGEER
RGRNPALDEPPYLTVGTDVSAKYRGAFCEAKIKTAKRLVKVKVGAIVEVKNLDGAYQEAV
INKLTDASWYTVVFDDGDEKTLRRSSLCLKGERHFAESETLDQLPLTNPEHFGTPVIGKK
TNRGRRSNHIPEEESSSSSSDEDEDDRKQIDELLGKVVCVDYISLDKKKALWFPAL
>gi568815597f:235235265_235442769|GENSCAN_predicted_CDS_1|708_bp
atgctgtatagagttgtcctggtgagaagccaccttgatggcaagcttgcaccgagcaag
gaggggcccctggctgttcttcgcctgggcactggtgtgctttcggcgatcctggccacc
tgcccacctgaagctgccatcttgggttctggcaggagccgtttgcgtggcgaggagcgg
agaggcaggaacccagcccttgatgagcctccctatttgacagtgggcactgatgtgagt
gctaaatacagaggagccttttgtgaagccaagatcaagacagcaaaaagacttgtcaaa
gtcaaggtaggagctattgtggaagtgaagaatcttgatggtgcatatcaggaagctgtt
atcaataaactaacagatgcgagttggtacactgtagtttttgatgacggagatgagaag
acactgagacgatcttcactgtgcctgaaaggagagaggcattttgctgaaagtgaaaca
ttagaccagctcccactcaccaaccctgagcattttggcactccagtcataggaaagaaa
acaaatagaggaagaagatctaatcatataccagaggaagagtcttcatcatcctccagt
gatgaagatgaggatgataggaaacagattgatgagctactaggcaaagttgtatgtgta
gattacattagtttggataaaaagaaagcactgtggtttcctgcattg
>gi568815597f:235235265_235442769|GENSCAN_predicted_peptide_2|69_aa
MYTAEQKVNNILVFTVDVKANKYQIKQAKKKLCGTDMANINTLIRPDGKKAHVLPVPDYD
DVANKIGII
>gi568815597f:235235265_235442769|GENSCAN_predicted_CDS_2|210_bp
atgtacacggcagaacagaaagtcaacaacattcttgtgttcactgtggatgtcaaggcc
aacaaataccagatcaaacaggctaagaaaaagctctgtggcactgacatggccaatatc
aacaccctgatcaggcctgatggaaagaaggcccatgttctaccggttcctgactatgat
gacgttgccaacaaaattgggatcatctaa
>gi568815597f:235235265_235442769|GENSCAN_predicted_peptide_3|176_aa
MGESSSVETSYNCLQLHSKARQLTPVSLGKGSVTKSPNTPSSAKLGSPTRRPRRNLLLPE
FDNPRDSFSAGNLTFMMTLGPRYPLKHQVQLHLEGKQKTPSQHHRSPSALAEAEGKKRAT
SEPRRKSRNQAPAPEPSHADSGLPPAPPSPQTQPPTQLGSFPSPLSRPDGVSRLRQ
>gi568815597f:235235265_235442769|GENSCAN_predicted_CDS_3|531_bp
atgggagaatcatcttcagttgagacatcctataactgcctccaactccacagcaaagcc
cgccagctgaccccggtctctctggggaagggctcggtcaccaagtcaccaaacacacct
tcctctgcaaaactcggaagccccacacgacgacctcgtcgaaacctcctacttcctgaa
ttcgataaccccagagattcgttttccgcaggcaatcttaccttcatgatgactctggga
ccaaggtatcctctaaaacaccaggttcagctgcacctggaggggaaacaaaagacaccc
agtcaacaccacaggagcccctctgcactggcggaggcggagggaaagaagcgagcgacg
tccgaaccccgaagaaagagccgcaaccaggcgccagcccccgaaccatcacacgccgac
tcggggctccctccggccccgccatccccgcaaacccaaccacctacacagctcggcagc
ttcccgtccccattgtcgagacccgacggagtttcccgtctacgacaatga
>gi568815597f:235235265_235442769|GENSCAN_predicted_peptide_4|288_aa
MECGMIDNGDSKGSQIPSFSYHRGTLLWGTATAVISHRFVQHLSVSPGAGLRRQRPPQWS
ALNTHEEMKAPSNALDDETRAHNLASKNRVRSEELGRRSGGRLLSFILPPPRPPPGPLPG
GSCRGSIAAVLWRAARLGARTSSPGGIFRRPPPPNQGARAAAKQRYQSPPREEEEPEPLP
QQPLDPPPFFPISPPGLLVLGGRRREGTLDVPGSDLASEEGRDIFWGFRGLLRGFFWESL
GPAKEQRKIAVVVPAAPEFGASASREPPPPAAAVSEPPPGWAAAHSAV
>gi568815597f:235235265_235442769|GENSCAN_predicted_CDS_4|867_bp
atggagtgtggaatgatagacaatggagattccaaaggaagccaaattccctcattctcc
tatcaccgtggaactctcctctggggcactgccactgcggtaatttcacatcgttttgtt
cagcatctgtccgtctcaccaggcgcaggtctacgaaggcagaggccaccccagtggtcg
gcgctcaacacccatgaagaaatgaaagcgccttctaacgcgttagacgatgaaaccagg
gcacacaaccttgcgtctaagaatcgagtccggagtgaggagctcggtcgccgaagcgga
gggagactcttgagcttcatcttgccgccgccacggccaccgcctggacctttgcccgga
gggagctgcagagggtccatcgccgccgtcctctggagggcagcgcgattgggggcccgg
acctccagtccgggggggatttttcgtcgtccccctccccccaaccagggagcccgagcg
gccgccaaacaaaggtaccagtcgccgccgcgggaggaggaggagccggagcctctgcct
cagcagccgctggacccgccgcccttcttccccatctctcccccgggcctgctggttttg
ggggggagaaggagagaggggactctggacgtgccagggtcagatctcgcctccgaggaa
ggtagggatattttctggggctttcgtggtctcctaagggggttcttttgggagtcgctg
ggcccggccaaggagcagaggaagatcgcggtggtggtccctgcggcgcccgaattcggg
gcctcggcctcccgggaacccccaccccccgcagccgctgtgtccgagccgccccctggc
tgggcggccgcacactcagcggtttag
>gi568815597f:235235265_235442769|GENSCAN_predicted_peptide_5|361_aa
MLHNASLLIDDIEDNSKLRRGFPVAHSIYGIPSVINSANYVYFLGLEKVLTLDHPDAVKL
FTRQLLELHQGQGLDIYWRDNYTCPTEEEYKAMVLQKTGGLFGLAVGLMQLFSDYKEDLK
PLLNTLGLFFQIRDDYANLHSKEYSENKSFCEDLTEGKFSFPTIHAIWSRPESTQVQNIL
RQRTENIDIKKYCVHYLEDVGSFEYTRNTLKELEAKAYKQIDARGGNPELVALVKHLRLP
LSTRQIKQELAEEYETTKSPVPPAYSLQLLLSSRTSLRWRRDFLQHFRPEPQASLLGSWL
EGLLLGTPGVSAGRSHILDSGYIIMSDTLTADVIGRRVEVNGEHATVRFAGVVPPVAAVS
V
>gi568815597f:235235265_235442769|GENSCAN_predicted_CDS_5|1086_bp
atgttgcataatgccagtttactcatcgatgatattgaagacaactcaaaactccgacgt
ggctttccagtggcccacagcatctatggaatcccatctgtcatcaattctgccaattac
gtgtatttccttggcttggagaaagtcttaacccttgatcacccagatgcagtgaagctt
tttacccgccagcttttggaactccatcagggacaaggcctagatatttactggagggat
aattacacttgtcccactgaagaagaatataaagctatggtgctgcagaaaacaggtgga
ctgtttggattagcagtaggtctcatgcagttgttctctgattacaaagaagatttaaaa
ccgctacttaatacacttgggctctttttccaaattagggatgattatgctaatctacac
tccaaagaatatagtgaaaacaaaagtttttgtgaagatctgacagagggaaagttctca
tttcctactattcatgctatttggtcaaggcctgaaagcacccaggtgcagaatatcttg
cgccagagaacagaaaacatagatataaaaaaatactgtgtacattatcttgaggatgta
ggttcttttgaatacactcgtaatacccttaaagagcttgaagctaaagcctataaacag
attgatgcacgtggtgggaaccctgagctagtagccttagtaaaacacttaaggttaccc
ctaagcaccagacagatcaaacaggagcttgctgaggagtatgagaccactaagagtcca
gtgcccccagcctacagcctccaactcctcctaagcagccgtacctcactccggtggagg
cgggacttcctacagcacttccggccagagcctcaagcttcgctgctgggcagttggctg
gaggggctgctgctgggaacacctggagtctccgcgggcagatctcatattttggattct
ggatatattataatgagtgacactttgacagcggatgtcattggtcgaagagttgaagtt
aatggagaacatgcaacagtacgttttgctggtgttgtccctcccgtggcagctgtttct
gtgtaa
>gi568815597f:235235265_235442769|GENSCAN_predicted_peptide_6|361_aa
MIMGDFNTPLSTLDRSTRQKVNKDSQELNSALHQADVIDIYRTLHPKSTEYTFFSAPHHT
YSKIDHIVGSKALLSKCKRSDIITNCLSDHSAIKLEFRIKKLTQNRSTTWKLNNLLLNDY
WVHNEMKTEIKMFFETNENKDTTYQNLWDNIQSKIQTTIREYYKHLYANKLENLEEMDKF
LDTYTLPRLNQEEVESLNRPITGSEIVAIINSLPTKKSPGPDGFTAEFYQRYKEEQVPFL
LKLFQSIEKEGILPNSFYEASIILIPKPGRDTTKKENFRPISLMNIDAKILNKILANRIQ
QHIKKLIHHDQVGFIPGMQGWFNICKSINVIQHINRTKDKNHMIISIDAEKAFDKIQQPS
C
>gi568815597f:235235265_235442769|GENSCAN_predicted_CDS_6|1086_bp
atgataatgggagactttaacaccccactgtcaacattagacagatcaacgagacagaaa
gttaacaaggatagccaggaattgaactcagctctgcaccaagcggacgtaatagacatc
tacagaactctccaccccaaatcaacagaatatacatttttttcagcaccacaccacacc
tattccaaaattgaccacatagttggaagtaaagctctcctcagcaaatgtaaaagatca
gacattataacaaactgtctctcagaccacagtgcaatcaaactagaattcaggattaag
aaactcactcaaaaccgctcaactacatggaaactgaacaacctgctcctgaatgactac
tgggtacataatgaaatgaagacagaaataaagatgttctttgaaaccaacgagaacaaa
gacacaacataccagaatctctgggacaacattcaaagcaaaatacaaactaccatcaga
gaatactacaaacacctctacgcaaataaactagaaaatctagaagaaatggataaattc
ctcgacacatacaccctcccaagactaaaccaggaagaagttgaatctctgaatagacca
ataacaggctctgaaattgtggcaataatcaatagcttaccaaccaaaaagagtccagga
ccagatggattcacagccgaattctaccagaggtacaaggaggaacaggtaccattcctt
ctgaaactattccaatcaatagaaaaagagggaatcctccctaactcattttatgaggcc
agcatcatcctgataccaaagccaggcagagacacaaccaaaaaagagaattttagacca
atatccttgatgaacattgatgcaaaaatcctcaataaaatactggcaaaccgaatccag
cagcacatcaaaaagcttatccaccatgatcaagtgggcttcatccctgggatgcaaggc
tggttcaatatatgcaaatcaataaatgtaatccagcatataaacagaaccaaagacaaa
aaccacatgattatctcaatagatgcagaaaaggcctttgacaaaattcaacaaccttca
tgctaa
>gi568815597f:235235265_235442769|GENSCAN_predicted_peptide_7|317_aa
MGKKQSRKTGNSKKQSASPPPKEHSSSPATEQSWTENDFDKLREEGFRRSNYSELQEEIQ
TKGKEVKNFEKNLDECITRITNTEKCLKELMELKAKAQELQRVSVMEDEMNEMKREGKFR
EKRIKRNEQSLQEIWDYVKKPNLRLIGVPESDGENGTKLENTLQDIIQENFPNLARQANN
QIQKIQRMPQRYSSRRATPRHIIVRFTKVEMKEKMLRAARDKGKPIRLTADLLAETLQAR
REWEPIFNILKEKNFQPRISYPAKLSFISEGEIKYFTDKQILRDFVTTRPALKELLKEAL
NMERNNWYQPLQNHAKL
>gi568815597f:235235265_235442769|GENSCAN_predicted_CDS_7|954_bp
atggggaaaaaacagagcagaaaaactggaaactctaaaaagcagagcgcctctcctcct
ccaaaggaacacagctcctcaccagcaacggaacaaagctggacagagaatgactttgac
aaattgagagaagaaggcttcagacgatcaaactactccgagctacaggaggaaattcaa
accaaaggcaaagaagttaaaaactttgaaaaaaatttagacgaatgtataactagaata
accaatacagagaagtgcttaaaggagctgatggagctgaaagccaaggctcaagaacta
caaagggtatcagtgatggaagatgaaatgaatgaaatgaagcgagaagggaagtttaga
gaaaaaagaataaaaagaaatgaacaaagcctccaagaaatatgggactatgtgaaaaaa
ccaaatctacgtctgattggtgtacctgaaagtgacggggagaatggaaccaagttggaa
aacactctgcaggatattatccaggagaacttccccaatctagcaaggcaagccaacaat
cagattcagaaaatacagagaatgccacaaagatactcctcgagaagagcaactccaaga
cacataattgtcagattcaccaaagttgaaatgaaggaaaaaatgttaagggcagccaga
gacaaagggaagcccatcagactaacagcggatctcttggcagaaactctacaagccaga
agagagtgggagccaatattcaacattcttaaagaaaagaattttcaacccagaatttca
tatccagccaaactaagcttcataagtgaaggagaaataaaatactttacagacaagcaa
atactgagagattttgtcaccaccaggcctgccctaaaagagctcctgaaggaagcacta
aacatggaaaggaacaactggtaccagccactgcaaaatcatgccaaattgtaa
>gi568815597f:235235265_235442769|GENSCAN_predicted_peptide_8|149_aa
MSSQIVSELNWRTPRWHPLLGPWLGVEWDNPERGKHDGSHEGTVYFKCRHPTGGSFIRPN
KVNFGTDFLTAIKNRYVLEDGPEEDRKEQIVTIGNKPVETIGFDSIMKQQSKYLLNQCGT
GNGIGGVQSDSRVWEVGQCPADTELTGHK
>gi568815597f:235235265_235442769|GENSCAN_predicted_CDS_8|450_bp
atgtccagtcagatagtgtcagaattgaattggcggacacccagatggcatcccctgctt
ggaccctggttaggagtagaatgggacaatcccgagagaggaaagcatgatgggagccac
gaagggactgtgtattttaaatgcaggcacccgacaggaggatcctttattcgtccgaac
aaggtaaattttggaacagactttcttactgcaattaagaaccgctatgtgttagaagat
ggaccagaggaagatagaaaagagcaaattgttacaattggaaataaacctgtggagact
atcggttttgactctattatgaaacagcaaagcaagtatctcttaaatcagtgtggaact
ggaaatggaattggtggcgtccagtctgattccagggtttgggaagttgggcagtgccca
gcagacactgagttgacaggacataagtaa
>gi568815597f:235235265_235442769|GENSCAN_predicted_peptide_9|135_aa
MTGTASACVTRAEGLGGTRNKVTRNKGPGHITKGLAGHFKVCLKRTFQSLLETGDIGGFE
TATELLFLRSVQSTQTTYILKSTTNNSCQYVVAPPTHTVQGAPVKAQAKHLGTDSPPQVM
REEEAHPLPWKYPQI
>gi568815597f:235235265_235442769|GENSCAN_predicted_CDS_9|408_bp
atgaccgggaccgccagcgcctgtgtgactcgagcagaggggcttgggggtactagaaac
aaggtcacaaggaataaggggccaggccacatcacaaagggccttgcaggccacttcaaa
gtctgtttgaaacggacttttcaaagtttgcttgaaacgggagatattggaggctttgaa
acagcgacagagttattgtttctaaggtctgtacagagcacacagaccacctacatcctg
aaatcaaccacgaacaacagctgtcaatatgtggtcgcccctcccacacacaccgttcaa
ggagcccccgtgaaagcacaggccaagcacctgggcacagactctcctccgcaggtcatg
agggaggaggaagctcatcccctcccctggaagtaccctcagatttga
>gi568815597f:235235265_235442769|GENSCAN_predicted_peptide_10|247_aa
MQKDASKFVDLCVLQKCSTSNCIISAKDHTSMRMNVAKASEVTGRFNSQFKTCAICRTVC
RMGLKQSYELSVRHLFAETVTELNSCGGSYEHPGRNSQQRPPEPEPSFSSRCCGCKTSMF
PSLKYLVVNDNQISQWSFFNELEKLPSLRALSCLRNPLTKEDKEAETARLLIIASIGQLK
TLNKCEILPEERRRAELDYRKAFGNEWKQAGGHKDPEKNRLSEEFLTAHPRYQFLCLSTC
VYTGGLQ
>gi568815597f:235235265_235442769|GENSCAN_predicted_CDS_10|741_bp
atgcagaaagacgccagcaagttcgtggatctgtgcgtgctgcagaaatgctccaccagc
aactgcatcatcagtgccaaggaccacacatccatgcggatgaacgtggccaaggccagt
gaggtcacgggcaggtttaacagccagtttaaaacctgtgctatctgcaggactgtttgc
aggatgggcctgaaacagtcttatgaactgtccgtgaggcacttgtttgctgaaacagta
actgagctgaattcttgtggtggttcctatgaacacccgggaagaaacagccagcagagg
ccgcctgagcctgaaccgagtttctcttccaggtgctgcgggtgcaaaacgtccatgttc
ccatccttgaagtacctggtagtaaacgacaatcagatatcacaatggtcgtttttcaat
gagctagagaagttaccaagtctacgggctttgtcctgcctaagaaaccccctgaccaaa
gaggacaaagaagcagagacggcgcgactactcattatcgccagcattggccagctgaag
acgctgaacaaatgtgagattctccccgaggagaggcggagagctgagcttgactaccga
aaagcttttggaaatgagtggaaacaggctggtggacataaggatccggaaaaaaacaga
ctcagcgaagaattcctcacagcccatcccagataccagttcctctgcctgagtacgtgc
gtatacactggtggccttcag