GENSCAN 1.0 Date run: 8-Nov-116 Time: 17:32:29 Sequence gi568815593f:76719183_76933798 : 214616 bp : 44.12% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Term + 13181 14321 1141 2 1 69 49 866 0.005 72.23 1.02 PlyA + 14552 14557 6 1.05 2.00 Prom + 33580 33619 40 -4.06 2.01 Init + 100001 100082 82 1 1 61 100 166 0.939 14.23 2.02 Term + 113508 114619 1112 1 2 137 44 744 0.968 67.12 2.03 PlyA + 115451 115456 6 1.05 3.04 PlyA - 115732 115727 6 1.05 3.03 Term - 128352 128282 71 1 2 72 42 89 0.235 0.80 3.02 Intr - 141130 140945 186 2 0 102 39 89 0.406 5.06 3.01 Init - 142925 142913 13 2 1 83 87 -1 0.518 -0.35 3.00 Prom - 151203 151164 40 -5.16 4.00 Prom + 151637 151676 40 -5.16 4.01 Init + 156178 156318 141 0 0 107 109 130 0.856 17.14 4.02 Term + 158492 158650 159 1 0 61 43 192 0.998 9.94 4.03 PlyA + 159380 159385 6 1.05 5.11 PlyA - 160783 160778 6 1.05 5.10 Term - 164574 164458 117 0 0 30 48 95 0.336 -1.76 5.09 Intr - 165418 165144 275 2 2 94 80 77 0.560 4.76 5.08 Intr - 167330 167263 68 1 2 16 114 54 0.032 -0.65 5.07 Intr - 174713 174623 91 0 1 63 77 79 0.047 3.35 5.06 Intr - 189097 189018 80 0 2 45 97 58 0.001 1.59 5.05 Intr - 189990 189805 186 2 0 48 53 131 0.001 4.40 5.04 Intr - 192721 192658 64 2 1 68 72 15 0.000 -4.32 5.03 Intr - 195555 195430 126 1 0 43 127 45 0.001 4.55 5.02 Intr - 195813 195611 203 2 2 69 62 95 0.001 3.93 5.01 Intr - 205971 205883 89 0 2 109 54 30 0.004 0.47 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init + 10840 10849 10 0 1 77 93 4 0.817 0.57 S.002 Term + 13132 14321 1190 2 2 93 49 885 0.990 77.01 S.003 Term + 194757 194923 167 1 2 99 43 165 0.892 11.18 S.004 Sngl + 201326 201625 300 1 0 72 42 162 0.879 5.80 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815593f:76719183_76933798|GENSCAN_predicted_peptide_1|380_aa XNPNDKYEPFWEDEEKNESGLTEYRLVSINKSSPLQKQLPAFISEDASGYLTSSWLTLFV PSVYTGVFVVSLPLNIMAIVVFILKMKVKKPAVVYMLHLATADVLFVSVLPFKISYYFSG SDWQFGSELCRFVTAAFYCNMYASILLMTVISIDRFLAVVYPMQSLSWRTLGRASFTCLA IWALAIAGVVPLLLKEQTIQVPGLNITTCHDVLNETLLEGYYAYYFSAFSAVFFFVPLII STVCYVSIIRCLSSSAVANRSKKSRALFLSAAVFCIFIICFGPTNVLLIAHYSFLSHTST TEAAYFAYLLCVCVSSISCCIDPLIYYYASSECQRYVYSILCCKESSDPSSYNSSGQLMA SKMDTCSSNLNNSIYKKLLT >gi568815593f:76719183_76933798|GENSCAN_predicted_CDS_1|1143_bp nngaaccccaatgataaatatgaaccattttgggaggatgaggagaaaaatgaaagtggg ttaactgaatacagattagtctccatcaataaaagcagtcctcttcaaaaacaacttcct gcattcatctcagaagatgcctccggatatttgaccagctcctggctgacactctttgtc ccatctgtgtacaccggagtgtttgtagtcagcctcccactaaacatcatggccatcgtt gtgttcatcctgaaaatgaaggtcaagaagccggcggtggtgtacatgctgcacctggcc acggcagatgtgctgtttgtgtctgtgctcccctttaagatcagctattacttttccggc agtgattggcagtttgggtctgaattgtgtcgcttcgtcactgcagcattttactgtaac atgtacgcctctatcttgctcatgacagtcataagcattgaccggtttctggctgtggtg tatcccatgcagtccctctcctggcgtactctgggaagggcttccttcacttgtctggcc atctgggctttggccatcgcaggggtagtgcctctgctcctcaaggagcaaaccatccag gtgcccgggctcaacatcactacctgtcatgatgtgctcaatgaaaccctgctcgaaggc tactatgcctactacttctcagccttctctgctgtcttcttttttgtgccgctgatcatt tccacggtctgttatgtgtctatcattcgatgtcttagctcttccgcagttgccaaccgc agcaagaagtcccgggctttgttcctgtcagctgctgttttctgcatcttcatcatttgc ttcggacccacaaacgtcctcctgattgcgcattactcattcctttctcacacttccacc acagaggctgcctactttgcctacctcctctgtgtctgtgtcagcagcataagctgctgc atcgaccccctaatttactattacgcttcctctgagtgccagaggtacgtctacagtatc ttatgctgcaaagaaagttccgatcccagcagttataacagcagtgggcagttgatggca agtaaaatggatacctgctctagtaacctgaataacagcatatacaaaaagctgttaact tag >gi568815593f:76719183_76933798|GENSCAN_predicted_peptide_2|397_aa MRSPSAAWLLGAAILLAASLSCSGTIQGTSRSSKGRSLIGKVDGTSHVTGKGVTVETVFS VDEFSASVLTGKLTTVFLPIVYTIVFVVGLPSNGMALWVFLFRTKKKHPAVIYMANLALA DLLSVIWFPLKIAYHIHGNNWIYGEALCNVLIGFFYGNMYCSILFMTCLSVQRYWVIVNP MGHSRKKANIAIGISLAIWLLILLVTIPLYVVKQTIFIPALNITTCHDVLPEQLLVGDMF NYFLSLAIGVFLFPAFLTASAYVLMIRMLRSSAMDENSEKKRKRAIKLIVTVLAMYLICF TPSNLLLVVHYFLIKSQGQSHVYALYIVALCLSTLNSCIDPFVYYFVSHDFRDHAKNALL CRSVRTVKQMQVSLTSKKHSRKSSSYSSSSTTVKTSY >gi568815593f:76719183_76933798|GENSCAN_predicted_CDS_2|1194_bp atgcggagccccagcgcggcgtggctgctgggggccgccatcctgctagcagcctctctc tcctgcagtggcaccatccaaggaaccagtagatcctctaaaggaagaagccttattggt aaggttgatggcacatcccacgtcactggaaaaggagttacagttgaaacagtcttttct gtggatgagttttctgcatctgtcctcactggaaaactgaccactgtcttccttccaatt gtctacacaattgtgtttgtggtgggtttgccaagtaacggcatggccctgtgggtcttt cttttccgaactaagaagaagcaccctgctgtgatttacatggccaatctggccttggct gacctcctctctgtcatctggttccccttgaagattgcctatcacatacatggcaacaac tggatttatggggaagctctttgtaatgtgcttattggctttttctatggcaacatgtac tgttccattctcttcatgacctgcctcagtgtgcagaggtattgggtcatcgtgaacccc atggggcactccaggaagaaggcaaacattgccattggcatctccctggcaatatggctg ctgattctgctggtcaccattcctttgtatgtcgtgaagcagaccatcttcattcctgcc ctgaacatcacgacctgtcatgatgttttgcctgagcagctcttggtgggagacatgttc aattacttcctctctctggccattggggtctttctgttcccagccttcctcacagcctct gcctatgtgctgatgatcagaatgctgcgatcttctgccatggatgaaaactcagagaag aaaaggaagagggccatcaaactcattgtcactgtcctggccatgtacctgatctgcttc actcctagtaaccttctgcttgtggtgcattattttctgattaagagccagggccagagc catgtctatgccctgtacattgtagccctctgcctctctacccttaacagctgcatcgac ccctttgtctattactttgtttcacatgatttcagggatcatgcaaagaacgctctcctt tgccgaagtgtccgcactgtaaagcagatgcaagtatccctcacctcaaagaaacactcc aggaaatccagctcttactcttcaagttcaaccactgttaagacctcctattga >gi568815593f:76719183_76933798|GENSCAN_predicted_peptide_3|89_aa MRKKDEVKVAGVDVASILATVAMAAESSRASVLGGHNGSGLLIWPILFCGFGHHSWKIHL KLSKEFFLDAISKIQTMGKIYKQPGFFNK >gi568815593f:76719183_76933798|GENSCAN_predicted_CDS_3|270_bp atgagaaaaaaagatgaggtgaaagttgctggagtcgatgtagcatctatactggctaca gtggcaatggctgcagagtcaagcagagcctcggtgcttggtggccacaatgggagcggt cttctcatctggccaattctgttctgtggttttggacatcattcctggaagattcacctg aagctctcaaaagagttctttctggatgcaatcagcaaaatccagactatggggaaaatc tacaaacaacctggtttcttcaacaagtaa >gi568815593f:76719183_76933798|GENSCAN_predicted_peptide_4|99_aa MPTQLEMAMDTMIRIFHRYSGKERKRFKLSKGELKLLLQRELTEFLSCQKETQLVDKIVQ DLDANKDNEVDFNEFVVMVAALTVACNDYFVEQLKKKGK >gi568815593f:76719183_76933798|GENSCAN_predicted_CDS_4|300_bp atgcccacccagctcgagatggccatggacaccatgattagaatcttccaccgctattct ggcaaggaaaggaagagattcaagctcagcaagggggaactgaaactgctcctgcagcga gagctcacggaattcctctcgtgccaaaaggaaacccagttggttgataagatagtgcag gacctggatgccaataaggacaacgaagtggattttaatgaattcgtggtcatggtggca gctctgacagttgcttgtaatgattactttgtagaacaattgaagaagaaaggaaaataa >gi568815593f:76719183_76933798|GENSCAN_predicted_peptide_5|432_aa PLIFGDPHIISSPRDRTLWQETEAFCQQHDGFLVLLTSRTKPQTLAVSVTVLKDGPEFVP SDVQMYPEFLPSGGFVVSLTSGVKLQTFTVRVTALKGGVKLQTFAVSVTALKGGTSGVAF SSWWVHGLSGFRSEAADLHGIHSQQKPVIPRNPRNCFNVLGTRNGPWSLLIGIAVLTDAV AENTSCPRTHNGPWTLLIGIVALTNTAAETLVFLLDHKEDQRRTPKLNGFLSVDPWLSPE VQEKQKLVPGKSENTIQDLQKEIVETQGAPLVDEGEAEKRDPPTTSGPQTNQPKEHLTDF KSDLSQISQRLGSFSSDPTKYIQEFRYLTLSYNLTWSDLNVILTSTLSPDEWERVFSPAQ SHTDNHWLHEPDLQEGIRAVPREDPNGTIRQIPQWKSDCPTHLAATPGAPGTLAQGSLTD SFPDILSLVAKD >gi568815593f:76719183_76933798|GENSCAN_predicted_CDS_5|1299_bp ccactcatctttggggaccctcatatcataagcagtcctagggatcggaccttgtggcaa gaaactgaagcattctgccaacaacatgatggattcttggtcttgctgacttcaagaaca aagccacagacccttgcggtgagtgttacagttcttaaagatggtccggagtttgttcct tcagatgttcagatgtatccggagtttcttccttctggtgggttcgtggtctcgctgact tcaggagtgaagcttcagaccttcactgtgagggttacagctcttaaaggtggagtgaag ctgcagaccttcgcggtgagtgttacagctcttaaaggtggcacgtctggagttgctttt tcctcctggtgggtccatggtctcagtggcttcaggagtgaagctgcagaccttcacggg atccatagtcagcaaaagccggtgattccaaggaacccccgcaactgttttaatgtctta ggaacccgtaacggtccctggagcctgctgattggaatagctgtgctcaccgatgcagta gccgaaaacacctcttgtccaagaacccacaatggtccctggaccctgctgatcggaata gttgcgcttaccaacacagcagcagaaacactagttttcctcctagaccacaaggaggac caaagaaggaccccgaagctgaatggcttcctctccgttgacccttggctcagcccagaa gtacaggaaaagcagaaactggttccaggaaaatcagagaacacaatccaagatttgcaa aaggagatagtagaaacccagggtgcacccttagttgatgaaggagaagctgagaaaaga gatccacctacgacctcgggtcctcagaccaaccagcccaaggaacatctcaccgatttt aaatcggacctttcccaaatcagccagcgtttaggctctttctcatcagaccccactaaa tatatacaggaattccgatatctaactctgtcctacaatttaacctggagtgacttaaat gtcatcctgacttctaccctctccccggatgaatgggaaagagttttttctccagcccaa tctcacactgataaccactggcttcatgagccagacctccaggaaggcattagagcagtt ccccgagaagatcccaatggaactatcaggcagattccccagtggaaatcggactgtcca actcacctggcagccactcccggagcccctggaactctggcccaaggctctctgactgac tcattcccagatattctcagcttagtggctaaagactga