GENSCAN 1.0 Date run: 5-Nov-116 Time: 20:48:38 Sequence gi568815593f:142008975_142244671 : 235697 bp : 44.14% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.02 Intr - 3067 2875 193 1 1 116 86 207 0.498 22.37 1.01 Init - 22995 22876 120 0 0 45 106 111 0.932 8.79 1.00 Prom - 23404 23365 40 -2.06 2.02 PlyA - 25804 25799 6 1.05 2.01 Sngl - 28151 27591 561 2 0 55 37 238 0.987 11.78 2.00 Prom - 32939 32900 40 -2.46 3.00 Prom + 33631 33670 40 -9.85 3.01 Sngl + 33672 35864 2193 2 0 26 48 723 0.827 56.05 3.02 PlyA + 35940 35945 6 1.05 4.00 Prom + 49916 49955 40 -5.26 4.01 Init + 50654 50708 55 1 1 72 77 28 0.304 1.55 4.02 Intr + 56467 56626 160 2 1 79 3 102 0.608 -0.15 4.03 Intr + 57368 57500 133 2 1 54 115 67 0.684 6.65 4.04 Intr + 59184 59304 121 2 1 111 69 27 0.738 3.17 4.05 Term + 66087 66181 95 1 2 85 49 62 0.356 -0.01 4.06 PlyA + 69246 69251 6 1.05 5.03 PlyA - 69337 69332 6 1.05 5.02 Term - 71655 71516 140 1 2 92 48 78 0.521 2.33 5.01 Init - 91969 91888 82 1 1 75 100 43 0.195 5.33 5.00 Prom - 92430 92391 40 -5.76 6.00 Prom + 93491 93530 40 -5.46 6.01 Init + 100001 100063 63 1 0 105 115 159 0.917 20.26 6.02 Intr + 100132 100263 132 0 0 83 75 42 0.735 3.24 6.03 Intr + 122840 122921 82 1 1 45 96 94 0.436 5.11 6.04 Intr + 123238 123368 131 2 2 29 70 93 0.623 2.01 6.05 Intr + 126756 126843 88 2 1 84 89 26 0.866 1.84 6.06 Intr + 128760 128884 125 1 2 105 110 -16 0.410 2.60 6.07 Intr + 131589 131655 67 2 1 67 97 16 0.095 -1.12 6.08 Intr + 154990 155020 31 2 1 112 75 7 0.088 -1.01 6.09 Term + 155267 155345 79 2 1 146 50 33 0.416 2.54 6.10 PlyA + 156325 156330 6 1.05 7.10 PlyA - 158145 158140 6 1.05 7.09 Term - 162049 161916 134 2 2 117 43 68 0.697 3.45 7.08 Intr - 162616 162434 183 2 0 20 47 134 0.427 2.26 7.07 Intr - 171252 171128 125 2 2 116 61 19 0.008 2.23 7.06 Intr - 180957 180812 146 0 2 22 44 112 0.002 -0.72 7.05 Intr - 182888 182843 46 1 1 121 100 -7 0.019 2.21 7.04 Intr - 184230 184133 98 0 2 66 100 1 0.005 -2.09 7.03 Intr - 194290 194104 187 0 1 78 87 50 0.038 3.59 7.02 Intr - 202508 202477 32 2 2 83 100 40 0.017 1.63 7.01 Intr - 227548 227393 156 1 0 57 62 79 0.019 2.41 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815593f:142008975_142244671|GENSCAN_predicted_peptide_1|105_aa MIATAIFSLCPSDEWNITSTHTGRCSLYRKHVLETVETYKDKMKLIILEHYSQASEWAAK YIRNRIIQFNPGPEKYFTLGLPTGAFLDGEGAVDKGGESRQQGPX >gi568815593f:142008975_142244671|GENSCAN_predicted_CDS_1|315_bp atgatagcaactgccatcttttccttgtgtccctctgacgagtggaacatcaccagtact cacactggaagatgctccctatacagaaagcatgtcctagaaactgtggagacctataaa gacaagatgaagctcatcatcctggagcactattctcaggcgagcgagtgggcggctaaa tacatcaggaaccgcatcatccagtttaacccagggccagagaagtacttcaccctgggg ctccccactggtgcgttcttggatggagagggtgccgtggataagggtggagagagtagg caacaggggccagnn >gi568815593f:142008975_142244671|GENSCAN_predicted_peptide_2|186_aa MSKFGRATRGLRKPEVGDVIRTIVRAGLAMPGPPLGPVLGQRRASINQFCKEFNERTKDI KEGIPLLTKIFLKPDGTFEIKIGQPTVSYFVKAAAGIEKGARPTASVQKSLQLSRRNKPS SWLLRRRQIWPPRRKLPRSDPFPQLLDFKGGSWEGASCKGCAQGEEGGHTNMMMVFVTLN DIFLYI >gi568815593f:142008975_142244671|GENSCAN_predicted_CDS_2|561_bp atgtcaaagttcggccgggccacccggggcctcaggaagcccgaggtcggcgatgtgatc cggaccatcgtgcgggcaggcctggccatgcccgggcccccactaggcccagtgctgggt cagagaagggcttccatcaaccaattttgcaaggagttcaatgagaggacaaaggacatc aaggaaggcattcctctgcttaccaagatttttctgaagcctgacgggacatttgaaatc aagattggacaacccactgtttcctacttcgtgaaggcagcagctgggattgaaaagggg gcccggccaacagcctcagttcagaagagtttgcagctttccagaaggaacaagccatct tcctggctgctcagaagaaggcagatttggccacccaggaggaagctgccaagaagtgac cctttcccccaactcctagatttcaaaggaggcagctgggaaggggccagttgcaaaggc tgtgcccaaggggaggaaggaggtcacaccaatatgatgatggttttcgtgactttgaat gatatatttttgtacatctag >gi568815593f:142008975_142244671|GENSCAN_predicted_peptide_3|730_aa MANCLSDHSAIKLELRIKKLTQNCSTTWKLNNLLLNDCWVHKEMKAEIKMFFETNENKDT TYQNLWDTFKAVCRGKFIALNAHKRKQKRSKIDTLTSQLKELEKQEQTHSKASRRQEITK IRAELKEIETQKNLQKINESRSWFFERINKVDRPLSRLIKKKREKNQIDAIKNDKGDITT DPTEIQTTIREYYKHLYANKLENLEEMDKFLDTYTLPRLNQGEVDSLNRPITGSEIVAII NSLPTKKSPGPDGFTAKFYQRYKEELVPFLLKLFQSIEKEGILPNSFYEASIILIPKPGR DTTKKENFRPISLMNTDAKILNKILANRIQQHIKKLIHRDQVGFIPGMQGWFNICKSINV IQHIDRTKDKNHMIISIDAEKAFDKIQQPFMLKTLNKLGIDGTYLKIIRAIYDKPTANII LNGQKLEAFPLKTGTRQGCPLSPLLFNIVLEVLARAIRQEKEIKGIQLGKEEVKLSLFAD DMIVYLENPIVSAQNLLKLISNFSKVSGYKINVQKSQAFLYTNNRQTESQIMSELPFTIA SKRIKYLGIQLTRDVKDLFKETYKPLLNEIKEDTNKWKNIPCSWVGRINIVKMAILPMVI YRFNAIPIKLPMTFFTELEKTTLKFIWNQKRARIAKSILSQKNKAGGITLPDFKLYYKAT VTKTAWYWYQNRDTDQWNRTEPSEIMPHIYNYLIFDKLEKNKQWGKDSLFSKWCWENWSN HISTQLSIIC >gi568815593f:142008975_142244671|GENSCAN_predicted_CDS_3|2193_bp atggcaaactgtctctcagaccacagtgcaatcaaactagaactcaggattaagaaactc actcagaactgctcaactacgtggaaactgaacaacctgctcctgaatgactgctgggta cataaagaaatgaaggcagaaataaagatgttctttgaaaccaatgagaacaaagacaca acataccagaatctctgggacacattcaaagcagtgtgtagagggaaatttatagcacta aatgcccacaagagaaagcagaaaagatccaaaattgacaccctaacatcacaattaaaa gaactagaaaagcaagagcaaacacattcaaaagctagcagaaggcaagaaataactaaa atcagagcagaactgaaggaaatagagacacaaaaaaaccttcaaaaaattaatgaatcc aggagctggttttttgaaaggatcaacaaagttgatagaccactatcaagactaataaag aagaaaagagagaagaatcaaatagacgcaataaaaaatgataaaggggatatcaccacc gatcccacagaaatacaaactaccatcagagaatactacaaacacctctacgcaaataaa ctagaaaatctagaagaaatggataaattcctcgacacatacaccctcccaagactaaac cagggagaagttgactctctgaatagaccaataacaggctctgaaattgtggcaataatc aatagcttaccaaccaaaaagagtccaggaccagatggattcacagccaaattctaccag aggtacaaggaggaactggtaccattccttctgaaactattccaatcaatagaaaaagag ggaatcctccctaactcattttatgaggccagcatcatcctgataccaaagccaggcaga gacacaaccaaaaaagagaattttagaccaatatccttgatgaacactgatgcaaaaatc ctcaataaaatactggcaaaccgaatccagcagcacatcaaaaagcttatccaccgtgat caagtgggcttcatccctgggatgcaaggctggttcaatatatgcaaatcaataaacgta atccagcatatagacagaaccaaagacaaaaaccacatgattatctcaatagatgcagaa aaggcctttgacaaaattcaacaacccttcatgctaaaaactctcaataaattaggtatt gatgggacgtatctcaaaataataagagctatctatgacaaacccacagccaatatcata ctgaatgggcaaaaactggaagcattccctttgaaaactggcacaagacagggatgccct ctctcaccactcctattcaacatagtgttggaagttctggccagggcaattaggcaggag aaggaaataaagggtattcaattaggaaaagaggaagtcaaattgtccctgtttgcagat gacatgattgtatatctagaaaaccccattgtctcagcgcaaaatctccttaagctgata agcaacttcagcaaagtctcaggatacaaaatcaatgtacaaaaatcacaagcattctta tacaccaataacagacaaacagagagccaaatcatgagtgaactcccattcacaattgct tcaaagagaataaaatacctaggaatccaacttacaagggatgtgaaggacctcttcaag gagacctacaaaccactgctcaatgaaataaaagaggatacaaacaaatggaagaacatt ccatgctcatgggtaggaagaatcaatatcgtgaaaatggccatactgcccatggtaatt tatagattcaatgccatccccatcaagctaccaatgactttcttcacagaattggaaaaa actactttaaagttcatatggaaccaaaaaagagcccgcatcgccaagtcaatcctaagc caaaagaacaaagctggaggcatcacgctacctgacttcaaactatactacaaggctaca gtaaccaaaacagcatggtactggtaccaaaacagagatacagatcaatggaacagaaca gagccctcagaaataatgccgcatatctacaactatctgatctttgacaaacttgagaaa aacaagcaatggggaaaggattccttatttagtaaatggtgctgggaaaactggtccaat catatttcaacacagctgtccataatttgttga >gi568815593f:142008975_142244671|GENSCAN_predicted_peptide_4|187_aa MHLTDEESEALPCKRLTQARSAGKCRHPYLDSGAVLCCYGNSGALWLVEEGSGCVEIGIG GSSCALLLQQSGLGDNGLPAGRQELKQGTHEETTALAQVRAEGLPVLVWMETEDLQVMSF RISLSRHGPKPSHPKSDSFSLAGSKSLLPFKDLKHTDTATIRFLNLFPTFPAFLFHKAAI VILARSQ >gi568815593f:142008975_142244671|GENSCAN_predicted_CDS_4|564_bp atgcatttaacagatgaggaatctgaggccctgccatgcaagagacttactcaagccaga agtgccgggaaatgcagacatccctatttggactctggtgcagtcctctgttgctatggc aactcaggggctctgtggttggtggaagaaggcagtggctgtgtggaaatcggcattgga ggcagcagctgtgcccttttgctccagcagtctggcctaggggacaatggacttcctgca ggaaggcaagagttgaagcaggggacccatgaggaaaccactgcgctggcccaggtgaga gctgaagggctgcctgtgctggtatggatggagacagaggatttgcaggtgatgagcttc cgaatcagcctgagtagacatgggccgaagccctcccaccccaagtcagacagtttcagt ttggcgggatccaaatccctcctgccctttaaggatctcaagcacacagacacggcaacc atccgatttctcaatcttttccccacctttcccgcctttctattccacaaagccgccatt gtcatcctggcccgttctcaatga >gi568815593f:142008975_142244671|GENSCAN_predicted_peptide_5|73_aa MNGPKEKTIAWDKGPLSSGEGGDKLWEAQFLIGHGLVQVHDPGVGDPWSRSLHFTCYGSN CHVTTSSILTEAL >gi568815593f:142008975_142244671|GENSCAN_predicted_CDS_5|222_bp atgaatgggcccaaagaaaagacaatagcctgggacaaaggtcctctgagctctggggaa ggtggtgacaagttatgggaagcccagttcctaataggccatggactggtacaggtccac gacccaggggttggggacccctggtctagatcacttcactttacgtgttacggctccaac tgccatgttacgacttccagcattctcacggaagcactctga >gi568815593f:142008975_142244671|GENSCAN_predicted_peptide_6|265_aa MALALAALAAVEPACGSRYQQASLAGPRPTRRRGLAGRAGPGGAGRAPGSDSRLPRARPA RLRPQNEEESGEPEQAAGDAPPPYSSISAESAAYFDYKDESGFPKPPSYNVATTLPSYDE AERTKAEATIPLVPGRDEDFVGRDDFDDADQLRIGNDGIFMLTFFMAFLFNWIGFFLSFC LTTSAAGRYGAISGFGLSLIKWILIVRFSTYFPGYFDGQYWLWWVFLVLGRFNHLPNPLR IPFIEGGNEGDSGLQAQDICPPTAA >gi568815593f:142008975_142244671|GENSCAN_predicted_CDS_6|798_bp atggcgttggcgttggcggcgctggcggcggtcgagccggcctgcggcagccggtaccag caggcctctctggccggcccgcggccaactcgacgccggggcttggcaggccgcgctggg cctggaggggcgggtcgggccccgggctcggattcgaggctgccacgggcccggcctgcc cgtctgcgaccccagaatgaagaagagtctggagaacctgaacaggctgcaggtgatgct cctccaccttacagcagcatttctgcagagagcgcagcatattttgactacaaggatgag tctgggtttccaaagcccccatcttacaatgtagctacaacactgcccagttatgatgaa gcggagaggaccaaggctgaagctactatccctttggttcctgggagagatgaggatttt gtgggtcgggatgattttgatgatgctgaccagctgaggataggaaatgatgggattttc atgttaacttttttcatggcattcctctttaactggattgggtttttcctgtctttttgc ctgaccacttcagctgcaggaaggtatggggccatttcaggatttggtctctctctaatt aaatggatcctgattgtcaggttttccacctatttccctggatattttgatggtcagtac tggctctggtgggtgttccttgttttaggtagatttaatcatctccccaaccctttgagg attccctttattgaaggtgggaatgagggggattctggactgcaggcacaggacatctgc cctcccacagcagcctga >gi568815593f:142008975_142244671|GENSCAN_predicted_peptide_7|368_aa WSFRARKRAVQKPDSTPSHSPCSPVAASPQKLAETGVLFSEEGTTEGQQAAQNTPQAEKA LERCVFGFVMKEEVSCSRRTLVPTSWRWRETPVDLVGSCWFSLFPTYIPLLFLPCGCQAP VSDEKILSHQLWHNLSGDKPSTLFLETVGHLELRRQIRNSSQGWFGIHPATLKSMAAVTF WNPDPCCEKLKPHGETSKREAKGAPGDSPSPAPASTNYQPHHCSAQRHVPWGIFWVKQTL PGESTEIMLNGLLYRMIPGSATPYLPNWKAFTHASECGSIAQWFGAGSVVKHGMEPQLYL AQLYGPEQVTQHSYTWGPPLENGDANIFSAPERNLIRPYSWVAFHVVLKVPRAWATSKDS PSEDSAVT >gi568815593f:142008975_142244671|GENSCAN_predicted_CDS_7|1107_bp tggagcttccgagcccgaaaacgggctgtacagaagccagactccacaccctcccactcg ccatgcagtccggttgctgcctctccccaaaagctggcagagacaggcgttttattctcc gaagaaggaacaacagagggtcaacaagcagcacagaacacaccgcaggccgagaaggcc ctagaaaggtgtgtgtttggctttgtgatgaaggaagaggtcagctgctcccgcaggaca cttgtaccaacaagttggaggtggagagagactcctgtggatcttgtgggctcctgctgg ttctcactcttccctacttacatccctcttctcttcctgccgtgtggatgccaagctcca gtgtcagatgagaagattctgagccatcagctctggcataacctcagtggtgacaagcct tcaactttatttctggaaaccgtaggccatttggagctcagacggcaaataaggaacagc tcccagggttggtttgggatccaccctgccaccctcaagagtatggcagcagtgacattc tggaacccggacccatgctgcgagaaactcaagccacatggagagaccagtaagagggaa gccaaaggcgctcctggggatagccccagcccagcaccagccagcaccaactaccagccc caccactgctctgcacagaggcacgtgccctggggcattttctgggtgaagcagacactc ccaggggagagcacagagataatgctaaatggcctgttgtacagaatgattcctggcagt gccactccttacctgcccaactggaaagcttttacacatgcttcagaatgcggcagtatt gcacaatggtttggggcaggttctgtggttaaacatgggatggaaccccagctctacctg gctcagctgtacggccctgagcaagtgactcaacattcctacacatggggacccccgctg gaaaatggagatgccaacatcttctcagcaccagagagaaacctaattcgcccttactcc tgggttgctttccatgttgtcctcaaggtgcctcgggcatgggccaccagtaaggattca cccagtgaggattcagctgtcacctag