GENSCAN 1.0 Date run: 5-Nov-116 Time: 00:59:59 Sequence gi568815589r:68913040_69114092 : 201053 bp : 43.11% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 1812 1820 9 2 0 75 107 8 0.663 1.48 1.02 Intr + 3382 3558 177 0 0 108 41 40 0.710 1.42 1.03 Intr + 4509 4720 212 2 2 117 113 213 0.995 24.41 1.04 Intr + 6642 6690 49 0 1 69 98 30 0.966 0.78 1.05 Intr + 10263 10347 85 2 1 115 116 2 0.942 5.09 1.06 Intr + 21851 22006 156 0 0 85 98 129 0.613 13.58 1.07 Intr + 27607 27751 145 2 1 90 90 41 0.123 3.84 1.08 Intr + 33004 33102 99 1 0 83 82 84 0.143 6.53 1.09 Intr + 78101 78218 118 2 1 91 83 89 0.171 9.17 1.10 Term + 87231 87263 33 2 0 100 38 33 0.108 -2.91 1.11 PlyA + 88450 88455 6 1.05 2.02 PlyA - 89582 89577 6 1.05 2.01 Sngl - 101053 99998 1056 1 0 103 50 2050 0.999 199.86 2.00 Prom - 104973 104934 40 -3.56 3.04 PlyA - 105447 105442 6 1.05 3.03 Term - 111754 111583 172 0 1 108 48 164 0.877 11.70 3.02 Intr - 112379 112280 100 0 1 38 92 26 0.426 -2.83 3.01 Init - 118592 118577 16 2 1 97 91 14 0.576 2.95 3.00 Prom - 118650 118611 40 -9.16 4.00 Prom + 121506 121545 40 -4.86 4.01 Init + 122744 122908 165 1 0 68 92 260 0.997 22.03 4.02 Intr + 123592 123627 36 0 0 90 50 52 0.010 0.06 4.03 Intr + 133346 133443 98 1 2 68 90 26 0.251 -0.39 4.04 Intr + 140101 140221 121 1 1 76 34 162 0.955 10.10 4.05 Intr + 151899 152004 106 2 1 65 77 58 0.938 2.19 4.06 Term + 154213 154628 416 2 2 89 47 269 0.937 18.32 4.07 PlyA + 155259 155264 6 -3.24 5.00 Prom + 155577 155616 40 -4.56 5.01 Init + 157399 157430 32 0 2 103 97 5 0.704 2.33 5.02 Term + 159573 159723 151 0 1 80 44 209 0.997 13.08 5.03 PlyA + 161554 161559 6 1.05 6.02 PlyA - 164381 164376 6 1.05 6.01 Term - 195161 194879 283 1 1 13 55 248 0.954 8.90 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term + 123592 123633 42 0 0 90 55 79 0.976 1.96 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815589r:68913040_69114092|GENSCAN_predicted_peptide_1|360_aa MTLSIAISFVWLHPTIFNSLASPLSAEWIIKHEVLCFVEIIFLKDLLQGTERHLRKTDFE GWVLESFKIMDYSLLLGIHFLDHSLKEKEEETPQNVPDAKRTGMQKVLYSTAMESIQGPG KSGDGIITENPDTLMKKLEHSWKALVYDGDTVSVHRPSFYADRFLKFMNSRVFKKIQALK ASPSKKRCNSIAALKATSQEIVSSISQEWKDEKRDLLTEGQSFSSLDEEALGSRHRPDLV PSTPSLFEAASLATTISSSSLYVNEHYPHDRPTLYSNRWTPNHYVIIEKRLLLDMFLLGL ENPISSVDTASKGLPSSSTFTLEEGTIYLTAEPNTLEVQDDNASVLDVYLVLAYSHGTPR >gi568815589r:68913040_69114092|GENSCAN_predicted_CDS_1|1083_bp atgacactgagcattgccatctcttttgtctggctgcatccaactatttttaattccctt gcctccccactgtctgctgagtggataattaaacatgaagtactatgctttgttgagatt atatttttaaaggacttgctacagggaactgaacggcaccttcgaaagacagactttgag ggctgggtgctagaaagcttcaagatcatggattatagccttctgttgggaattcatttc ctggaccattccctcaaagagaaagaggaggagaccccacaaaatgtgcctgatgctaag cggactgggatgcagaaggttctctactcaacagccatggaatctatccagggtccaggg aaatctggagatgggataatcacagagaacccagacacgttaatgaagaagttagaacat tcctggaaagctcttgtttatgatggggacactgtttctgttcatagaccaagcttttat gcagacagatttcttaagttcatgaattccagagttttcaagaaaattcaagctttgaag gcttcaccgtctaagaaacggtgcaattcaatcgccgccctaaaggccacttcacaggag attgtgtcctcaattagccaggaatggaaggatgagaagcgggatttgctgactgaagga caaagttttagcagccttgatgaagaagccctgggatcccgacacaggccagacctggtc cctagcactccatcactgtttgaagctgcttccttggcaaccacaatttcatcttcttcc ttatacgtcaatgagcactatccacacgacaggcctacactctattcaaacaggtggact ccaaatcactacgtaatcatcgagaaaagacttttattagatatgttcctgctgggttta gaaaatcctatatcttctgttgatacagcaagcaaagggttaccttccagttcaacattt accttggaagaggggaccatctacttgaccgctgagcccaacactctggaagtgcaggat gacaatgcttctgtgcttgacgtctatttagtcctcgcctacagccatggaactcccagg tga >gi568815589r:68913040_69114092|GENSCAN_predicted_peptide_2|351_aa MGNAPAKKDTEQEESVNEFLAKARGDFLYRWGNPAQNTASSDQFERLRTLGMGSFGRVML VRHQETGGHYAMKILNKQKVVKMKQVEHILNEKRILQAIDFPFLVKLQFSFKDNSYLYLV MEYVPGGEMFSRLQRVGRFSEPHACFYAAQVVLAVQYLHSLDLIHRDLKPENLLIDQQGY LQVTDFGFAKRVKGRTWTLCGTPEYLAPEIILSKGYNKAVDWWALGVLIYEMAVGFPPFY ADQPIQIYEKIVSGRVRFPSKLSSDLKHLLRSLLQVDLTKRFGNLRNGVGDIKNHKWFAT TSWIAIYEKKVEAPFIPKYTGPGDASNFDDYEEEELRISINEKCAKEFSEF >gi568815589r:68913040_69114092|GENSCAN_predicted_CDS_2|1056_bp atgggcaacgcccccgccaagaaggacaccgagcaggaggagagcgtgaacgagttccta gccaaagccagaggagatttcctctacagatggggaaaccccgctcaaaacaccgccagc tcggatcagttcgaacggctcaggacgctgggcatgggctccttcgggcgggtgatgctg gtgaggcaccaggagaccggcggccactacgccatgaagatcctcaacaagcagaaggtg gtgaagatgaagcaggtcgagcacatactgaacgagaagcgcatcctgcaggcgatcgac tttccgttcctcgtcaagctccagttctcctttaaggacaactcctacctgtacctggtg atggagtacgtgccgggtggggagatgttctcccgcctacagcgcgtcggaaggtttagc gagccccatgcctgtttctatgccgcccaggtcgtcctggccgtccagtacctacactcg ctcgacctcatccaccgcgacctgaagcccgagaatctcctcatcgaccagcagggctac ctgcaggtgacggacttcggtttcgccaagcgcgtgaagggccgcacttggaccttgtgc gggaccccagagtacctggcccccgagatcatcctgagcaaaggctacaacaaggccgtg gactggtgggccctaggggtgctcatctatgagatggccgtgggcttcccacccttctac gccgaccagcccatccagatctacgagaagatcgtctctgggagggtgcggtttccctcc aaactcagctctgacctcaagcatctgctgcggagcctgctgcaggtggacctcaccaag cgcttcggaaacctcaggaacggggttggcgacatcaagaaccacaagtggttcgccaca accagctggatcgccatctatgagaagaaggtggaagctcccttcatcccgaagtacaca ggccctggggatgccagtaactttgacgactacgaggaggaagagctccggatctccatc aatgagaagtgtgccaaggagttttctgagttttag >gi568815589r:68913040_69114092|GENSCAN_predicted_peptide_3|95_aa MEQKQGTVLGADYTAVNKTGKHPLLLMFYCRDTDDNKIRSLAMWELVAMLSESMERPLVR TSSTQPKASKDVRLASNHMSELEVGPLASDDCSSD >gi568815589r:68913040_69114092|GENSCAN_predicted_CDS_3|288_bp atggagcagaaacaaggcactgttctaggtgccgattatacagcagtgaacaagacaggc aaacatcccttgcttcttatgttctattgtagggacactgatgataacaaaataagatca cttgctatgtgggagctggttgccatgctgtcagaatctatggagaggcccctggtgagg acctccagtacacagcccaaagccagcaaggacgtgaggcttgccagcaaccacatgagt gagctggaagtgggtcctcttgcctcagatgactgcagctctgactga >gi568815589r:68913040_69114092|GENSCAN_predicted_peptide_4|313_aa MWTLGRRAVAGLLASPSPAQAQTLTRVPRPAELAPLCGRRGLRTDIDATCTPRRAGEDKG DAHFADLSSNQRGLNQIWNVKKQSVYLMNLRKSGTLGHPGSLDETTYERLAEETLDSLAE FFEDLADKPYTFEDYDVSFGSGVLTVKLGGDLGTYVINKQTPNKQIWLSSPSRYVDCGVP GEQDPGRSAPVFLEGAGRSAGAPCSCTCEGTGFRGKIPQFRAVVIVFIPVANGMTRHTVM SGETPVEEQEVVDMLWPGQWLPSSWSLHPPNLAKEVPILSVEKISRSCCVNIKASKQQQQ QQKHEIINNKDHP >gi568815589r:68913040_69114092|GENSCAN_predicted_CDS_4|942_bp atgtggactctcgggcgccgcgcagtagccggcctcctggcgtcacccagcccagcccag gcccagaccctcacccgggtcccgcggccggcagagttggccccactctgcggccgccgt ggcctgcgcaccgacatcgatgcgacctgcacgccccgccgcgcaggagaagataaaggt gacgcccattttgcggacctgagttcgaaccaacgtggcctcaaccagatttggaatgtc aaaaagcagagtgtctatttgatgaatttgaggaaatctggaactttgggccacccaggc tctctagatgagaccacctatgaaagactagcagaggaaacgctggactctttagcagag ttttttgaagaccttgcagacaagccatacacgtttgaggactatgatgtctcctttggg agtggtgtcttaactgtcaaactgggtggagatctaggaacctatgtgatcaacaagcag acgccaaacaagcaaatctggctatcttctccatccaggtatgtagactgcggggttccg ggggagcaggacccaggccgttctgcgcctgtcttcttggaaggagcaggccggagcgcg ggagcgccgtgtagctgtacctgcgaaggcacaggattccgcgggaagatcccgcagttt cgggccgtcgtcattgtttttatacctgtggcaaatggcatgaccagacacacggttatg tctggagaaacccctgtagaggagcaggaggttgtggacatgctgtggcccggacagtgg ctgccgagcagttggagcctgcacccgcccaacttggctaaagaagtccccatactctct gtggaaaagatttccagaagctgttgtgtcaatatcaaagcctcaaaacaacaacaacaa caacaaaaacatgaaattatcaacaataaagatcatccttga >gi568815589r:68913040_69114092|GENSCAN_predicted_peptide_5|60_aa MGHQGHQEGGSGPKRYDWTGKNWVYSHDGVSLHELLAAELTKALKTKLDLSSLAYSGKDA >gi568815589r:68913040_69114092|GENSCAN_predicted_CDS_5|183_bp atgggccaccagggccaccaggagggaggcagtggacctaagcgttatgactggactggg aaaaactgggtgtactcccacgacggcgtgtccctccatgagctgctggccgcagagctc actaaagccttaaaaaccaaactggacttgtcttccttggcctattccggaaaagatgct tga >gi568815589r:68913040_69114092|GENSCAN_predicted_peptide_6|94_aa XGQVYGIEFNPPKTVGIDDLTGEPLIQREDDKPETVIKRLKAHETQTKPVLEYYQKKGVL ETFSGTETNKIWPYEYAFLQTKVPQTSQKSSVTS >gi568815589r:68913040_69114092|GENSCAN_predicted_CDS_6|285_bp nntggccaagtctacggcattgaattcaaccctcccaaaactgtgggcattgatgatcta actggggagcctctcattcagcgtgaggatgataaaccagagacggttatcaagagacta aaggctcatgaaacccaaacaaagccagtcctggaatattaccagaaaaaaggggtgttg gaaacattctctggaacagaaaccaacaagatttggccctatgaatatgctttcctacaa actaaagttccacaaacaagccagaaatcttcagttacttcatga