GENSCAN 1.0 Date run: 3-Nov-116 Time: 14:51:44 Sequence gi568815578r:772963_1016214 : 243252 bp : 48.42% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.03 PlyA - 316 311 6 1.05 1.02 Term - 3981 3747 235 2 1 120 49 168 0.933 11.89 1.01 Init - 23816 23644 173 2 2 101 84 105 0.124 10.32 1.00 Prom - 40509 40470 40 -2.16 2.00 Prom + 41056 41095 40 -4.26 2.01 Init + 43395 43446 52 2 1 57 110 26 0.486 3.03 2.02 Intr + 51955 52128 174 2 0 54 70 82 0.217 2.91 2.03 Intr + 60777 60989 213 1 0 87 94 125 0.780 11.69 2.04 Term + 61365 61540 176 1 2 108 43 16 0.777 -2.98 2.05 PlyA + 63060 63065 6 1.05 3.05 PlyA - 64979 64974 6 1.05 3.04 Term - 65475 65285 191 1 2 107 41 37 0.361 -1.49 3.03 Intr - 67044 66652 393 1 0 -1 98 411 0.189 27.83 3.02 Intr - 69187 68919 269 0 2 53 97 107 0.235 5.28 3.01 Init - 69603 69545 59 0 2 93 75 47 0.911 3.20 3.00 Prom - 70545 70506 40 -7.06 4.00 Prom + 71310 71349 40 -15.08 4.01 Sngl + 71843 72730 888 1 0 99 48 978 0.956 88.99 4.02 PlyA + 73290 73295 6 1.05 5.04 PlyA - 74209 74204 6 1.05 5.03 Term - 92330 92146 185 0 2 58 45 128 0.420 3.21 5.02 Intr - 95876 95669 208 2 1 61 94 130 0.591 9.55 5.01 Init - 96515 96333 183 2 0 85 92 10 0.773 0.28 5.00 Prom - 97641 97602 40 -6.76 6.12 PlyA - 97879 97874 6 1.05 6.11 Term - 100158 99998 161 1 2 89 43 290 0.998 22.70 6.10 Intr - 101452 101322 131 0 2 78 92 152 0.998 14.94 6.09 Intr - 102527 102381 147 1 0 51 9 142 0.436 1.95 6.08 Intr - 105365 105199 167 2 2 122 94 150 0.987 17.76 6.07 Intr - 106886 106785 102 2 0 98 68 79 0.934 7.27 6.06 Intr - 108324 108209 116 1 2 121 72 120 0.970 13.87 6.05 Intr - 112363 112116 248 0 2 114 100 466 0.991 47.50 6.04 Intr - 115477 115356 122 1 2 48 94 164 0.975 12.29 6.03 Intr - 117406 117251 156 1 0 106 92 240 0.745 26.41 6.02 Intr - 140028 139936 93 0 0 111 80 43 0.831 5.96 6.01 Init - 143234 142944 291 2 0 75 102 157 0.524 11.19 6.00 Prom - 143407 143368 40 -2.46 7.16 PlyA - 147298 147293 6 1.05 7.15 Term - 147720 147590 131 1 2 78 38 29 0.194 -4.76 7.14 Intr - 147980 147878 103 2 1 103 116 -9 0.248 3.15 7.13 Intr - 167564 167264 301 1 1 80 119 93 0.032 8.34 7.12 Intr - 183697 183556 142 2 1 112 30 81 0.138 4.11 7.11 Intr - 184047 183920 128 2 2 50 72 26 0.229 -2.48 7.10 Intr - 186820 186570 251 1 2 69 99 51 0.383 0.54 7.09 Intr - 187504 187428 77 2 2 93 58 67 0.464 3.43 7.08 Intr - 190156 190073 84 2 0 33 91 67 0.478 1.29 7.07 Intr - 191158 190973 186 2 0 112 83 35 0.780 5.06 7.06 Intr - 194352 194212 141 1 0 85 96 122 0.883 13.02 7.05 Intr - 195176 194988 189 0 0 83 68 371 0.995 34.16 7.04 Intr - 207413 207348 66 0 0 101 82 34 0.014 2.98 7.03 Intr - 213485 213318 168 0 0 122 55 36 0.006 3.62 7.02 Intr - 229381 229124 258 2 0 65 82 288 0.992 23.33 7.01 Init - 235037 234974 64 2 1 55 103 49 0.333 4.41 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term + 212765 212816 52 2 1 103 42 56 0.821 -0.60 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815578r:772963_1016214|GENSCAN_predicted_peptide_1|135_aa MDNSYFFRQLKIIQCLGEQLLVNKVLTFTINETSFSPNFPKCMGHSKGIPGICIDVVLFF ADLTPMAPPKPDVSTRTANPAPVAPPPKAIQPVGAQLQPPMISSPPQPISSKHLLPGHPQ LFPYTAFEKPLTYKL >gi568815578r:772963_1016214|GENSCAN_predicted_CDS_1|408_bp atggacaatagctatttcttccggcaactgaagattattcaatgcttgggtgaacagctc cttgtgaacaaggtcttgacctttactattaatgagacctcgttcagtccaaattttccc aaatgtatgggccactccaaaggcatacctggaatctgtatagatgtggttctgtttttt gcagatttgacacccatggctccacctaaacctgatgtctccaccaggaccgccaaccct gctcctgtggccccccccccaaaagcgattcagcctgtaggagcacagcttcaaccccct atgatttcatctccaccccaaccaatcagcagcaagcacctgttacctggccacccccaa ctcttcccctatactgcctttgaaaaacccctcacctacaagctttga >gi568815578r:772963_1016214|GENSCAN_predicted_peptide_2|204_aa MWEKKAERGPSEGASTPGSVLKALPASFQRLDVTSYIVGEIIVPIKDGPTVWLSQWGMSS DLECSHRKLGPILSEGTFPAVGAFDSEGHPLSSTAEGSSKGDPKGAHAGLGPPGRRWERP LPARVRAVSLSKACARGAGTRLARAPVRGWTRLIWRLSVYGDRGPLFGDRTGLTFLGWMQ GQQILQPPEPLLASWESSPVWGRE >gi568815578r:772963_1016214|GENSCAN_predicted_CDS_2|615_bp atgtgggagaagaaggcagagagagggccttctgagggggcctccacgccaggctctgtg ctaaaagctttgccagcatcatttcagagacttgatgtgacatcttacatcgtaggggaa attattgttcccattaaagatgggcccacagtgtggctaagccagtggggcatgtcctca gacctggagtgctcccataggaagctgggcccgattctttctgaagggactttccctgcg gtgggggccttcgacagtgagggccacccgctcagctcgaccgcggagggcagctccaaa ggggaccccaaaggtgcccacgcggggctggggcctcctgggcgtcgttgggagcggcca ctaccggcccgggtccgagctgtcagcctctccaaagcctgcgcgagaggagccgggaca cgcctagcgcgggctccagtccggggttggactcggctcatttggcgcctttctgtctac ggggacagaggaccactctttggggatcggacgggactgacatttctgggctggatgcag gggcagcagattctccagcccccagagccgctgctggcctcttgggaatcatccccagtt tggggaagggaatag >gi568815578r:772963_1016214|GENSCAN_predicted_peptide_3|303_aa MEWMGKEASLHQGRPPGDLRLGYLIRETRLKIDLPEPPGHDKDGEIKQVEGFPRHLAKRR SHEPGSWSSRRFRLTKSTDHPHAGVSKQAPRGSVPAHTADKAGPRQRLIAPAPDPTAAKM LMPKKNRIAIHELLFKEGVMVAKKDVHMPKHPELADKNVPNLHVMKAMQSLKSRGCVKEQ FAWRHFYWYLTNEGSQYLRDYLHLPPEIVPATLHLPPEIVPATLHRSRPETGRPRPKGLE GTKHEEIPLNNRLFWTQPTSKAKATHQEVYIKCIKLFLISVRKREERKRGRKKGEKVKHI CQE >gi568815578r:772963_1016214|GENSCAN_predicted_CDS_3|912_bp atggagtggatgggaaaggaagcctcgctccaccagggccgacccccaggagacctcaga ctcggttaccttatccgtgaaacgaggttaaaaatagaccttcctgagcctccaggtcac gataaggatggagagatcaaacaggtcgagggctttcctcggcatctggcaaaaaggcgg agccacgaaccaggctcctggtcttcaagaagatttcgactaaccaagtccacagaccac ccccacgcaggggtctctaagcaagccccacgagggtctgtcccagcccacaccgcggac aaagcaggcccaagacagaggctgatcgccccagccccggaccctacagctgccaagatg ctgatgcctaagaagaaccggattgccattcatgaactcctttttaaggagggagtcatg gtggccaagaaggatgtccacatgcctaagcacccggagctggcagacaagaatgtgccc aaccttcacgtcatgaaggccatgcagtctctcaagtcccggggctgcgtgaaggaacag tttgcctggagacatttctactggtaccttaccaatgagggtagccagtatctccgtgat taccttcatctgcccccagagattgttcctgccaccctacatctgcccccggagattgtt cctgccaccctacaccgcagccgtccagagactggcaggcctcggcctaaaggtctggag gggaccaaacacgaggaaataccgttgaacaaccggctgttctggacccagcccacctca aaggccaaggccactcatcaagaggtatatattaaatgtataaagctttttcttatatca gtaagaaagagagaagaaagaaagagaggaagaaagaaaggagagaaagttaaacatatc tgtcaagaataa >gi568815578r:772963_1016214|GENSCAN_predicted_peptide_4|295_aa MPVHTLSPGAPSAPALPCRLRTRVPGYLLRGPADGGARKPSAVERLEADKAKYVKSLHVA NTRQEPVQPLLSKQPLFSPETRRTVLTPSRRALPGPCRRPQLDLDILSSLIDLCDSPVSP AEASRTPGRAEGAGRPPPATPPRPPPSTSAVRRVDVRPLPASPARPCPSPGPAAASSPAR PPGLQRSKSDLSERFSRAAADLERFFNFCGLDPEEARGLGVAHLARASSDIVSLAGPSAG PGSSEGGCSRRSSVTVEERARERVPYGVSVVERNARVIKWLYGLRQARESPAAEG >gi568815578r:772963_1016214|GENSCAN_predicted_CDS_4|888_bp atgcctgtgcacacgctgagccccggagccccgtccgcccccgccctaccttgccgcctg cggaccagggtccctggctacctgctacgggggccggcagatggtggagcccggaaaccg agcgctgtggagcgcctggaggccgacaaggccaagtacgtcaagagcctgcacgtggcc aacacccgccaggagcctgtgcagcccctgctgtccaaacagccgctctttagccctgag actcgccgcacagtgctcacgcccagccgccgagccctgcctggcccctgccgacggccc cagctggacctggacatcctcagcagcctcatcgacttgtgtgacagccccgtgtcccct gccgaggccagccgcactcctggacgggccgagggagccggccgtcctcccccagccacc cctccgcgaccgccgcccagtacctctgcggtccgccgggtggacgtccgccccctgccc gcctcgcctgcccggccctgcccatcacccggccctgccgccgcctccagcccagcccgg ccgccgggtttgcaacgctccaagtcggacttgagcgagcgcttttctagggcagccgct gatctcgagcgcttttttaacttctgcggcctggacccggaggaggcgagagggttgggt gtggcccacctggcacgggccagctcggatatcgtgtccctggcagggcccagtgctggg ccgggcagctctgaagggggctgctcccgccgcagctcggtgactgttgaggagcgggcc cgggagcgcgttccctatggcgtgtcggtggtggagcgcaatgcccgcgtgatcaagtgg ttgtatgggctaaggcaggctcgggagagcccagcagctgaaggctag >gi568815578r:772963_1016214|GENSCAN_predicted_peptide_5|191_aa MAADKTNLRILGSTRHCRGHGWCGLEPRLGLLGATAHRCQPLGYLHLRVPMENTCHLSLH QSTLGIWASTTKLVYQGQEELPGKASKTTQSLPLALDGSIISEISALFPAKAVSTLGREF PEYTPETTTTGAQMRMCVLRFNTGTYSATTDVSSGPSQQPGITHVPTEDVFGIAFCCFSA AFLLLRPHELA >gi568815578r:772963_1016214|GENSCAN_predicted_CDS_5|576_bp atggctgcagacaagaccaacctgaggatcctgggcagcacaaggcactgtagagggcac ggctggtgtggtctcgagcctagactgggtctcctgggggccacagctcataggtgccag cctctaggatacctgcatctgcgggttcctatggaaaacacttgtcacctctccctgcac cagagcactctgggcatctgggcatctaccaccaaactcgtgtaccaagggcaggaggaa cttcctggaaaggcctccaagaccactcagtctctcccactggcacttgatggatccatc atttctgagatctctgcgttgttcccagcaaaggcagtttccacactggggagagagttc cctgagtacaccccagagacaaccaccacaggtgcccagatgcgcatgtgtgtgctgagg ttcaacactggaacttattcagctacaacagatgtgtcctctgggccctcgcagcaacct ggtataacacacgtgcctacagaggatgtgtttggcatcgcattctgctgcttttctgct gcttttctgcttcttcgaccacatgaattggcttag >gi568815578r:772963_1016214|GENSCAN_predicted_peptide_6|577_aa MLQGSLLLVVATMSVAQQTRQEADRGCETLVVQHGHCSYTFLLPKSEPCPPGPEVSRDSN TLQRESLANPLHLGKLPTQQVKQLEQALQNNTQWLKKCTGEETGQRGKVTCQSHVAVKAN LDSHYKLQLERAIKTILRSKLEQVQQQMAQNQTAPMLELGTSLLNQTTAQIRKLTDMEAQ LLNQTSRMDAQMPETFLSTNKLENQLLLQRQKLQQLQGQNSALEKRLQALETKQQEELAS ILSKKAKLLNTLSRQSAALTNIERGLRGVRHNSSLLQDQQHSLRQLLVLLRHLVQERANA SAPAFIMAGEQVFQDCAEIQRSGASASGVYTIQVSNATKPRKVFCDLQSSGGRWTLIQRR ENGTVNFQRNWKDYKQGFGDPAGEHWLGNEVVHQLTRRAAYSLRVELQDWEGHEAYAQYE HFHLGSENQLYRSHGLHHQAGLDMPTKATEPGKEIEILTWLAADVEAAELGCPGAQIHGG WLSVVGYSGSAGRQSSLVLQNTSFSTLDSDNDHCLCKCAQVMSGGWWFDACGLSNLNGVY YHAPDNKYKMDGIRWHYFKGPSYSLRASRMMIRPLDI >gi568815578r:772963_1016214|GENSCAN_predicted_CDS_6|1734_bp atgctgcagggcagcctcctccttgtggttgccaccatgtctgtggctcaacagacaagg caggaggcggataggggctgcgagacacttgtagtccagcacggccactgtagctacacc ttcttgctgcccaagtctgagccctgccctccggggcctgaggtctccagggactccaac accctccagagagaatcactggccaacccactgcacctggggaagttgcccacccagcag gtgaaacagctggagcaggcactgcagaacaacacgcagtggctgaagaagtgtacaggt gaggaaacaggccagagagggaaagtgacttgccagagccatgtagcagtgaaagccaat ctggattctcactacaaactgcagctagagagggccatcaagacgatcttgaggtcgaag ctggagcaggtccagcagcaaatggcccagaatcagacggcccccatgctagagctgggc accagcctcctgaaccagaccactgcccagatccgcaagctgaccgacatggaggctcag ctcctgaaccagacatcaagaatggatgcccagatgccagagacctttctgtccaccaac aagctggagaaccagctgctgctacagaggcagaagctccagcagcttcagggccaaaac agcgcgctcgagaagcggttgcaggccctggagaccaagcagcaggaggagctggccagc atcctcagcaagaaggcgaagctgctgaacacgctgagccgccagagcgccgccctcacc aacatcgagcgcggcctgcgcggtgtcaggcacaactccagcctcctgcaggaccagcag cacagcctgcgccagctgctggtgttgttgcggcacctggtgcaagaaagggctaacgcc tcggccccggccttcataatggcaggtgagcaggtgttccaggactgtgcagagatccag cgctctggggccagtgccagtggtgtctacaccatccaggtgtccaatgcaacgaagccc aggaaggtgttctgtgacctgcagagcagtggaggcaggtggaccctcatccagcgccgt gagaatggcaccgtgaattttcagcggaactggaaggattacaaacagggcttcggagac ccagctggggagcactggctgggcaatgaagtggtgcaccagctcaccagaagggcagcc tactctctgcgtgtggagctgcaagactgggaaggccacgaggcctatgcccagtacgaa catttccacctgggcagtgagaaccagctatacagatctcatggtcttcatcaccaggca ggcttggacatgcccaccaaggccacagagcctggcaaagaaatagaaatcctcacctgg ctggcagcagatgtcgaagctgcagagctggggtgccctggagcccagatccatgggggc tggctttctgtggtcgggtacagcggctcagcagggcgccagagcagcctggtcctgcag aacaccagctttagcacccttgactcagacaacgaccactgtctctgcaagtgtgcccaa gtgatgtctggagggtggtggtttgacgcctgtggcctgtcaaacctcaacggcgtctac taccacgctcccgacaacaagtacaagatggacggcatccgctggcactacttcaagggc cccagctactcactgcgtgcctctcgcatgatgatacggcctttggacatctaa >gi568815578r:772963_1016214|GENSCAN_predicted_peptide_7|762_aa MSMLITLTRHMAGSSSKFDKAGQCPGSANALTRPGGRTDPRLADAPGAPTAAPAPAVPPP GRGALGPSGRLGSASPDPAAQMRAPLCLLLLVAHAVDMLALNRRKKQGRYVMLSHSLLSS LWVDFSPVCWGAEEGMGIETAVTTLMLLWPPQFPVLENSGAITSEVKRTPKKASAQHWEL SKASVVGTGLGGNCTGCIICSEENGCSTCQQRLFLFIRREGIRQYGKCLHDCPPGYFGIR GQEVNRCKKCGATCESCFSQDFCIRCKRQFYLYKGKCLPTCPPGTLAHQNTRECQGECEL GPWGGWSPCTHNGKTCGSAWGLESRVREAGRAGHEEAATCQVLSESRKCPIQRPCPGDHY TYFQNVSGGYEGPLIYIEERPQGVLERSPGQKKGRKDRRPRKDRKLDRRLDSPEAISTAF SHQPPRGAGWTRLQGFQRPGLDTSPSWPWRVTTWPLGGQPALEGILCKLPATLIGTPPQA SLSWVPCPFFPSRISLSHILWRIEGPLWEGDSAVRWASPTPFLQAKLGEVHREAQKQALH VIPTCCHRMTAVNQEQVAARLVKERISETGSCCGSLLAGLEQGGGLDVMVIMGCSLAVKA GGLKDNLVPTPQCGCRWLFRPPHLPPSAFAGAHMECPPGLTVLQTFTGAPQAVHVGNVRK PKRHHAPNTKEVFSLLTKSPTWVPLEQEVLFFQENRFQHVTLGKWPPSLSISFLMSEMQG FLERHCTPPLVPSDPVRLTCSKAPGLDTGSSLTIHSSSFLWL >gi568815578r:772963_1016214|GENSCAN_predicted_CDS_7|2289_bp atgtctatgctcatcacacttactcggcatatggctggaagttctagcaagtttgataaa gcagggcagtgccccggctccgccaacgccctcactagacctggcggccggaccgacccg cgcctggcggatgcgcccggcgcgcccacagcagcccccgcgcccgccgtgccgccgccg ggacgtggggcccttgggccgtcgggccgcctggggagcgccagcccggatccggctgcc cagatgcgggcgccactctgcctgctcctgctcgtcgcccacgccgtggacatgctcgcc ctgaaccgaaggaagaagcaagggagatatgtcatgctgtctcattctcttctctcctcc ctctgggttgatttctcccctgtatgttggggagcagaagaaggaatggggatagagaca gctgtcactaccctgatgctgctgtggcctccacaattcccggtgttagaaaactctggg gccatcaccagtgaggtcaagcggacaccgaagaaggcctcagcacagcactgggaactt agcaaggcgtcagtagtgggcactggcctggggggcaactgcacaggctgtatcatctgc tcagaggagaacggctgttccacctgccagcagaggctcttcctgttcatccgccgggaa ggcatccgccagtacggcaagtgcctgcacgactgtccccctgggtacttcggcatccgc ggccaggaggtcaacaggtgcaaaaaatgtggggccacttgtgagagctgcttcagccag gacttctgcatccggtgcaagaggcagttttacttgtacaaggggaagtgtctgcccacc tgcccgccgggcactttggcccaccagaacacacgggagtgccagggggagtgtgaactg ggtccctggggcggctggagcccctgcacacacaatggaaagacctgcggctcggcttgg ggcctggagagccgggtacgagaggctggccgggctgggcatgaggaggcagccacctgc caggtgctttctgagtcaaggaaatgtcccatccagaggccctgcccaggagaccactac acgtatttccaaaatgtttctgggggctatgaaggaccactgatttacatagaagagagg ccacagggtgtgcttgagaggagccccggccagaagaagggcaggaaggaccggcgccca cgcaaggacaggaagctggaccgcaggctggactccccagaggccatttccacagccttc agccaccagccaccccgaggagctggctggacaaggctccagggcttccagaggcctggc ttggacacctcccccagctggccgtggagggtcacaacctggcctctgggtgggcagcca gccctggagggcatcctctgcaagctgcctgccaccctcatcggcactcccccacaggcc tccctctcatgggttccatgcccctttttcccaagccggatcagcctctcccacattctc tggaggattgaggggcctctctgggaaggagactctgctgtcaggtgggcatcgcccact cccttcttacaagccaagcttggagaagttcacagagaagctcagaaacaggccctgcat gtcattcccacatgctgccaccggatgacggcagtcaaccaggaacaggtcgcagccagg cttgtcaaggagcgcatttctgaaacggggtcgtgttgcggttctctcctagcaggactg gagcagggtggagggcttgatgtgatggtgatcatgggctgctctctggcagtcaaggcg ggaggactgaaggacaatctggttccaacaccacagtgtggctgccgttggctcttcagg cctcctcacctcccaccctctgcctttgctggtgcccacatggaatgtcctcccgggctg accgtcctgcagacttttacaggtgccccccaagcggtgcatgtgggtaacgttcgtaag cccaagaggcaccatgcaccaaataccaaagaggttttttccttactcaccaaaagccca acatgggtgcccctggagcaggaagttctgttcttccaggaaaatagattccagcatgta acactgggcaagtggcctccctctctgagtatcagtttcctcatgagtgaaatgcaaggt tttcttgagcgtcactgtaccccacccttagtcccttccgatcctgtgaggttgacctgc tccaaggctccaggcctggacacaggatccagcctgaccatccacagtagctccttcctc tggctataa