GENSCAN 1.0 Date run: 8-Nov-116 Time: 14:48:09 Sequence gi568815592f:150269032_150498234 : 229203 bp : 43.22% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.03 PlyA - 601 596 6 1.05 1.02 Term - 5421 5278 144 0 0 126 49 115 0.896 9.31 1.01 Init - 9930 9721 210 0 0 99 66 116 0.485 9.29 1.00 Prom - 12959 12920 40 -4.36 2.00 Prom + 18670 18709 40 -5.06 2.01 Init + 35741 35936 196 1 1 77 87 57 0.443 3.60 2.02 Intr + 38976 39049 74 1 2 24 94 86 0.123 1.93 2.03 Term + 56235 56348 114 2 0 105 36 99 0.867 4.97 2.04 PlyA + 57441 57446 6 1.05 3.00 Prom + 60611 60650 40 -0.96 3.01 Init + 65886 65973 88 2 1 60 96 73 0.283 6.10 3.02 Intr + 100029 100178 150 1 0 53 98 122 0.042 9.83 3.03 Intr + 101679 101806 128 1 2 3 68 48 0.092 -5.30 3.04 Intr + 111897 112008 112 2 1 101 81 55 0.818 6.05 3.05 Intr + 115931 116035 105 0 0 92 57 78 0.754 5.29 3.06 Intr + 120321 120512 192 1 0 89 72 85 0.979 6.46 3.07 Intr + 123341 123473 133 0 1 -12 72 294 0.551 17.50 3.08 Intr + 125068 125224 157 1 1 109 91 132 0.999 15.61 3.09 Term + 129024 129206 183 2 0 111 48 248 0.980 20.64 3.10 PlyA + 134923 134928 6 1.05 4.03 PlyA - 135686 135681 6 1.05 4.02 Term - 140507 140456 52 1 1 103 49 59 0.622 0.40 4.01 Init - 142142 142006 137 2 2 65 63 97 0.748 4.51 4.00 Prom - 148412 148373 40 -3.66 5.00 Prom + 150792 150831 40 -5.36 5.01 Init + 152693 152761 69 1 0 87 84 74 0.542 7.95 5.02 Intr + 154914 155021 108 2 0 106 77 19 0.285 3.08 5.03 Intr + 160783 160906 124 0 1 44 89 83 0.114 4.16 5.04 Intr + 164561 164717 157 0 1 121 39 44 0.042 1.87 5.05 Intr + 183973 184085 113 1 2 106 38 61 0.117 2.92 5.06 Intr + 185501 185569 69 0 0 72 75 63 0.172 2.55 5.07 Term + 192390 192835 446 1 2 -12 48 263 0.072 7.70 5.08 PlyA + 193870 193875 6 1.05 6.00 Prom + 204232 204271 40 -3.06 6.01 Sngl + 205268 207001 1734 1 0 79 47 387 0.751 28.56 6.02 PlyA + 207209 207214 6 1.05 7.03 PlyA - 207331 207326 6 1.05 7.02 Term - 212129 212124 6 2 0 99 44 0 0.048 -5.43 7.01 Init - 216338 216261 78 2 0 68 115 53 0.361 7.06 7.00 Prom - 228767 228728 40 -2.56 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init + 55845 55904 60 2 0 83 66 27 0.812 0.52 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815592f:150269032_150498234|GENSCAN_predicted_peptide_1|117_aa MGYSKDVDASYYYYSCRNIKPGEGVPLPPGGSQYGRGHWGPGSSPREGKEMRGAAVWNLW SRLLLLHRAHLCFLDALTVSNNKDKGRARDLKGRRSVEEEEKRWGGCFTFSTHSRLL >gi568815592f:150269032_150498234|GENSCAN_predicted_CDS_1|354_bp atggggtacagtaaggatgtggatgccagctattattattactcttgtagaaacataaag ccaggggaaggtgtgcccctgcccccggggggcagtcagtatgggagaggacactgggga cctggaagcagcccccgggagggtaaggagatgagaggagcagccgtctggaacctgtgg tcacggctgttgctgctgcacagggcacacctctgtttcctggatgctctgactgtcagc aataacaaggacaaaggtagggcaagggacctaaaaggacgaaggagtgtggaggaggag gagaagcgctggggtggctgtttcaccttcagcacacactccaggctgctctga >gi568815592f:150269032_150498234|GENSCAN_predicted_peptide_2|127_aa MKHAKGNGDAHYWEGAHLWRKPTSRLSSVTREAIERVLECPGMPLEVIYATIFDNFICLL ISKDEASKELMKKMTHESGFGYDIYDKRTQVPSDLEPFQVPGTKKANKVEKHFLLSESFY PGERDRQ >gi568815592f:150269032_150498234|GENSCAN_predicted_CDS_2|384_bp atgaaacatgcaaagggcaacggtgatgcccactactgggagggagcccacctttggaga aagcccaccagtagactctcctcagttaccagggaagccatagagagagttctggaatgc cctggaatgcccttagaagtcatctatgctacaatttttgacaattttatttgccttctg atctccaaggatgaagcatcaaaggagctaatgaagaaaatgacccatgagtcaggcttt ggatatgacatttacgacaaacggacacaggttccaagtgacctggagcccttccaggtg ccgggaacaaaaaaggccaacaaagtggaaaagcatttcctgctctcagagagtttctat cctggagagagagaccgacaataa >gi568815592f:150269032_150498234|GENSCAN_predicted_peptide_3|415_aa MSGFDGYTVVPLGDQDLNSTSICQTPFRAAILCILVVWIFKNADRSMEKKKGEPRTRAEA RPWVDEDLKDSSDLHQAEEGGNAVEGVDQSKVNGRATGFASKLDIGKRRREGELETGSYH KWTHAYALKKSGPWNPGTWQHRKIVRGLQFYTVFFPHQPVLAFLAPVIDPSVASSSSLRS STTDNELAELSEQEDADEWQESEENVEHIPFSHNHYPEKEMVKRSQEFYELLNKRRSVRF ISNEQVPMEVIDNVIRTAEPWTFVVVKDPDVKHKIRKIIEEEEEINYMKRMGHRWVTDLK KLRTNWIKEYLDTAPILILIFKQVHGFAANGKKKVHYYNEISVSIACGILLAALQNAGLV TVTTTPLNCGPRLRVLLGRPAHEKLLMLLPVGYPSKEATVPDLKRKPLDQIMVTV >gi568815592f:150269032_150498234|GENSCAN_predicted_CDS_3|1248_bp atgtcaggatttgatggctacactgtggttcctcttggtgaccaagacctcaattccacg agcatttgccaaacaccttttagggcagccattctctgcattttggttgtgtggatcttt aaaaatgccgacagaagcatggagaaaaagaagggggagcctagaaccagggccgaagct cgcccctgggtggatgaagacttaaaagacagcagtgacctgcaccaagcagaagaaggt gggaatgcagtggaaggcgtggatcagagtaaagttaatggaagagcaacagggtttgct agtaaattggatattgggaaaaggaggagagagggagagttggagactggaagctatcat aagtggactcatgcctatgctctgaaaaagagtggtccatggaatccagggacatggcag cacaggaaaatcgtcagaggtttacaattttacactgtcttcttccctcatcagcctgtt ttggcatttttagctccagtgattgacccttctgttgcatcttcatcttctctaagatct tcaactactgataatgagcttgcagaactttctgaacaggaggatgctgatgaatggcaa gaatcagaagaaaatgttgaacacatccccttctctcataaccactatcctgagaaggaa atggttaagaggtctcaggaattttatgaacttctcaataagagacggtcagtcaggttc ataagtaatgagcaagtcccaatggaagtcattgataatgtcatcagaacggcagagccc tggaccttcgtggttgtgaaggacccagacgtgaagcacaagattcgaaagatcattgag gaggaagaggagatcaactacatgaaaaggatgggacatcgctgggtcacagacctcaag aaactgagaaccaactggattaaagagtacttggatactgcccctattttgattctcatt ttcaaacaagtacatggtttcgccgcaaatggcaagaaaaaagtccactactacaatgag atcagtgtttccatcgcttgtggcatcctgctagctgccctgcagaatgcaggtctggtg actgtcactaccactcctctcaactgtggccctcgactgagggtgctcctgggccgcccc gcacatgaaaagctgctgatgctgctccccgtggggtaccccagcaaggaggccacggtg cctgacctcaagcgcaaacctctggaccagatcatggtgacagtgtag >gi568815592f:150269032_150498234|GENSCAN_predicted_peptide_4|62_aa MAMIYGFFYVWNATCHIKIKTAALSDEEQTLEYSLEEPFHTQAIFKCRSSMEVLKHLPKI TI >gi568815592f:150269032_150498234|GENSCAN_predicted_CDS_4|189_bp atggctatgatctatggcttcttctatgtctggaatgccacatgtcatattaaaattaag actgcagctttaagtgatgaagaacaaacactggaatattctttagaagagcctttccac actcaggctattttcaaatgtagatcttcgatggaggtgttgaaacacttgcccaagatc accatttag >gi568815592f:150269032_150498234|GENSCAN_predicted_peptide_5|361_aa MQRGPHDEEMKPSSNSRHRLASHGPRRIREDGSDTIPECPIPLSGCWEQQARPGSPALPL ILANLAMNVADISPPVGRKYRKALGRVAHLSVVEAAEVARGLSPQPSQGNFSHCHGFHVI RTLMVTTLIAPVLISPLILDNCQLDITSQMSDRPQCWGQQFMHKKSGFFGAIHAAKCTLT FQAKPHSAFEVILLDAWHWKWIPSIVPDTHNAIGIHLTRVVKDLFKENYKPVLNEIKEDT NKWKNIPCSWMGRINIVKMAMLPKLIYRFNAIPIKLPMTFFTELEKTTLKFIWNQNRACI AKTILSQKNKAGGITLSDFKLYYKATVTKTAWYWYQNTDTDQWNRTEASENNTTHLQPSD R >gi568815592f:150269032_150498234|GENSCAN_predicted_CDS_5|1086_bp atgcagagagggccacatgatgaggaaatgaagccttcttccaacagccggcaccgactt gccagccatggccccaggcgaatcagagaagatggctcagacaccatcccagagtgccca attccactcagtggctgctgggagcagcaggccaggcctggatcaccagccctacccctc atcctcgccaacttggccatgaatgtggctgatataagcccaccagtggggaggaagtac aggaaggccttaggccgtgttgcccatctcagtgtggtagaagctgcagaagttgcccgc ggactcagtcctcagccttcccaaggcaatttcagtcactgccatgggttccatgttatt cgtacattgatggttaccacacttatagctcctgtcctgatctctcccttgatcttggat aactgccagcttgacatcacctctcagatgtctgacaggccccagtgctggggtcagcag ttcatgcataagaaatcaggattctttggagcaattcatgctgccaaatgcacactgaca ttccaggccaaaccccactccgcatttgaagtcatcctgctcgatgcttggcactggaaa tggattccctcgattgtgccagatacccacaatgccataggaatccatcttacaagggtt gtaaaggacctcttcaaagagaactacaaaccagtgctcaatgaaataaaagaggacaca aacaaatggaagaacattccatgctcatggatgggaagaatcaatatcgtgaaaatggcc atgctgcccaagttaatttatagattcaatgccatccccatcaagctaccaatgactttc ttcacagaattggaaaaaactactttaaagttcatatggaaccaaaatagagcctgcatt gccaagacaatcctaagccaaaagaacaaagctgggggcatcacgctatctgacttcaaa ctatactacaaggctacagtaaccaaaacagcatggtactggtaccaaaacacagataca gaccaatggaacagaacagaggcctcagaaaataacaccacacatctacaaccatctgat cgttga >gi568815592f:150269032_150498234|GENSCAN_predicted_peptide_6|577_aa MIVYLENPIVSAQNLLKLISNFSKVSGYKINVQKSQAFLYTNNRQTESQIMSELPFKIAS KRIKYLGIQLTRDVKDLFKENYKPLLKEIKEDTHKWKNIPCSWVGRINIVKMAILPKVIY RFNAIPIKLPMTFFTELEKTTLKFIWNEKRACIAKSILSQKNKAGGITLPDFKLYYKATV TKTAWYWYQNRDVDQRNRTEPSEITLHIYNYLIFDKPEKNKKRGKDSLFNKWCWENWLAI CRKLKLDPFLTSYTKINSRWIKDLNVRPKTIKTLEENLGITIQDIGMGKDFMSKTPKAMA TKAKIDKWDLIKLKSFCTAKETTIRVNKQPTKWEKIFATYSSDKGLISRICHELKQIYKK KTNNPIKKWMKDMNRHFSKEDIYAAKKHMKKCSPSLAIREMQIKITMRYHLTPVRMAIIK KSGNNRRWRGCGEIGTLLHCWWDCKLVQPLWKSVWQFLRDLELEIPFDPAIPLLGIYPKD YKSCCYKDTCTHMFIAALFTIAKTWNQPKCPTMIDWIKKMWHIYTMEYYAAIKNAAIKND EFTSFVGTWMKLEIIILSKLSQEQKTKHRIFLLIGGN >gi568815592f:150269032_150498234|GENSCAN_predicted_CDS_6|1734_bp atgattgtatatctagaaaaccccattgtctcagcccaaaatctccttaagctgataagc aacttcagcaaagtctcaggatacaaaatcaatgtacaaaaatcacaagcattcttatac accaacaacagacaaacagagagccaaatcatgagtgaactcccattcaaaattgcttca aagagaataaaatacctaggaatacaacttacaagggatgtgaaggacctcttcaaggag aactacaaaccactgctcaaggaaataaaagaggatacacacaaatggaagaacattcca tgctcatgggtaggaagaatcaatatcgtgaaaatggccatactccccaaggtaatttac agattcaatgccatccccatcaagctaccaatgactttcttcacagaactggaaaaaact actttaaagttcatatggaacgaaaaaagagcctgcatcgccaagtcaatcctaagccaa aagaacaaagctggaggcatcacactacctgacttcaaactatactacaaggctacggta accaaaacagcatggtactggtaccaaaacagagatgtagatcaacggaacagaacagag ccctcagaaataacgctgcatatctacaactatctgatctttgacaaacctgagaaaaac aagaaacggggaaaggattccctatttaataaatggtgctgggaaaactggctagccata tgtagaaagctgaaactggatcccttccttacatcttatacaaaaatcaattcaagatgg attaaagacttaaacgttagacctaaaaccataaaaaccctagaagaaaacctaggcatt accattcaggacataggcatgggcaaggacttcatgtctaaaacaccaaaagcaatggca accaaagccaaaattgacaaatgggatctaattaaactaaagagcttctgcacagcaaaa gaaactaccatcagagtgaacaagcaacctacaaaatgggagaaaattttcgcaacctac tcatctgacaaagggctaatatcaagaatctgccatgaactcaaacaaatttacaagaaa aaaacaaacaaccccatcaaaaagtggatgaaggacatgaacagacacttctcaaaagaa gacatttatgcagccaaaaaacacatgaaaaaatgctcaccatccctggccatcagagaa atgcaaatcaaaatcacaatgagataccatctcacaccagttagaatggcaatcattaaa aagtcaggaaacaacaggcgctggagaggatgtggagaaataggaacacttttacactgt tggtgggactgcaaactagttcaaccattgtggaagtcagtgtggcaattcctcagggat ctagaactagaaataccatttgacccagccatcccattactgggtatatacccaaaggac tataaatcatgctgctataaagacacatgcacacatatgtttattgcagcattattcaca atagcaaagacttggaaccaacccaaatgtccaacaatgatagactggattaagaaaatg tggcacatatacaccatggaatactacgcagccataaaaaacgcagccataaaaaatgat gagttcacgtcctttgtagggacatggatgaaattggaaatcatcattctcagtaaacta tcgcaagaacaaaaaaccaaacaccgcatattcttactcataggtgggaattga >gi568815592f:150269032_150498234|GENSCAN_predicted_peptide_7|27_aa MGYYPAIKRNEIPSFAATQMELEDIMV >gi568815592f:150269032_150498234|GENSCAN_predicted_CDS_7|84_bp atgggatactatccagccataaaaaggaatgaaatcccgtcatttgcagcaacacagatg gaactggaagacattatggtgtga