GENSCAN 1.0 Date run: 5-Nov-116 Time: 23:32:47 Sequence gi568815596r:86503846_86722866 : 219021 bp : 40.08% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.07 PlyA - 438 433 6 1.05 1.06 Term - 2104 1959 146 2 2 113 50 239 0.999 19.69 1.05 Intr - 3748 3634 115 1 1 59 80 178 0.967 13.20 1.04 Intr - 6634 6513 122 2 2 73 57 172 0.678 11.89 1.03 Intr - 25552 25373 180 2 0 64 99 191 0.910 16.72 1.02 Intr - 41487 41137 351 1 0 8 78 222 0.250 7.57 1.01 Init - 53062 52996 67 1 1 72 68 97 0.572 7.39 1.00 Prom - 86798 86759 40 -1.75 2.06 PlyA - 86915 86910 6 1.05 2.05 Term - 100803 99998 806 1 2 38 41 332 0.356 15.89 2.04 Intr - 101573 101368 206 1 2 97 -8 155 0.330 4.72 2.03 Intr - 108429 108314 116 0 2 47 111 65 0.693 3.03 2.02 Intr - 116624 116485 140 0 2 112 80 127 0.652 13.56 2.01 Init - 119041 118615 427 1 1 70 -27 449 0.762 26.11 2.00 Prom - 132203 132164 40 -4.45 3.03 PlyA - 132540 132535 6 1.05 3.02 Term - 142841 142733 109 1 1 -4 52 127 0.005 -3.10 3.01 Init - 155144 152053 3092 2 2 44 53 1108 0.581 94.37 3.00 Prom - 155402 155363 40 -5.25 4.02 PlyA - 155484 155479 6 -4.04 4.01 Sngl - 156518 155502 1017 2 0 88 43 739 0.998 66.17 4.00 Prom - 159821 159782 40 -8.65 5.00 Prom + 162309 162348 40 -6.15 5.01 Sngl + 162780 163475 696 2 0 47 41 360 0.311 23.15 5.02 PlyA + 163601 163606 6 1.05 6.00 Prom + 165521 165560 40 -3.65 6.01 Init + 189356 189496 141 1 0 57 76 166 0.813 12.48 6.02 Intr + 200278 200381 104 0 2 65 36 70 0.001 -2.45 6.03 Intr + 209067 209229 163 0 1 61 116 63 0.004 5.46 6.04 Intr + 214554 214692 139 2 1 15 23 138 0.016 -0.88 6.05 Intr + 216710 216964 255 0 0 23 66 338 0.948 21.59 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term + 59001 59300 300 2 0 38 43 263 0.905 11.04 S.002 Term - 202174 202002 173 2 2 102 28 134 0.836 5.81 S.003 Init + 215290 215470 181 0 1 49 55 144 0.922 6.61 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815596r:86503846_86722866|GENSCAN_predicted_peptide_1|326_aa MVFRVRPADPGQTTDERIHSDTGEERLCPAALRPGGEERLCPAALRPGGEERLCPAALRP GGEERLCPAALRPGGEERLCPAALRPGGEERLCPAALRPGGEERLCPAALRPGGEERLCP AALRPGGEERLCPAALRLGDIQREEEKVKRSVKDAAKKGQKDVCIVLAKEMIRSRKAVSK LYASKAHMNSVLMGMKNQLAVLRVAGSLQKSTEVMKAMQSLVKIPEIQATMRELSKEMMK AGIIEEMLEDTFESMDDQEEMEEEAEMEIDRILFEITAGALGKAPSKVTDALPEPEPPGA MAASEDEEEEEEALEAMQSRLATLRS >gi568815596r:86503846_86722866|GENSCAN_predicted_CDS_1|981_bp atggtgttccgggtccgacccgcagatcctggccaaacgacggatgaaagaatacactca gacacaggtgaggagcgcctctgcccggccgcccttcgtccgggaggtgaggagcgcctc tgcccggccgcccttcgtccgggaggtgaggagcgcctctgcccggccgcccttcgtccg ggaggtgaggagcgcctctgcccggccgcccttcgtccgggaggtgaggagcgcctctgc ccggccgcccttcgtccgggaggtgaggagcgcctctgcccggccgcccttcgtccggga ggtgaggagcgcctctgcccggccgcccttcgtccgggaggtgaggagcgcctctgcccg gccgcccttcgtccgggaggtgaggagcgcctctgcccggccgcccttcgtctgggagat atccaaagagaagaagaaaaagtgaaacgatctgtgaaagatgctgccaagaagggccag aaggatgtctgcatagttctggccaaggagatgatcaggtcaaggaaggctgtgagcaag ctgtatgcatccaaagcacacatgaactcagtgctcatggggatgaagaaccagctcgcg gtcttgcgagtggctggttccctgcagaagagcacagaagtgatgaaggccatgcaaagt cttgtgaagattccagagattcaggccaccatgagggagttgtccaaagaaatgatgaag gctgggatcatagaggagatgttagaggacacttttgaaagcatggacgatcaggaagaa atggaggaagaagcagaaatggaaattgacagaattctctttgaaattacagcaggggcc ttgggcaaagcacccagtaaagtgactgatgcccttccagagccagaacctccaggagcg atggctgcctcagaggatgaggaggaggaggaagaggctctggaggccatgcagtcccgg ctggccacactccgcagctag >gi568815596r:86503846_86722866|GENSCAN_predicted_peptide_2|564_aa MWLKLFFLLLYFLVLFVLARFFEAIVWYETGIFATQLVDPVALSFKKLKTILECRGLGYS GLPEKKDVRELVEKSGEYRASPRVPSTWERGRPVPPGTWLFPQSEAFPGSVKAAQVSSSK LPTIKYADPSFEEESEIIWRSVGDLMEGELYSALKEEEASESVSSTNFSGEMHFYELVED TKDGIWLVQVIANDRSPLVGKIHWEKMVKKVSRFGIRTGTFNCSSDPRYCRRRGWVRSTL IMSVPQTSTSKGKVMLKEYSGRKIEVEHIFKWITAHAASRIKTIYNAEHLKEEWNKTLFL STYLGHGLLIDYFEKKRRRNNNNDEVNANNLEWLSSLWDWYTSYLFHPIASFQNFPVESD WDEDPDLFLERLAFPDLWLHPLIPTDYIKNLPMWRFKCLGVQSEEEMSEGSQDTENDSES ENTDTLSSEKEVFEDKQSVLHNSPGTASHCDAEACSCANKYCQTSPCERKGRSYGSYNTN EDMEPDWLTWPADMLHCTECVVCLENFENGCLLMGLPCGHVFHQNCIVMWLAGGRHCCPV CRWPSYKKKQPYAQHQPLSNDVPS >gi568815596r:86503846_86722866|GENSCAN_predicted_CDS_2|1695_bp atgtggctgaagctttttttcttgctcctctatttcctggtcctgttcgtcctggccagg ttttttgaggccattgtgtggtatgaaactggcatctttgccacccagctggtggatccg gtggcgctgagcttcaagaagctgaagaccattttggagtgccgggggttgggctactca gggttgcccgagaagaaggatgtccgggagctggtggaaaagtcaggcgagtatcgggct tcccctagggtcccctccacctgggagaggggacgacctgtacctcctggtacctggctg ttcccccagagtgaggcctttcctggaagcgttaaagctgcccaggtttcttcgtccaag cttcctacaatcaagtatgcggatcccagctttgaggaggagtctgaaattatttggagg agtgttggtgacttgatggagggtgagctctattctgctctcaaggaagaagaagcatcc gaatcggtttctagtaccaatttcagtggtgaaatgcacttctatgagcttgtggaagac acaaaagatggcatctggctggttcaggtcatagcaaatgacagaagtcccttggtgggc aaaattcactgggagaaaatggttaaaaaggtgtcaagatttggaatacgtacaggcaca tttaactgttccagtgatcccagatattgcaggagaagaggctgggtccgatccacactc attatgtctgttccacaaacaagtacttcaaaagggaaagtcatgcttaaagaatacagt ggacgcaagattgaagtagagcacatttttaaatggataactgctcatgcagcttctcgg atcaaaaccatttataatgctgaacacttgaaagaagaatggaataaaaccctgtttctc agtacataccttggtcatggtttactaattgattactttgagaagaagagaaggcgcaac aacaacaatgatgaagtcaatgccaataacttagaatggttatcaagtctgtgggactgg tacaccagctacctcttccacccgattgcttcttttcagaactttcctgtagaatctgat tgggacgaagaccctgacttattcttggagcgcttagctttccctgacctttggcttcac cctctgataccaactgattatattaaaaacttaccaatgtggcgatttaaatgtcttgga gtccagtctgaagaggaaatgtcggaggggtctcaagatactgaaaatgactcggaaagt gagaacacagacactttgagtagtgagaaggaagtatttgaagataagcaaagcgtactt cacaattctccaggaacagcaagtcactgtgatgctgaggcttgttcatgtgccaataaa tattgtcagaccagcccatgtgaaaggaaggggaggtcatatggatcatataacactaat gaagatatggaacctgattggttaacttggcctgctgatatgctgcactgtactgaatgt gttgtttgcctagagaattttgaaaatggatgtttgctaatggggttgccttgtggtcat gtgtttcatcagaattgcattgtgatgtggttggctgggggccgacattgttgccctgtt tgccggtggccttcttataaaaaaaagcagccatatgcacaacaccagcccttgtcaaat gatgtcccatcttaa >gi568815596r:86503846_86722866|GENSCAN_predicted_peptide_3|1066_aa MVKGSIQQEELTILNMYAPNTGAPRFIKQVLSDLQRDLDSHTLIMGDFNTPLSTLDRSTR QKVNKDTQELNSALHQADLIDIYRTLHPKSAEYTFFSAPHHTYSKIDHIVGSKALLSKCK RTEIITNYLSDHSAIKLELRIKNLTQSRSTTWKLNNLLLNDYWVHNEMKAEIKMFFETNE NKDTTYQNLWDAFKAVCRGKFIALNAYKRKQERSKIDTLTSQLKELEKQEQTHSKASRRQ EITKIRAELKEIETQKTLQKINESRSWFFERINKIDRPLARLIKKKREKNQIDTIKNDKG DITTDPTEIQTTIREYYKHLYANKLENLEEMDTFLDTYTLPRLNQEEVESLNRPITGSEI VAIINSLPTKKSPGPDGFTAEFYQRYKEELVPFLLKRFQSIEKEGILPNSFYEASIILIP KPGRDTTKKENFRPISLMNIDAKILNKILANRIQQHIKKLIHHDQVGFIPGMQGWFNIRK SINVIQHINRAKDKNHMIISIDAEKAFDKIQQPFMLKTLNKLGIDGTYFKIIRAIYDKPT ANIILNGQKLEAFPLKTGTRQGCPLSPLLFNIVLEVLARAIRQEKEIKGIQLGKEEVKLS LFADDMIVYLENPIVSAQNLLKLISNFSKVSGYKINVQKSQAFLYTNNRQTESQIMSELP FTIASKRIKYLGIQLTRDVKDLFKENYKPLLKEIKEDTNKWKNIPCSWVGRINIVKMAIL PKVIYRFNAIPIKLPMTFFTELEKTTLKFIWNQKRARIAKSILSQKNKAGGITLPDFKLY YKATVTKTAWYWYQNRDIDQWNRTEPSEIMPHIYNYLIFDKPEKNKQWGKDSLFNKWCWE NWLAICRKLKLDPFLTPYTKINSRWIKDLNVRPKTIKTLEENLGITIQDIGVGKDFMSKT PKAMATKAKIDKWDLIKLKSFCTAKETTIRVNRQPTTWEKIFATYSSDKGLISRIYNELK QIYKKKTNNPIKKWAKDMNRHFSKEDIYAAKKHMKKCSSSLAIREMQIKTTMRYHLTPVR MAIIKKSGNNSIDYARLIGFGFSEHQEISGESNSETLLPSMYSPST >gi568815596r:86503846_86722866|GENSCAN_predicted_CDS_3|3201_bp atggtaaagggatcaattcaacaagaggagctaactatcctaaatatgtatgcacccaat acaggagcacccagattcataaagcaagtcctgagtgacctacaaagagacttagactcc cacacattaataatgggagactttaacaccccactgtcaacattagacagatcaacgaga cagaaagtcaacaaggatacccaggaattgaactcagctctgcaccaagcagacctaata gacatctacagaactctccaccccaaatcagcagaatatacatttttttcagcaccacac cacacctattccaaaattgaccacatagttggaagtaaagctctcctcagcaaatgtaaa agaacagaaattataacaaactatctctcagaccacagtgcaatcaaactagaactcagg attaagaatctcactcaaagccgctcaactacatggaaactgaacaacctgctcctgaat gactactgggtacataacgaaatgaaggcagaaataaagatgttctttgaaaccaacgag aacaaagataccacataccagaatctctgggacgcattcaaagcagtgtgtagagggaaa tttatagcactaaatgcctacaagagaaagcaggaaagatccaaaattgacaccctaaca tcacaattaaaagaactagaaaagcaagagcaaacacattcaaaagctagcagaaggcaa gaaataactaaaatcagagcagaactgaaggaaatagagacacaaaaaacccttcaaaaa atcaatgaatccaggagctggttttttgaaaggatcaacaaaattgatagaccgctagca agactaataaagaaaaaaagagagaagaatcaaatagacacaataaaaaatgataaaggg gatatcaccaccgatcccacagaaatacaaactaccatcagagaatactacaaacacctc tacgcaaataaactagaaaatctagaagaaatggatacattcctcgacacatacactctc ccaagactaaaccaggaagaagttgaatctctgaatagaccaataacaggctctgaaatt gtggcaataatcaatagtttaccaaccaaaaagagtccaggaccagatggattcacagcc gaattctaccagaggtacaaggaggaactggtaccattccttctgaaacgattccaatca atagaaaaagagggaatcctccctaactcattttatgaggccagcatcattctgatacca aagccgggcagagacacaaccaaaaaagagaattttagaccaatatccttgatgaacatt gatgcaaaaatcctcaataaaatactggcaaaccgaatccagcagcacatcaaaaagctt atccaccatgatcaagtgggcttcatccctgggatgcaaggctggttcaatatacgcaaa tcaataaatgtaatccagcatataaacagagccaaagacaaaaaccacatgattatctca atagatgcagaaaaagcctttgacaaaattcaacaacccttcatgctaaaaactctcaat aaattaggtattgatgggacgtatttcaaaataataagagctatctatgacaaacccaca gccaatatcatactgaatgggcaaaaactggaagcattccctttgaaaactggcacaaga cagggatgccctctctcaccgctcctattcaacatagtgttggaagttctggccagggca atcaggcaggagaaggaaataaagggtattcaattaggaaaagaggaagtcaaattgtcc ctgtttgcagacgacatgattgtttatctagaaaaccccatcgtctcagcccaaaatctc cttaagctgataagcaacttcagcaaagtctcaggatacaaaatcaatgtacaaaaatca caagcattcttatacaccaacaacagacaaacagagagccaaatcatgagtgaactccca ttcacaattgcttcaaagagaataaaatacctaggaatccaacttacaagggatgtgaag gacctcttcaaggagaactacaaaccactgctcaaggaaataaaagaggacacaaacaaa tggaagaacattccatgctcgtgggtaggaagaatcaatatcgtgaaaatggccatactg cccaaggtaatttacagattcaatgccatccccatcaagctaccaatgactttcttcaca gaattggaaaaaactactttaaagttcatatggaaccaaaaaagagcccgcatcgccaag tcaatcctaagccaaaagaacaaagctggaggcatcacactacctgacttcaaactatac tacaaggctacagtaaccaaaacagcatggtactggtaccaaaacagagatatagatcaa tggaacagaacagagccctcagaaataatgccgcatatctacaactatctgatctttgac aaacctgagaaaaacaagcaatggggaaaggattccctgtttaataaatggtgctgggaa aactggctagccatatgtagaaagctgaaactggatcccttccttacaccttatacaaaa atcaattcaagatggattaaagatttaaacgttagacctaaaaccataaaaaccctagaa gaaaacctaggcattaccattcaggacataggcgtgggcaaggacttcatgtccaaaaca ccaaaagcaatggcaacaaaagccaaaattgacaaatgggatctaattaaactcaagagc ttctgcacagcaaaagaaactaccatcagagtgaacaggcaacctacaacatgggagaaa attttcgcaacctactcatctgacaaagggctaatatccagaatctacaatgaactcaaa caaatttacaagaaaaaaacaaacaaccccatcaaaaagtgggcgaaggacatgaacaga cacttctcaaaagaagacatttatgcagccaaaaaacacatgaagaaatgctcatcatca ctggccatcagagaaatgcaaatcaaaaccactatgagatatcatctcacaccagttaga atggcaatcattaaaaagtcaggaaacaacagcatagattatgctcgcctcatcggattt ggcttttctgaacaccaggaaatttctggagaaagtaattcagaaacattactcccttcc atgtactcgccaagtacttga >gi568815596r:86503846_86722866|GENSCAN_predicted_peptide_4|338_aa MGKKQNRKTGNSKTQSASPPPKERSSSPATEQSWMENDFDELREEGFRRSNYSELREDIQ TKGKEVENFEKNLEECITRITNTEKCLKELMELKTKARELCEECRSLRSRCDQLEERVSA MEDEMNEMKQEGKFREKRIKRNEQSLQEIWDYVKRPNLRLIGVPESDVENGTKLENTLQD IIQENFPNLARQANVQIQEIQRMPQRYSSRRATPRHIIVRFTKVEMKEKMLRAAREKGQV TLKGKPIRLTADLSAETLQARREWGPIFNILKEKNFQPRISYPAKLSFISEGEIKYFIDK QMLRDFVTTRPALKELLKEALNMERNNRYQPLQNHAKM >gi568815596r:86503846_86722866|GENSCAN_predicted_CDS_4|1017_bp atggggaaaaaacagaacagaaaaactggaaactctaaaacgcagagtgcctctcctcct ccaaaggaacgcagttcctcaccagcaacagaacaaagctggatggagaatgattttgac gagctgagagaagaaggcttcagacgatcaaattactctgagctacgggaggacattcaa accaaaggcaaagaagttgaaaactttgaaaaaaatttagaagaatgtataactagaata accaatacagagaagtgcttaaaggagctgatggagctgaaaaccaaggctcgagaacta tgtgaagaatgcagaagcctcaggagccgatgcgatcaactggaagaaagggtatcagca atggaagatgaaatgaatgaaatgaagcaagaagggaagtttagagaaaaaagaataaaa agaaatgagcaaagcctccaagaaatatgggactatgtgaaaagaccaaatctacgtctg attggtgtacctgaaagtgatgtggagaatggaaccaagttggaaaacactctgcaggat attatccaggagaacttccccaatctagcaaggcaggccaacgttcagattcaggaaata cagagaatgccacaaagatactcctcgagaagagcaactccaagacacataattgtcaga ttcaccaaagttgaaatgaaggaaaaaatgttaagggcagccagagagaaaggtcaggtt accctcaaaggaaagcccatcagactaacagcggatctctcggcagaaaccctacaagcc agaagagagtgggggccaatattcaacattcttaaagaaaagaattttcaacccagaatt tcatatccagccaaactaagcttcataagtgaaggagaaataaaatactttatagacaag caaatgctgagagattttgtcaccaccaggcctgccctaaaagagctcctgaaggaagcg ctaaacatggaaaggaacaaccggtaccagccgctgcaaaatcatgccaaaatgtaa >gi568815596r:86503846_86722866|GENSCAN_predicted_peptide_5|231_aa MFFETNENKDTMYQSLWDTFKAVCRGKFIAPNAHKRNQERSKIDTLTSQLKELEKQEQTN SKASRRQEITNIRAELKEIETQKTLQKINESRSWFFERSDKIDRPLARLIKKKREKTQVD AIKNDKGDITTDPTEIQTTIREYYKHLYANKLENLEEMDKFMDTYPLPRLNQEEVESLNR PITGSEIEAIINSLPTKKSPGPDGFTAEFYQRYKEKLVSFLLKLFQSIEKE >gi568815596r:86503846_86722866|GENSCAN_predicted_CDS_5|696_bp atgttctttgaaaccaatgagaacaaagacacaatgtaccagagtctctgggacacattt aaagcagtgtgtagagggaaatttatagcaccaaatgcccacaagagaaaccaggaaaga tctaaaatcgacaccctaacatcacaattaaaagaactagagaagcaagagcaaacaaat tcaaaagctagcagaagacaagaaataactaatatcagagcagaactgaaggagatagag acacaaaaaacccttcaaaaaatcaatgaatccaggagctggttttttgaaaggagtgac aaaattgatcgaccattagcaagactaataaagaagaaaagagagaagactcaagtagat gcaataaaaaatgataaaggggatatcaccactgatcctacagaaatacaaactaccatc agagaatactacaaacacctctatgcaaataaactagaaaatctagaagaaatggataaa ttcatggacacatacccccttccaagactaaaccaggaagaagttgaatctctgaataga ccaataacaggatctgaaattgaggcaataattaatagcctaccaaccaaaaaaagtcca ggaccagatggattcacagctgaattttaccagaggtacaaagagaagctggtgtcattc cttctgaaactattccaatcaatagaaaaagagtga >gi568815596r:86503846_86722866|GENSCAN_predicted_peptide_6|268_aa MALEITTSAIRQEKEVKGIQTEKEEVKLLLFEDNMIITENSKEFAKQFSFLRSSSFIQSR SNCHKRRLSSFVQSRSDCYILGALVGSHSEYQRKKIHASGRRKEKVILKYARAFNDKEKI LGRARGEKYLTYRRTKVKPIWAESRKEEPTAEKEGEDLDRGERQPAGPNGCLGKGEGTPG PLASRAARLGNEEQDAASVGPGPNGCGHLGAEEPSAAASGMDQCVTVERELEKVLHKFSG YGQLCERGLEELIDYTGGLKHEILQSHX >gi568815596r:86503846_86722866|GENSCAN_predicted_CDS_6|804_bp atggcactggagattacaaccagtgcaatcagacaagaaaaagaagtaaagggcatccag actgaaaaggaagaagtaaaattgttattatttgaagacaacatgatcattacggaaaat tcaaaggaatttgcaaaacagttttcatttctgcgttcgtcctctttcattcagtcccgt agtaactgtcacaagcggcgtttgtcctcctttgttcagtctcgtagtgactgttacata ttaggagctttggtaggttctcacagtgaatatcagagaaaaaaaatccatgcttctgga aggagaaaggaaaaagttattttgaaatatgccagagcattcaatgataaagaaaaaatt cttggaagagccagaggagaaaagtaccttacctatagaagaacaaaggtcaaacccatt tgggctgaaagtagaaaggaagaacctactgcagagaaagagggggaagatctggacagg ggtgagaggcagcccgcaggccccaacggctgtcttggaaagggagaagggactcctggt cctttggcctcccgcgccgcccgcttggggaacgaggagcaggacgcggcctcggtgggg cccgggccgaacggctgcggacacctgggcgccgaggagccgagcgccgccgcctccggc atggatcagtgcgtgacggtggagcgcgagctggagaaggtgctgcacaagttctcaggc tacgggcagctgtgcgagcgcggcctggaggagctcatcgactacaccggcggcctcaag cacgagatcctgcagagccacgnn