GENSCAN 1.0 Date run: 6-Nov-116 Time: 17:20:53 Sequence gi568815594r:105582736_105800005 : 217270 bp : 36.21% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 6513 6700 188 1 2 86 94 123 0.627 11.19 1.02 Intr + 30649 30772 124 0 1 103 76 138 0.664 13.34 1.03 Intr + 39115 39288 174 2 0 -68 81 195 0.030 2.19 1.04 Intr + 48163 48310 148 2 1 61 78 77 0.068 2.37 1.05 Intr + 62453 62652 200 2 2 55 63 125 0.093 4.77 1.06 Intr + 63788 63937 150 0 0 61 53 81 0.648 1.11 1.07 Intr + 65814 65947 134 1 2 99 91 51 0.981 5.94 1.08 Intr + 72868 72987 120 0 0 81 78 112 0.930 9.27 1.09 Intr + 76319 76630 312 1 0 90 119 266 0.614 25.36 1.10 Intr + 83442 83585 144 2 0 51 105 98 0.974 7.36 1.11 Intr + 84394 84592 199 0 1 73 100 246 0.999 22.30 1.12 Intr + 84709 84968 260 2 2 82 98 322 0.984 28.86 1.13 Term + 95017 95202 186 0 0 88 48 136 0.942 6.11 1.14 PlyA + 96006 96011 6 1.05 2.03 PlyA - 96126 96121 6 1.05 2.02 Term - 96477 96287 191 1 2 43 49 217 0.945 9.93 2.01 Init - 97284 97062 223 0 1 86 38 101 0.523 3.67 2.00 Prom - 99087 99048 40 -8.15 3.11 PlyA - 99301 99296 6 1.05 3.10 Term - 100582 99998 585 1 0 83 35 546 0.999 42.32 3.09 Intr - 104103 103957 147 0 0 64 75 148 0.992 10.61 3.08 Intr - 109400 109241 160 1 1 39 58 61 0.747 -2.73 3.07 Intr - 110751 110564 188 0 2 83 97 174 0.985 15.37 3.06 Intr - 112933 112781 153 1 0 93 69 116 0.918 9.55 3.05 Intr - 117279 117115 165 0 0 97 75 155 0.974 14.34 3.04 Intr - 120288 120207 82 2 1 29 107 49 0.521 -0.38 3.03 Intr - 125733 125605 129 2 0 95 -10 134 0.055 3.19 3.02 Intr - 126202 125903 300 0 0 63 113 291 0.748 24.02 3.01 Init - 128635 128628 8 1 2 74 53 0 0.521 -4.45 3.00 Prom - 128985 128946 40 -4.45 4.00 Prom + 130315 130354 40 -6.15 4.01 Init + 134879 135304 426 1 0 45 100 121 0.877 5.35 4.02 Intr + 136325 136792 468 1 0 103 87 288 0.996 22.57 4.03 Term + 143844 144110 267 2 0 104 38 96 0.625 0.71 4.04 PlyA + 146054 146059 6 1.05 5.02 PlyA - 146318 146313 6 1.05 5.01 Sngl - 148780 148253 528 1 0 33 38 210 0.451 6.31 5.00 Prom - 150661 150622 40 -6.15 6.02 PlyA - 150829 150824 6 1.05 6.01 Sngl - 151733 151056 678 2 0 53 43 368 0.957 24.73 6.00 Prom - 180601 180562 40 -3.45 7.03 PlyA - 182107 182102 6 1.05 7.02 Term - 190448 190006 443 0 2 65 42 108 0.086 -1.67 7.01 Intr - 193005 192757 249 1 0 23 71 205 0.209 8.79 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init + 39144 39288 145 2 1 43 81 168 0.859 11.93 S.002 Init + 62475 62652 178 2 1 75 63 114 0.863 7.07 S.003 Term - 125733 125601 133 2 1 95 36 137 0.922 5.78 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815594r:105582736_105800005|GENSCAN_predicted_peptide_1|779_aa XKMTPQGECSVAETLTPEEEHHMKRMMAKREKIIKELIQTEKDYLNDLELCVREVVQPLR NKKTDRLDVDSLFSNIESVHQISAKLLSLLEEATTDVEPAMQVIGQCGGLGDTQMQQLED QNANVRSNRKAVRFGENDNGEMVFSGSRIHQSSQERKNQQEAGEVFLQIKGPLEDIYKIY CYHHDEAHSILESYEKEEELKEHLSHCIQSLKGKPNLLDMGSLMIKPIQRVMKYPLLLCE LRNSTPPSHPDYRALDDAFAAVKDINVNINELKRRKDLGENVLGITSLENNLTIPSKVED AQTGDSENEFLSICPRETLAHVLKERCTVLKYKKNDEDESLKDKLSKLNIHSISKKSKRV TNHLKILTRGESQDAMPLALQSVMDLQEISYNKDDEMDYSETLSNALNSCHDFASHLQRL ILTPLSALLSLFPGPHKLIQKRYDKLLDCNSYLQRSTGEESDLAKKEYEALNAQLVEELQ AFNQAARKILLNCLCSFITLLRDLMLVAQQAYSTLVPMPLLVSSISEIQNQVLEEIQNLN CVKENSATFIERKLSFEKKKPVQILPEMPHQTDIHRSKLLSTYSAEELYQAKRKCNATQE YDINLLEGDLVAVIEQKDPLGSTSRWLVDTGNVKGYVYSSFLKPYNPAKMQKVDAENRFC DDDFENISLFVSSRPASDSVTGTSESSIGDSSSSLSGTCGKFETNGTDVDSFQEVDEQIF YAVHAFQARSDHELSLQEYQRVHILRFCDLSGNKEWWLAEAQGQKGYVPANYLGKMTYA >gi568815594r:105582736_105800005|GENSCAN_predicted_CDS_1|2340_bp naaaagatgactccacagggtgagtgttctgtagctgagaccttaaccccagaggaagag catcatatgaagaggatgatggcaaagcgggaaaagatcattaaggagctgatacagaca gaaaaggattatctcaatgatctagagctgtgtgttagggaagtggttcagcccctgaga aataaaaagactgataggctggatgtggatagcttgtttagcaacattgagtccgtgcat cagatatcagccaagctgctgtcattgttggaagaggccacaacagacgtggaaccggcc atgcaagtaattggtcagtgcggaggtttgggggacacccagatgcaacagttggaagat cagaatgcaaatgttagaagtaataggaaagctgtccgctttggagaaaatgacaatggg gaaatggtattttcaggatcaaggatacaccaatcctcacaagaaagaaaaaatcagcag gaagcaggagaagtattcttgcagattaaagggccactggaagatatttataaaatctac tgctatcaccatgatgaagcacatagtatactggagtcctatgaaaaggaagaagagctg aaggaacatttgagccactgtatccagtccttaaaaggcaaaccaaacttattggacatg ggctctttgatgatcaaaccaattcaacgtgtgatgaaataccccctattactgtgcgaa cttcggaattccacccctccctctcacccagattacagagcactggacgatgcctttgct gctgtgaaggacattaatgttaacatcaatgaacttaaaagaaggaaagatttaggtgag aatgtacttggaatcactagtttagagaacaacttgacaatacctagcaaagttgaagat gcacagaccggagactcagaaaatgaatttctaagtatttgcccaagagagactcttgcc catgtgctcaaagagaggtgtactgttctaaaatacaagaagaatgacgaggatgaatca cttaaagacaaattgtctaaactaaatattcattcaattagcaagaaatcaaaaagagtg acaaatcatctgaagattctgaccagaggagaatcacaggatgccatgcccctggctctg cagagtgtgatggaccttcaggagatttcatacaacaaagacgatgagatggactattct gagaccctaagtaatgccttaaattcgtgtcatgactttgcatctcacttacagagactc atcctgacccccttgtcagccctgctgtccttattcccagggcctcacaagctcatccag aaacgctatgacaaactgctggattgcaacagctacctgcagcgatcaacgggagaggag tcagacttggccaaaaaggagtatgaggccctcaacgcccagcttgtggaggagctccag gcattcaaccaggctgctcggaagattctgttgaactgtctatgcagcttcattaccctc cttagggacctgatgctcgtggcacagcaggcttactccacacttgtgccgatgccactg ttggtttcaagcatttctgagattcagaatcaagtactagaagagatccaaaatttgaat tgtgtgaaagaaaacagtgccacctttattgagaggaaactcagttttgaaaagaagaaa cctgtgcagattctgccagaaatgccacatcaaactgacattcatcgctccaaacttcta tccacatatagtgcagaggaactctatcaagctaagcgcaagtgcaatgctacacaagaa tatgacatcaatcttctggaaggagacttggtggctgtgatagaacagaaagatccactg gggagtacaagcaggtggcttgtggacacaggaaatgtgaaaggatatgtttattcctcc ttcctaaaaccctacaatccagcaaaaatgcagaaagtggatgctgagaacaggttctgt gacgatgattttgagaacatcagcctcttcgtgtcttcacggccagctagtgacagtgtc acaggcacctcagaaagcagcattggtgatagcagctcatctcttagtggcacatgtgga aagtttgaaacaaatggtactgatgttgacagttttcaagaagtagacgaacagattttc tatgcagttcatgcttttcaagcacggagtgaccatgaactcagccttcaggaataccag agagttcatatactcaggttttgtgacctaagtggcaataaagagtggtggttagctgaa gctcaagggcagaaaggatacgtgccagctaactaccttggaaagatgacttatgcttaa >gi568815594r:105582736_105800005|GENSCAN_predicted_peptide_2|137_aa MAMNPHKTECTVFTWGHCTVTKRNYTKRLQRGNYKVLVATNVAPRGLDIPVVDLVIQSSP PQDIESYIHLSGCVDRGVAGQASQVVDLAIGQVERVDKAVAQESRQDGRRRSGTEIDQEV GATNGVLTEYLIVNLAV >gi568815594r:105582736_105800005|GENSCAN_predicted_CDS_2|414_bp atggccatgaatccgcacaaaacagaatgcactgtgtttacatggggacattgcacagtc acaaagagaaattacactaaaaggcttcagagaggtaattataaagttttggtggcaacc aatgtggctcctcgtggtttggacattcctgtagttgacctggtgattcaaagttctcct cctcaggatattgagtcctatatccatctctctggatgcgtggacagaggagtggctggt caggccagtcaggttgtcgatctggcaattggtcaggtagagagagtcgacaaagcagtt gctcaggaaagtcggcaagatggtagaagacgaagtggaacagaaatagatcaagaagtg ggggccacaaatggagttttgactgagtatttgatagttaatttagcagtgtga >gi568815594r:105582736_105800005|GENSCAN_predicted_peptide_3|638_aa MKCPSWNSKAASGSCCPQTTRLHRRQEKNDSHLMWKILAPAVIHNPADRKKGSNVTSYCR TAKRFPRPYCGNLCFPAFGAGQCGNRFRRDHREQTDRQGGAEREDCASSEGVAGRDMRTE HISRSRDKGLGILPRTLSEGFSDFSTSTQLHQVADLFMTAVESKLLSLYGYLPAFAMAAT VNLELDPIFLKALGFLHSKSKDSAEKLKALLDESLARGIDSSYRPSQKDVEPPKISSTKN ISIKQEPKISSSLPSGNNNGKVLTTEKVKKEAEKRPADKMKSDITEGVDIPKKPRLEKPE TQSSPITVQSSKDLPMADLSSFEETSADDFAMEMGLACVVCRQMMVASGNQLVECQECHN LYHRDCHKPQVTDKEANDPRLVWYCARCTRQMKRMAQKTQKPPQKPAPAVVSVTPAVKDP LVKKPETKLKQETTFLAFKRTEVKTSTVISGNSSSASVSSSVTSGLTGWAAFAAKTSSAG PSTAKLSSTTQNNTGKPATSSANQKPVGLTGLATSSKGGIGSKIGSNNSTTPTVPLKPPP PLTLGKTGLSRSVSCDNVSKVGLPSPSSLVPGSSSQLSGNGNSGTSGPSGSTTSKTTSES SSSPSASLKGPTSQESQLNAMKRLQMVKKKAAQKKLKK >gi568815594r:105582736_105800005|GENSCAN_predicted_CDS_3|1917_bp atgaaatgtccgagctggaattccaaggctgcctctgggagctgctgcccgcagaccact cgcctccacagacgccaggagaaaaacgacagccaccttatgtggaaaatactcgcgccg gccgtcattcataatccggcggaccggaaaaagggaagtaacgtcacttcctactgtcgc actgccaaacgtttcccccgcccctactgtgggaacctttgttttcctgctttcggagcc ggccagtgcgggaaccgtttccgaagggaccaccgggaacagacggatcggcagggcggg gcggaacgtgaggactgtgcttcctccgagggagttgcgggccgggacatgcggaccgag catatttctaggtctagggacaagggacttggaatcctgcctcggactttgtcggagggt tttagtgacttctctacatcaacacagttacaccaagttgctgacctctttatgactgct gtagaaagtaaacttctttcactatatggatatttgccggcgtttgcaatggctgctact gtgaacttggaacttgatcccatttttttgaaagcactaggtttcttgcattcaaagagt aaagattctgctgaaaagctaaaagcactgcttgatgaatctttggctcggggcattgat tccagttaccgtccatctcaaaaggatgtggagccacccaaaatttcaagcacaaaaaac atttccattaagcaagagcccaaaatatcatccagtcttccttctggtaataataatggc aaggtcctcacaactgaaaaggtaaagaaggaagctgaaaagagacctgctgataaaatg aaatcagacatcactgaaggagttgatattccaaagaaacctagattggagaaaccagaa acacagtcatctcccattactgtccaaagtagcaaggatttacctatggctgacctttcc agttttgaggagaccagtgctgatgattttgccatggagatgggattggcctgcgttgtt tgtaggcaaatgatggtggcatctggcaatcaattagtagaatgtcaggagtgccataat ctctaccaccgagattgtcataaaccccaggtgacagacaaggaagcgaatgaccctcgc ctggtgtggtattgtgcccgatgtaccagacaaatgaaaagaatggctcaaaaaactcag aaaccaccgcagaaaccagcccctgcagttgtttctgtaactccagctgtcaaagatcca ttggttaagaaaccagaaactaaactgaaacaagagacaacttttctagcgtttaagaga acagaagtcaagacatccacagttatttcaggaaattcttctagtgccagcgtttcctcg tcagtaactagtggcttaactggatgggcagcttttgcagccaaaacttcctctgctggt ccttcaacagcaaaattgagttcaacaacacaaaacaatactgggaaacctgctacttcg tcagctaaccagaaacctgtgggtttgactggtctggcaacatcatccaaaggtggaata ggttccaaaataggttccaataacagcactacgcccactgtacctttaaaaccacctcca cctctaaccttgggtaaaactggccttagtcgctcagttagttgtgacaatgtcagcaaa gtaggtcttcctagtccaagtagtttagttccaggaagcagcagccaactaagtgggaat ggaaatagtggaacatcaggacctagtggaagtactaccagcaaaactacttcagaatcc agcagctctccctcagcatcccttaaaggcccaacttcacaagaatcacagctcaatgct atgaagcgattacagatggtcaagaagaaagctgcccaaaagaaactcaagaagtaa >gi568815594r:105582736_105800005|GENSCAN_predicted_peptide_4|386_aa MKAIKKSLTEEEYLYLDFSHQTEGCIFPLHTSVTLFLLSYCDCKIFKICLVVTKEVSRDS SLLRDDLIQDVEIQIISRQELPPIVQNCCLPAVVERSDNFCRAGLAVVLRHIIQKSYEAD PLKKELLELLGFKKTCLKACAEVSQWTRLCELTIPLAIENFLRESSDQPPTIPVEILQLE KKLSEPVRVHNDDKLRRQKLKQQKADGVGPPLTKGKAKSKVHTQETSEGLDSSSKSLELK VAFSKLTVQEEPATTNREPSHIRKAKASDLPPLEHVFAEGLYFTLADIVLLPCIHHFLVI ISRKFSEKLVEFPLLASWYQRIQEVPGVKTAASKCGIQFLHLPKLLTTSTEQHPNLCEVP GVEEQSDPLFIGGPRPTMAKLMVLNY >gi568815594r:105582736_105800005|GENSCAN_predicted_CDS_4|1161_bp atgaaagccataaagaaaagtcttacagaagaagaatacctgtacctggacttttctcac caaacagaaggatgcatctttcctcttcatacatctgtaactttatttctgttatcttac tgtgactgtaaaatctttaaaatttgcttagttgtcaccaaagaggtgagtagagatagt tcactactaagagatgacctgatccaggatgttgaaatacagattatttcaaggcaggag ctcccaccaatagtccaaaattgctgtttgcctgcagtagtagaacgatcagacaatttt tgtagagcaggacttgctgttgtattgagacacataatccagaaatcctatgaagcagac cccttaaagaaggaacttttggaacttctgggctttaaaaagacttgcttgaaagcctgt gctgaagttagtcagtggaccaggctatgtgaactcaccatccctttggctattgagaat tttctcagagaatcttctgaccagcccccaactatacctgtagaaatactacagctagag aaaaagcttagtgagcctgttagagtgcataatgatgataaactccgcaggcagaagctc aagcaacagaaggctgatggagttgggcctccccttactaagggaaaggcaaagagcaag gtccacacacaggaaacatctgaagggttggattcttcatccaagagtctggaactgaaa gtggcattctcaaagctcacagtacaggaagaaccagctactaccaacagagagccttct cacatcagaaaagcaaaagcctccgaccttccacctctggagcatgtgtttgcagaaggg ctttacttcactctggcagatattgtgctcttgccctgtatccatcatttcttggtaatt atcagcaggaaattttctgagaagctggtagaatttccattgctagcctcttggtaccag aggattcaggaagtgccaggagtaaaaacagcagcttctaagtgtgggatccaatttctc catttaccaaagttgttgacaacctcaactgaacagcatccaaacttatgtgaagtccca ggtgtagaagagcaaagcgatcctttatttataggaggaccaagaccaaccatggccaag ttaatggtacttaattattaa >gi568815594r:105582736_105800005|GENSCAN_predicted_peptide_5|175_aa MSELPFTIVSKRIKYLGIQLTRDVKDLFKENYKLLLNEIKEDTNKWKNIPCSWIGRINIV KMAILPKVIYRFNVILIKLPMTFFTKLEKTTLKFIWNQKRARIAKTILSQKNKAGDITLP DFKLYYKATVTKTAWYWYQNRDIDQWNRTEPSEIIPHIYNHLIFDKPDKNKKFPI >gi568815594r:105582736_105800005|GENSCAN_predicted_CDS_5|528_bp atgagtgaactcccattcacaattgtttcaaagagaataaaatacctaggaatccaactt acaagggatgtgaaggacctcttcaaggagaactacaaactcctgctcaatgaaataaaa gaggacacaaacaaatggaagaacattccatgctcatggataggaagaatcaatattgtg aaaatggccatactgcccaaggtaatttatagattcaatgtcatcctcatcaagctacca atgactttcttcacaaaattggaaaaaactactttaaagttcatatggaaccaaaaaaga gcccgcattgccaagacaatcctaagccaaaagaacaaagctggagacatcacactacct gacttcaaactatactacaaggctacagtaaccaaaacagcatggtactggtaccaaaac agagatatagaccaatggaacagaacagagccctcagaaataataccacacatctacaac catctgatctttgacaaacctgacaaaaacaagaaattccccatttaa >gi568815594r:105582736_105800005|GENSCAN_predicted_peptide_6|225_aa MEERESVTEDQMNEMKQEEKFREKRVKRNEQSLQEIWDYVKRPNPWLIGLPESDGENGTK LENTLQDIIQENFPNLARQANIQIQEIRRMPQRCSSRRATPRHIIVRFTKVEMKEKMLRA AREKGRGTHKRKPIRLTADLSAEILQARREWGPIFNILKEKNFQPRISYPAKLSFISEGE IKSVTDKQMLRDFVTTRPALKELLKEALNMERKNRYQPLQKHAKL >gi568815594r:105582736_105800005|GENSCAN_predicted_CDS_6|678_bp atggaagaaagggaatcagtgactgaagatcaaatgaatgaaatgaagcaagaagagaag tttagagaaaaaagagtaaaaagaaatgaacaaagcctccaagaaatatgggactatgtg aaaagaccaaatccatggctgattggtttacctgaaagtgacggggagaatggaaccaag ttggaaaacacgctgcaggatattatccaggagaacttccccaacctagcaaggcaggcc aacattcaaattcaggaaatacggagaatgccacaaagatgttcctcgagaagagcaact ccaagacacataattgtcagattcaccaaagttgaaatgaaggaaaaaatgttaagggca gccagagagaaaggtcggggtacccacaaaaggaagcccatcagactaacagctgatctc tcagcagaaatcctacaagccagaagagagtgggggccaatattcaacattcttaaagaa aagaattttcaacccagaatttcatatccagccaaactaagcttcataagtgaaggagaa ataaaatccgttacagacaagcaaatgctgagagattttgtcaccaccaggcctgcccta aaagagctcctgaaggaagcactaaacatggaaaggaaaaaccgctaccagccactgcaa aaacatgccaaattgtaa >gi568815594r:105582736_105800005|GENSCAN_predicted_peptide_7|230_aa XAPAEQSFQRKEQAAIFAVLQPPLVIPRQTGCGADLQQTPADLQQTGLSVRKKTDKQKGI ASTSTKRTSTQKPHPKITNMKTKEIQTTIKEYYEHLYTNKLENLQEMDKFLDTYTFPGLN QEEVQSLNRPITSSEIEAVINSLPTKKSPGPDRFTAEFYQRYKEKLVLFLLKLLQTTEKE GLLPNSFYEANIILILKPGRDTTKKENFRLISMMNFEAKILNKILANQIQ >gi568815594r:105582736_105800005|GENSCAN_predicted_CDS_7|693_bp ngtgcccctgcggaacaaagcttccagaggaaggaacaggcagcaatctttgctgttctg cagcctccactggtgatacccaggcaaacagggtgtggagcagacctccagcaaactcca gcagacctgcagcagacgggactgagtgttagaaagaaaactgataaacagaaaggaata gcatcaacatcaacaaaaaggacgtccacacagaaaccccatccaaagatcaccaacatg aagaccaaagaaatacaaactaccataaaagaatactatgaacacctctatacaaataaa ctagaaaatctacaagaaatggataaattcctggacacatacaccttcccaggactaaac caggaagaagtccaatccctgaatagaccaataacaagttctgaaattgaggcagtaatt aatagcctaccaaccaaaaaaagtccaggaccagacagattcacagccgaattctaccag aggtacaaagagaagctggtactattccttctgaaactattgcaaacaacagaaaaagag ggactcctccctaactcattttatgaggccaacatcatcctgatactaaaacctggcaga gacacaacaaaaaaagaaaacttcaggctaatatccatgatgaactttgaagcaaaaatc ctcaataaaatactggcaaaccaaatccagtag