GENSCAN 1.0 Date run: 6-Nov-116 Time: 07:51:10 Sequence gi568815592r:18021422_18222606 : 201185 bp : 42.41% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 6443 6543 101 0 2 79 110 98 0.410 10.01 1.02 Intr + 14064 14227 164 2 2 33 92 96 0.090 2.35 1.03 Intr + 15270 15375 106 0 1 44 116 51 0.669 2.80 1.04 Intr + 21800 22005 206 1 2 23 96 195 0.144 10.88 1.05 Intr + 43914 44010 97 0 1 85 105 51 0.172 5.59 1.06 Term + 49363 49443 81 0 0 50 43 114 0.052 -0.19 1.07 PlyA + 49654 49659 6 1.05 2.00 Prom + 54066 54105 40 -3.35 2.01 Init + 71164 71326 163 0 1 67 92 72 0.414 5.44 2.02 Intr + 74629 74744 116 2 2 77 113 56 0.834 6.25 2.03 Intr + 76605 76752 148 2 1 19 54 145 0.612 3.09 2.04 Term + 80560 80774 215 2 2 26 49 164 0.346 2.71 2.05 PlyA + 82336 82341 6 1.05 3.00 Prom + 85195 85234 40 -5.25 3.01 Init + 96509 96632 124 1 1 65 89 179 0.976 16.08 3.02 Term + 96971 97152 182 0 2 8 54 108 0.709 -3.81 3.03 PlyA + 99458 99463 6 1.05 4.02 PlyA - 99896 99891 6 -1.75 4.01 Sngl - 101185 99998 1188 1 0 111 45 1101 0.999 103.76 4.00 Prom - 108429 108390 40 -7.85 5.03 PlyA - 109231 109226 6 1.05 5.02 Term - 109359 109243 117 0 0 126 53 25 0.703 0.36 5.01 Init - 112126 111989 138 1 0 78 64 152 0.134 11.99 5.00 Prom - 115902 115863 40 -4.15 6.06 PlyA - 116177 116172 6 1.05 6.05 Term - 117616 117538 79 0 1 101 42 100 0.008 2.96 6.04 Intr - 118296 118244 53 0 2 71 96 -5 0.613 -4.61 6.03 Intr - 122307 122175 133 2 1 47 76 141 0.835 8.53 6.02 Intr - 126494 126402 93 1 0 136 60 71 0.970 7.26 6.01 Init - 127706 127567 140 2 2 66 52 197 0.996 13.56 6.00 Prom - 130303 130264 40 -8.45 7.03 PlyA - 132807 132802 6 1.05 7.02 Term - 135100 134325 776 2 2 26 36 542 0.479 35.35 7.01 Init - 139859 139787 73 2 1 87 42 42 0.015 0.78 7.00 Prom - 153863 153824 40 -4.25 8.02 PlyA - 154090 154085 6 1.05 8.01 Sngl - 155294 154782 513 2 0 25 38 475 0.871 31.99 8.00 Prom - 160578 160539 40 -8.05 9.00 Prom + 161083 161122 40 -3.95 9.01 Init + 163502 163519 18 1 0 86 77 3 0.748 -0.89 9.02 Intr + 166371 166581 211 2 1 94 93 204 0.908 19.06 9.03 Intr + 169782 169960 179 1 2 32 106 202 0.550 15.12 9.04 Intr + 175636 175812 177 0 0 47 51 151 0.859 6.49 9.05 Intr + 176166 176240 75 2 0 86 97 65 0.981 5.99 9.06 Intr + 179018 179155 138 1 0 113 89 110 0.998 13.34 9.07 Intr + 180065 180236 172 1 1 44 64 164 0.995 8.19 9.08 Intr + 184116 184243 128 1 2 85 85 135 0.992 12.38 9.09 Intr + 185339 185440 102 1 0 102 33 50 0.556 0.35 9.10 Intr + 185977 186108 132 0 0 113 81 89 0.997 10.62 9.11 Intr + 186711 186911 201 2 0 59 119 78 0.868 6.56 9.12 Intr + 191067 191183 117 2 0 108 57 102 0.994 8.84 9.13 Intr + 192235 192360 126 0 0 117 98 88 0.999 12.66 9.14 Intr + 193586 193708 123 1 0 88 94 138 0.665 14.26 9.15 Intr + 196312 196464 153 0 0 60 64 145 0.382 8.65 9.16 Intr + 200607 200831 225 2 0 78 101 67 0.245 4.26 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Intr - 117616 117542 75 0 0 101 115 96 0.830 11.31 S.002 Init - 138113 138065 49 2 1 65 97 10 0.822 0.76 S.003 Intr + 139906 140033 128 0 2 53 66 154 0.880 8.26 S.004 Term + 144846 145035 190 0 1 87 49 160 0.871 7.94 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815592r:18021422_18222606|GENSCAN_predicted_peptide_1|251_aa XRVGGYWYEWSWSRMSCDSGVREWKFGMTDRLLRTLPLTMSHTKPTWFLGHLFVTPGLYH CHPICPAAPPLPPPTDAVPVLTTLRGEHCDTNWQLEISHGGGSIYTMEIGKCYKLGYWQV YIWKVRGSIRSHIHKRLVENMLPNPTTLTSGCQWAGPMEPAQQSTVVPGGHADCSSQGFS CFTSAPLQSSVAGPWEPLLNQGSPTLGPWTSTSWWPVRNWATQQEFLAEDSLDEYDFSEI DNSDDSDVSSV >gi568815592r:18021422_18222606|GENSCAN_predicted_CDS_1|756_bp nggagagttggagggtactggtatgaatggtcatggtccaggatgtcctgtgactctggt gtgagggagtggaaatttggcatgacagataggctgttgaggactctgccacttaccatg tcccacaccaagcccacgtggttcctggggcatttgtttgtgactcctggtttataccat tgtcatcccatctgtcctgctgctcctcccctgccaccacccacagacgctgtccctgtc ctcaccactctccgtggagaacactgtgacacaaattggcagcttgaaatcagccatggt ggtggcagtatttacaccatggaaattggcaaatgctacaaattaggctattggcaggta tatatatggaaggtcagaggctccatccggtcccacatccataagaggctggtggaaaac atgctgcctaaccccaccactttgacaagtggatgtcagtgggcagggcccatggagcct gctcagcagagcactgttgttcctggaggccatgctgactgcagcagtcaaggctttagc tgctttaccagtgctcccctccagagctccgtggcaggcccttgggaaccactgctaaac caggggtccccaaccctggggccgtggaccagtaccagttggtggcctgttaggaactgg gccacacagcaggagtttttggccgaagattcacttgatgaatatgatttttctgaaata gacaattctgatgattcagatgttagttctgtttag >gi568815592r:18021422_18222606|GENSCAN_predicted_peptide_2|213_aa MGNVGIFHWPSEHHTPWEVKYREKWRNGGMEEMAYIGSGAAEWGTDGAGKLPAYTYLVVQ AGQVLKSGSVTGLGCNMTRERSVFNENILSPNHTELARDIVSSERFTLQVKATHKETEKR NHSQRSNQIQKLGETGGSMKIEDIRQENLYKRPLILLLLGQKPASECASRKAAGGSHSPH EEDGNALTWLEERVWDTSEGHMLTGSLLLMVIA >gi568815592r:18021422_18222606|GENSCAN_predicted_CDS_2|642_bp atggggaatgttggcatttttcactggccttcagagcaccataccccctgggaagtgaag tacagggagaaatggagaaatggaggaatggaggagatggcctatattggaagtggggct gcagaatgggggactgatggggctggaaagcttccagcttatacttacctggtggtccag gctggccaagtccttaaatctgggtcagttacagggcttggatgcaacatgacaagggaa agaagtgtgttcaacgagaacattctgagtccaaaccacacagaactcgccagagacatt gtctcatcagaacgcttcaccctgcaagtaaaggccacacacaaagagactgagaaaagg aaccacagccagagaagcaaccaaatacagaaacttggagaaacaggagggagtatgaaa attgaagacatcaggcaggaaaacctgtataagaggcctcttatacttctcctgctgggg cagaagccagcttcagagtgtgccagcaggaaagctgcaggtggtagccattctccccat gaggaagatggtaatgcgttaacttggctggaagagagagtatgggacacaagtgaaggt catatgctgacgggaagccttttattgatggtgattgcatga >gi568815592r:18021422_18222606|GENSCAN_predicted_peptide_3|101_aa MVRGDHVIENLAADGWLMMGQFFLEVAVGTEDTMVIQYSSCELDLVEGRFMSVNQVFYKM PRKEIEKAKSRRQVQKAEQGQKQETVQGSGIGQNWNRWRVT >gi568815592r:18021422_18222606|GENSCAN_predicted_CDS_3|306_bp atggtccgaggggatcatgtaatagaaaacctggctgctgacggctggcttatgatggga cagtttttcttagaagtggctgtgggcacagaagacaccatggttattcagtacagtagt tgtgagctagatttggtagaggggagatttatgagtgtgaaccaagttttttacaaaatg ccaaggaaagagattgagaaagccaagtccaggcggcaagtccagaaggcagaacaaggt caaaagcaggagacagtccaaggatcgggaattgggcaaaattggaacagatggagggtg acgtga >gi568815592r:18021422_18222606|GENSCAN_predicted_peptide_4|395_aa MAAEASESGPALHELMREAEISLLECKVCFEKFGHRQQRRPRNLSCGHVVCLACVAALAH PRTLALECPFCRRACRGCDTSDCLPVLHLIELLGSALRQSPAAHRAAPSAPGALTCHHTF GGWGTLVNPTGLALCPKTGRVVVVHDGRRRVKIFDSGGGCAHQFGEKGDAAQDIRYPVDV TITNDCHVVVTDAGDRSIKVFDFFGQIKLVIGGQFSLPWGVETTPQNGIVVTDAEAGSLH LLDVDFAEGVLRRTERLQAHLCNPRGVAVSWLTGAIAVLEHPLALGTGVCSTRVKVFSSS MQLVGQVDTFGLSLYFPSKITASAVTFDHQGNVIVADTSGPAILCLGKPEEFPVPKPMVT HGLSHPVALTFTKENSLLVLDTASHSIKVYKVDWG >gi568815592r:18021422_18222606|GENSCAN_predicted_CDS_4|1188_bp atggcggccgaagcctcggagagcgggccagcgctgcatgagctcatgcgcgaggcggag atcagcctgctcgagtgcaaggtgtgctttgagaagtttggccaccggcagcagcggcgc ccgcgcaacctgtcctgcggccacgtggtctgcctggcctgcgtggccgccctggcgcac ccgcgcactctggccctcgagtgcccattctgcaggcgagcttgccggggctgcgacacc agcgactgcctgccggtgctgcacctcatagagctcctgggctcagcgcttcgccagtcc ccggccgcccatcgcgccgcccccagcgcccccggagccctcacctgccaccacaccttc ggcggctgggggaccctggtcaaccccaccggactggcgctttgtcccaagacggggcgt gtcgtggtggtgcacgacggcaggaggcgtgtcaagatttttgactcagggggaggatgc gcgcatcagtttggagagaagggggacgctgcccaagacattaggtaccctgtggatgtc accatcaccaacgactgccatgtggttgtcactgacgccggcgatcgctccatcaaagtg tttgatttttttggccagatcaagcttgtcattggaggccaattctccttaccttggggt gtggagaccacccctcagaatgggattgtggtaactgatgcggaggcagggtccctgcac ctcctggacgtcgacttcgcggaaggggtccttcggagaactgaaaggttgcaagctcat ctgtgcaatccccgaggggtggcagtgtcttggctcaccggggccattgcggtcctggag caccccctggccctggggactggggtttgcagcaccagggtgaaagtgtttagctcaagt atgcagcttgtcggccaagtggatacctttgggctgagcctctactttccctccaaaata actgcctccgctgtgacctttgatcaccagggaaatgtgattgttgcagatacatctggt ccagctatcctttgcttaggaaaacctgaggagtttccagtaccgaagcccatggtcact catggtctttcgcatcctgtggctcttaccttcaccaaggagaattctcttcttgtgctg gacacagcatctcattctataaaagtctataaagttgactgggggtga >gi568815592r:18021422_18222606|GENSCAN_predicted_peptide_5|84_aa MDDPDTEEKSHESYWEEGGMWLDIAYPYHPSGSDVLDRCLHIHKHGVKYAIYVVLRRLML LKNDIKVGELTVFLKSYIYLQKSK >gi568815592r:18021422_18222606|GENSCAN_predicted_CDS_5|255_bp atggatgatcctgatacagaagagaaatctcatgagtcctattgggaagaaggaggaatg tggttggatattgcatatccgtatcacccaagtggtagtgatgtgttggacagatgcctt cacatacacaagcacggggtaaaatatgcaatatacgttgtcttgagaaggttgatgctt ttgaagaacgacataaaagttggggaattgactgtctttttgaaaagttatatctactta cagaaaagtaaatga >gi568815592r:18021422_18222606|GENSCAN_predicted_peptide_6|165_aa MDGTRTSLDIEEYSDTEVQKNQVLTLEEWQDKWVNGKTAFHQEQGHQLLKKHLDTFLKGK SGLRVFFPLCGKAVEMKWFADRGHSVVGVEISELGIQEFFTEQNLSYSEEPITEIPGTKV FKSSSGNISLYCCSIFDLPRTNIGKFDMIWDRGALVAINPGDRKW >gi568815592r:18021422_18222606|GENSCAN_predicted_CDS_6|498_bp atggatggtacaagaacttcacttgacattgaagagtactcggatactgaggtacagaaa aaccaagtactaactctggaagaatggcaagacaagtgggtgaacggcaagactgctttt catcaggaacaaggacatcagctattaaagaagcatttagatactttccttaaaggcaag agtggactgagggtattttttcctctttgcggaaaagcggttgagatgaaatggtttgca gaccggggacacagtgtagttggtgtggaaatcagtgaacttgggatacaagaatttttt acagagcagaatctttcttactcagaagaaccaatcaccgaaattcctggaaccaaagta tttaagagttcttcggggaacatttcattgtactgttgcagtatttttgatcttcccagg acaaatattggcaaatttgacatgatttgggatagaggagcattagttgccatcaatcca ggtgatcgcaaatggtaa >gi568815592r:18021422_18222606|GENSCAN_predicted_peptide_7|282_aa MTKDEGSLLKSQLSSKHEGQKHHGKAFSPTRLSATKIDLKTQNKTGQSLELCVYTWCGGF PNWSGLGSPAGLPKGRLDFYVAGFHITEPQLSTAGSASSFLFHLSAARPLTNSYPNRATP PPGLQSQRSYPSNLAMLIKSQRVVAVPENYYNTCKAVRAPGRTLSAGKNKRGSSYSRTVG FSRGSDSPQEGAPNSRCQLQPPRFPRSFRKLFEVGEATAPGLTFALKRCRESRGKGTAVQ KGNIVGGLADRQHRDEPGDPTPQHPSPGIPPSSDRYPPRTLF >gi568815592r:18021422_18222606|GENSCAN_predicted_CDS_7|849_bp atgactaaggatgaaggttcactactgaaatcacagctgagttctaaacatgaaggtcaa aaacatcatggcaaagccttctctcccactcgcctttctgccactaaaatagatttaaaa acacaaaacaaaactggtcaaagccttgagttatgtgtgtatacgtggtgtgggggattc cccaactggagtggactggggagtcctgcaggcttacccaagggaagactggacttctat gtggctggtttccacatcactgaaccccaactttctacagctgggtcggcctcttccttc ctctttcacctgtcagctgcccggccgttgactaacagctaccctaatcgggccacccct cctccaggtctccaatcccagaggagttaccccagcaacctggcaatgctcatcaagtct caacgcgtggtggctgtgccagagaattactacaacacctgcaaggctgtgcgggctcct ggccgcaccctatcggccgggaagaacaagcgagggagttcctactccaggaccgtgggg ttctcccgcggctctgacagtccccaggagggagccccaaactcccggtgccagctgcag ccgcctcgctttccgagaagctttcggaagcttttcgaagtcggagaagcaactgccccg gggctcacctttgcgctgaagaggtgccgggagtctcggggaaaaggcaccgctgtacag aaaggaaacattgtaggggggttagccgaccggcaacatcgcgacgaacccggcgacccc acgccccaacaccctagcccgggaattcccccttcttcagacagatacccaccccgcacc ctcttttaa >gi568815592r:18021422_18222606|GENSCAN_predicted_peptide_8|170_aa MIISIDAEKALDKIQQRFMIKTLSKIGIQGTYLNVIKAIYDKPTDNIILNGEELKAFPLR PGTKQGCPISPLLFNIVLEVLARAIRQEKEIKGIQIGKEEVKLSLFADDMIVYPENPKDS SRRLLELIKEVSKVSRYKINVHRSVALLYTNSDQAESQIKNSTPLPIAAK >gi568815592r:18021422_18222606|GENSCAN_predicted_CDS_8|513_bp atgatcatctcaatagatgcagaaaaagcactggacaaaatccagcaacgctttatgatt aaaactctcagcaaaatcggcatacaagggacatacctcaatgtaataaaagccatctat gacaaacccacagacaacataatactgaatggggaagagttgaaagcattccctctgaga cctggaacaaaacaaggatgcccaatctcaccactcctcttcaacatagtactagaagtc ctagccagagcaatcagacaagagaaagagataaagggcatccaaatcggtaaagaggaa gtcaaactgtcactgtttgctgatgatatgatcgtttaccccgaaaatcctaaggactct tccagaaggctcctagaactgataaaagaagttagcaaagtttccagatacaagattaat gtacacagatcagtagctcttctatacaccaacagcgaccaagcagagagtcaaatcaag aactcaacccctttaccaatagctgcaaaataa >gi568815592r:18021422_18222606|GENSCAN_predicted_peptide_9|759_aa MGGVTQRVLEVSNHWWYSMLILPPLLKDSVAAPLLSAYYPDCVGMSPSCTSTNRAAATGN ASPGKLEHSKAALSVHGMNRYFQPFYQPNECGKALCVRPDVMELDELYEFPEYSRDPTMY LALRNLILALWYTNCKEALTPQKCIPHIIVRGLVRIRCVQEVERILYFMTRKGLINTGVL SVGADQYLLPKDYHNKSVIIIGAGPAGLAAARQLHNFGIKVTVLEAKDRIGGRVWDDKSF KGVTVGRGAQIVNGCINNPVALMCEQLGISMHKFGERCDLIQEGGRITDPTIDKRMDFHF NALLDVVSEWRKDKTQLQDVPLGEKIEEIYKAFIKESGIQFSELEGQVLQFHLSNLEYAC GSNLHQRALGHFCGTVESETLLSRSQNRVSLQVWSVYLVKVSARSWDHNEFFAQFAGDHT LLTPGYSVIIEKLAEGLDIQLKSPVQCIDYSGDEVQVTTTDGTGYSAQKVRARAVQGWVE TFAARGLEGRTNLSYQGVCLWESAWTLVGFLVLVTVPLALLQKGAIQFNPPLSEKKMKAI NSLGAGIIEKIALQFPYRFWDSKVQGADFFGHVPPSASKRGLFAVFYDMDPQKKHSVLMS VIAGEAVASVRTLDDKQVLQQCMATLRELFKEQEVPDPTKYFVTRWSTDPWIQMAYSFVK TGGSGEAYDIIAEDIQGTVFFAGEMGKFESHVKPQFYKRGKKPSLHSKTEMFLRRYDNAN LFHHSKSTDLKKPYKHLDLIAFSIGSTTAESLDFRIKQN >gi568815592r:18021422_18222606|GENSCAN_predicted_CDS_9|2277_bp atggggggtgttacccagagagtattggaagtttccaaccattggtggtactctatgctc atcctacctcctttgctgaaagacagtgtggcagcgcccctgctgtctgcctactaccct gactgtgttggcatgagcccctcctgcaccagcacaaaccgcgccgctgccactggcaat gccagccctgggaagctggagcactccaaggctgccctctccgtgcacggcatgaaccga tacttccagcctttctaccagcccaatgagtgtggcaaagccctctgtgtgaggccggat gtgatggaactggatgagctctatgagtttccagagtattcccgagaccccaccatgtac ctggctttgagaaacctcatcctcgcactgtggtatactaactgcaaagaagctcttact cctcagaaatgtattcctcacatcatcgtccggggtctcgtgcgtattcgatgcgttcag gaagtggagagaatactgtattttatgaccagaaaaggtctcatcaacactggagttctc agcgtgggagccgaccagtatcttctccctaaggactaccacaataaatcagtcatcatt atcggggctggtccagcaggattagcagctgctaggcaactgcataactttggaattaag gtgactgtcctggaagccaaagacagaattggaggccgagtctgggatgataaatctttt aaaggcgtcacagtgggaagaggagctcagattgtcaatgggtgtattaacaacccagta gcattaatgtgtgaacaacttggcatcagcatgcataaatttggagaaagatgtgactta attcaggaaggtggaagaataactgaccccactattgacaagcgcatggattttcatttt aatgctctcttggatgttgtctctgagtggagaaaggataagactcagctccaagatgtc cctttaggagaaaagatagaagaaatctacaaggcatttattaaggaatctggtatccaa ttcagtgagctggagggacaggtgcttcagttccatctcagtaacctggagtacgcctgt ggcagcaaccttcaccagagggccttgggtcatttctgtggaaccgtggagtcggaaaca ctgctctcgagaagccaaaatagagtatccttgcaagtctggtctgtttaccttgttaaa gtatctgctcgctcgtgggaccacaatgaattctttgcccagtttgctggtgaccacact ctgctaactcccgggtactcggtgataattgaaaaactggcagaagggcttgacattcaa ctcaaatctccagtgcagtgtattgattattctggagatgaagtgcaggttaccactaca gatggcacagggtattctgcacaaaaggtaagagctagggcagtacaagggtgggtagaa acctttgctgccaggggcttggagggaaggacaaatctttcctatcaaggagtttgcctt tgggaatctgcctggacccttgttggatttctggtattagtcactgtaccactggcttta ctacagaaaggtgccattcagtttaatccaccgttgtcagagaagaagatgaaggctatc aacagcttaggcgcaggcatcattgaaaagattgccttgcaatttccgtatagattttgg gacagtaaagtacaaggggctgacttttttggtcacgttcctcccagtgccagcaagcga gggctttttgccgtgttctatgacatggatccccagaagaagcacagcgtgctgatgtct gtgattgccggggaggctgtcgcatccgtgaggaccctggatgacaaacaggtgctgcag cagtgcatggccacgctccgggagctgttcaaggagcaggaggtcccagatcccacaaag tattttgtcactcggtggagcacagacccatggatccagatggcatacagttttgtgaag acaggtggaagtggggaggcctacgatatcattgctgaagacattcaaggaaccgtcttt ttcgctggtgagatggggaaatttgaatcacatgttaaacctcagttttataagaggggg aaaaaaccgtctctacatagtaaaactgaaatgtttctaaggcgatatgataatgcaaac ctatttcatcactctaaaagcactgacctcaaaaaaccttataagcacttagatttaatt gcattttccataggttcaactactgctgaaagtctggatttcagaataaagcagaat