GENSCAN 1.0 Date run: 5-Nov-116 Time: 01:03:48

Sequence gi568815581r:43428522_43645617 : 217096 bp : 47.45% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Intr +   6811   6828   18  2  0   89  101    23 0.343   0.28
 1.02 Intr +   7624   7659   36  2  0   84   93    20 0.347   0.33
 1.03 Intr +  11440  11534   95  2  2   83   70   102 0.495   7.58
 1.04 Term +  31591  31719  129  0  0  112   38    24 0.027  -2.02
 1.05 PlyA +  34271  34276    6                               1.05

 2.00 Prom +  43505  43544   40                              -4.46
 2.01 Sngl +  47677  48144  468  0  0  110   43   440 0.989  37.83
 2.02 PlyA +  48227  48232    6                               1.05

 3.00 Prom +  51457  51496   40                              -6.96
 3.01 Init +  55517  55664  148  1  1   89   85   223 0.890  22.35
 3.02 Intr +  60928  61013   86  2  2  103   92    73 0.991   8.74
 3.03 Intr +  63662  63807  146  1  2   92   46   184 0.956  13.68
 3.04 Intr +  64160  64519  360  2  0   28   90   278 0.790  16.34
 3.05 Intr +  64924  65068  145  1  1   70  100   110 0.930  10.68
 3.06 Intr +  65162  65365  204  1  0   78   47   259 0.974  20.20
 3.07 Intr +  67660  67747   88  0  1   79   88   156 0.919  14.24
 3.08 Intr +  70341  70438   98  1  2   24  100    34 0.450  -2.07
 3.09 Intr +  71435  71582  148  1  1   64   58    80 0.341   2.41
 3.10 Intr +  76123  76304  182  2  2   95   94   165 0.651  17.39
 3.11 Intr +  78482  78676  195  1  0  119  107   123 0.992  16.91
 3.12 Intr +  78982  79167  186  0  0   91   88   213 0.999  21.49
 3.13 Intr +  79288  79498  211  0  1   72  110   154 0.999  14.49
 3.14 Intr +  79818  79999  182  1  2   60   87   125 0.998   9.19
 3.15 Intr +  84841  84981  141  0  0  127   80   167 0.994  20.35
 3.16 Intr +  88646  88801  156  1  0   70   70    77 0.918   4.31
 3.17 Intr +  91609  91746  138  0  0  129   90   206 0.953  25.56
 3.18 Intr +  92230  92358  129  0  0   91   97    89 0.999  10.99
 3.19 Intr +  92848  93044  197  0  2   72  115   254 0.995  24.71
 3.20 Intr +  93526  93705  180  1  0   62   78   277 0.999  23.08
 3.21 Term +  95107  95326  220  1  1   99   48   333 0.999  26.91
 3.22 PlyA +  98649  98654    6                              -0.45

 4.15 PlyA -  99347  99342    6                               1.05
 4.14 Term - 100222  99998  225  1  0  138   41   220 0.986  19.08
 4.13 Intr - 100715 100614  102  2  0   66   81   184 0.827  15.87
 4.12 Intr - 101155 100983  173  2  2  104   64   178 0.839  16.66
 4.11 Intr - 101461 101363   99  2  0   59   77   134 0.976   9.48
 4.10 Intr - 101660 101586   75  0  0  138   82     0 0.881   3.89
 4.09 Intr - 104418 104153  266  2  2   25  110   192 0.994  12.26
 4.08 Intr - 104827 104666  162  0  0  114   82   105 0.980  11.59
 4.07 Intr - 105464 105338  127  0  1   95   99    88 0.994  10.34
 4.06 Intr - 107958 107905   54  1  0  104   84    42 0.947   4.35
 4.05 Intr - 111154 110974  181  1  1   36   44   145 0.404   4.34
 4.04 Intr - 113355 113240  116  1  2   88   35    18 0.414  -3.33
 4.03 Intr - 116501 116454   48  0  0   97  113    50 0.992   7.05
 4.02 Intr - 116958 116753  206  2  2  113   98   214 0.654  23.74
 4.01 Init - 118703 118630   74  2  2   93   37    37 0.374  -0.35
 4.00 Prom - 119047 119008   40                              -7.66

 5.00 Prom + 122228 122267   40                              -6.66
 5.01 Init + 127909 128090  182  0  2   74   96   137 0.945   9.76
 5.02 Intr + 129312 129342   31  0  1   95  101    20 0.848   2.13
 5.03 Intr + 154971 155183  213  2  0  116   68   136 0.822  13.31
 5.04 Intr + 186173 186374  202  1  1   70   51    83 0.511   1.66
 5.05 Intr + 186709 186789   81  2  0   90   86    17 0.523   1.31
 5.06 Intr + 193463 193612  150  0  0   83   15    86 0.096   0.93
 5.07 Term + 199987 200069   83  2  2   84   55    84 0.248   2.56
 5.08 PlyA + 200740 200745    6                               1.05

 6.03 PlyA - 202428 202423    6                               1.05
 6.02 Term - 213511 213389  123  1  0  125   50    85 0.433   6.78
 6.01 Intr - 215139 214967  173  1  2   89   85   275 0.998  26.96

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815581r:43428522_43645617|GENSCAN_predicted_peptide_1|92_aa
XRLYQHVVLNFGYKLGLPEALSWSLMVPFVWTVDIMIVGMQIPWPYTGSLALGEAPPSAK
VPFSKKHPLPTASKIGFPPPTPQQLSSLGAET

>gi568815581r:43428522_43645617|GENSCAN_predicted_CDS_1|279_bp
naaaggctgtaccagcatgtggttctcaactttggctacaagttaggattacctgaggct
ctttcctggtccttgatggtcccctttgtgtggactgtggacatcatgattgttggaatg
cagattccatggccctacaccggatccctggctcttggagaggctccaccttcagctaaa
gttcctttctcaaagaagcatcccctgcccactgcttctaaaataggattcccacctcca
accccacaacaactgtcatctctaggagctgaaacataa

>gi568815581r:43428522_43645617|GENSCAN_predicted_peptide_2|155_aa
MAKSKNHSTNNQSRKRHRNGIKKPRSRRYESLKGMDPKFPRNMCFAKKQNKKVLKKMQAN
SDKAMSARAEVIKALVKPKEVKLKIPKGVSCKLDRLAYIAHPKLGKRARARIAKGLRLCW
PKAKAKDQTKAQAAAPASVPAQAPKGAQAPTKASE

>gi568815581r:43428522_43645617|GENSCAN_predicted_CDS_2|468_bp
atggccaagtccaagaaccacagcacaaacaaccagtcccgaaaaaggcacagaaatggt
atcaagaaaccccgatcacgaagatatgaatctcttaaggggatggaccccaagttcccg
aggaacatgtgctttgccaagaagcaaaacaagaaggtcctaaagaagatgcaggccaac
agtgacaaggccatgagtgcacgtgctgaggttatcaaggccctcgtaaagcccaaggag
gttaagctcaagatcccaaagggtgtcagctgcaagctcgatcgacttgcctacattgcc
caccccaagcttgggaagcgggctcgtgcccgcattgccaaggggctcaggctgtgctgg
ccaaaggccaaggccaaggatcaaaccaaggcccaggctgcagctccagcttcagttcca
gctcaggctcccaaaggtgcccaggcccctacaaaggcttcagagtag

>gi568815581r:43428522_43645617|GENSCAN_predicted_peptide_3|1179_aa
MAVAVAMAGALIGSEPGPAEELAKLEYLSLVSKVCTELDNHLGINDKDLAEFVISLAEKN
TTFDTFKASLVKNGAEFTTMLDEDDVKVAVDVLKELEALMPSAAGQEKQRDAEHRFVLSV
LSFGSLGDRTKKKKRSRSRDRNRDRDRDRERNRDRDHKRRHRSRSRSRSRTRERNKVKSR
YRSRSRSQSPPKDRKDRDKYGERNLDRWRDKHVDRPPPEEPTIGDIYNGKVTSIMQFGCF
VQLEGLRKRWEGLVHISELRREGRVANVADVVSKGQRVKVKVLSFTGTKTSLSMKDVDQE
TGEDLNPNRRRNLVGETNEETSMRNPDRPTHLSLVSAPEVEDDSLERKRLTRISDPEKWE
IKQMIAANVLSKEEFPDFDEETGILPKVDDEEDEDLEIELVEEEPPFLRGHTKQSMDMSP
IKIVKNPDGSLSQAAMMQSALAKERRELKQAQREAEMDSIPMGLNKHWVDPLPDAEGRQI
AANMRGIGMMPNDIPEWKKHAFGGNKASYGKKTQMSILEQRESLPIYKLKEQLVQAVHDN
QILIVIGETGSGKTTQITQYLAEAGYTSRGKIGCTQPRRVAAMSVAKRVSEEFGCCLGQE
VGYTIRFEDCTSPETVIKYMTDGMLLRECLIDPDLTQYAIIMLDEAHERTIHTDVLFGLL
KKTVQKRQDMKLIVTSATLDAVKFSQYFYEAPIFTIPGRTYPVEILYTKEPETDYLDASL
ITVMQIHLTEPPGDILVFLTGQEEIDTACEILYERMKSLGPDVPELIILPVYSALPSEMQ
TRIFDPAPPGSRKVVIATNIAETSLTIDGIYYVVDPGFVKQKVYNSKTGIDQLVVTPISQ
AQAKQRAGRAGRTGPGKCYRLYTERAYRDEMLTTNVPEIQRTNLASTVLSLKAMGINDLL
SFDFMDAPPMETLITAMEQLYTLGALDDEGLLTRLGRRMAEFPLEPMLCKMLIMSVHLGC
SEEMLTIVSMLSVQNVFYRPKDKQALADQKKAKFHQTEGDHLTLLAVYNSWKNNKFSNPW
CYENFIQARSLRRAQDIRKQMLGIMDRHKLDVVSCGKSTVRVQKAICSGFFRNAAKKDPQ
EGYRTLIDQQVVYIHPSSALFNRQPEWVVYHELVLTTKEYMREVTTIDPRWLVEFAPAFF
KVSDPTKLSKQKKQQRLEPLYNRYEEPNAWRISRAFRRR

>gi568815581r:43428522_43645617|GENSCAN_predicted_CDS_3|3540_bp
atggctgtggctgtagccatggcgggagccttaatcgggtcggagccaggccccgcggaa
gaacttgccaaactcgagtacctgtctttggtgtcaaaggtttgcactgagctggacaat
cacttggggatcaacgacaaggaccttgctgaatttgtgatcagtcttgctgagaaaaat
accacctttgatacttttaaggcttctctcgtcaaaaatggtgcagaatttacgaccatg
ttggatgaagatgatgtgaaagttgctgtggatgtcctgaaagaactggaagctttaatg
cccagcgcagcaggccaggagaagcaaagagatgctgaacaccggtttgtccttagtgtc
ctgtcctttggaagtttaggggacaggacaaagaagaagaagcggagtcgaagccgagat
cgaaaccgagatcgagacagagatagggaacgaaaccgagatagagaccacaagcggaga
caccgatcccgctctcgatcacgttccaggacccgggagaggaataaagtgaagtctaga
tatcggtccaggagcaggagtcagagtccccccaaagaccggaaggaccgggacaaatat
ggagagcggaatctggatagatggcgggataagcatgtggaccgccctcctccagaagag
cccaccattggtgacatttataatggcaaagttaccagcatcatgcagtttggttgcttt
gtgcagctggaaggactaaggaagcggtgggaaggcctggtgcacatctctgagctccgg
cgggagggtcgtgtggccaatgtagctgatgtcgtgagcaaaggccagagggtcaaagtc
aaagtgctgtccttcactgggaccaagaccagcctgagcatgaaggatgtggatcaagag
actggagaagatctaaacccaaatagacggcgaaatcttgtcggggagaccaatgaggag
acctcaatgcggaatcctgatagacccactcacttgtcccttgtcagtgctcctgaagta
gaggacgactcactggaacgcaagcgcctcacccgaatctctgacccagagaagtgggag
atcaaacagatgattgctgccaatgtcctttccaaagaagaatttccagactttgatgaa
gagactggcattctccctaaggtggatgatgaagaagatgaggaccttgagattgaattg
gttgaggaagagcctccattcctgagagggcacactaagcaaagcatggacatgagcccc
attaaaattgtcaagaacccagacggctccctctcccaagcagcaatgatgcagagtgcc
ttggccaaagaaaggcgggaactcaaacaggcccagcgggaagctgagatggattctatt
cccatgggactcaacaaacactgggttgaccctctgcctgatgcggaaggcagacagatt
gctgccaacatgaggggtattgggatgatgcccaatgatattcctgagtggaagaagcat
gcctttgggggcaacaaagcctcttacggaaaaaagacccagatgtcaatccttgagcag
agggagagcctgcccatctacaaactgaaggagcaattggtccaggccgtccatgacaat
cagatcctgattgtcattggtgagacaggatctggaaagacaacacagatcacccagtac
ctggcggaggcaggctacacttccaggggcaagattgggtgtacccagcccagaagagtg
gcagctatgtcggtggccaaaagagtgtcagaggagtttggttgttgcttaggccaagag
gtgggctacaccattcgatttgaggactgcactagccctgaaacagtcatcaagtacatg
acagatgggatgttgcttagagagtgcttgattgaccctgacctcactcagtacgcgatc
atcatgttggacgaggcacatgagaggacaattcacactgatgtgctctttggattgttg
aaaaagacagttcagaaacggcaggacatgaagctgattgtcacctcagccaccttggat
gcagtgaagttttctcaatacttctatgaagctcccattttcaccatcccaggtcgaaca
tatccagtggaaatactgtacacaaaggaacctgagacagattatctggatgccagcctg
attactgttatgcagattcatttaacagaaccaccaggtgatatcctggtcttcctgact
ggtcaggaagaaattgatactgcttgtgagatcctgtatgaaagaatgaaatccctggga
cctgatgttccagagttaattatcctcccagtgtactctgctcttcccagtgagatgcag
acccgaatctttgacccagctccaccaggcagcagaaaggttgtgattgccaccaatatc
gcagagacatcgctgactattgatggtatctactatgtggtggacccaggattcgtgaaa
cagaaagtttacaattccaagacagggattgaccagctcgtggtgacgcctatttctcag
gctcaggcaaagcaacgagctggcagagctgggagaacaggcccagggaagtgttacagg
ttgtacacagaacgtgcctaccgagatgaaatgctgaccaccaacgtgccggaaatccag
agaaccaacttagcaagcacagtgctgtcactcaaggccatgggtatcaatgatctgctg
tcctttgatttcatggatgccccacctatggaaactttgatcacagccatggagcagctg
tacacactgggggccctggatgacgagggcctgctcactcgcttgggccgccggatggca
gagttccctctggagccaatgctatgcaaaatgctcatcatgtctgtgcatctgggctgc
agtgaggaaatgctgaccattgtatccatgctgtctgtgcagaacgtcttctataggccc
aaggataaacaagcccttgcagatcagaagaaggccaaattccaccagactgaaggggac
cacctcaccctgctagctgtgtacaactcctggaagaacaacaagttctccaacccatgg
tgctatgagaactttatccaggctcgttccctgcgccgggcccaggacattcgcaagcag
atgttaggcataatggacagacacaagctggatgttgtttcctgtggcaagtccacagtc
cgagtgcagaaggccatctgcagtgggttcttccgtaatgctgccaagaaagacccgcag
gagggttaccggacactgatcgaccagcaggtggtctatatccatccttccagtgccctc
ttcaacagacagccagaatgggtggtgtaccatgagctggtgctcaccaccaaggaatac
atgcgtgaagttaccaccatcgaccctcggtggcttgtggagtttgccccagccttcttc
aaggtctcagacccaactaagctaagcaaacagaagaagcaacagcgtcttgaacccttg
tacaaccgctatgaggaacccaatgcctggagaatatctcgagctttccgacggcgctga

>gi568815581r:43428522_43645617|GENSCAN_predicted_peptide_4|635_aa
MAERELHFRKSSTLGLLVSAAISRNPSGVLGCPAPESPEDPNLVPQTKRLRVTRGHSPRF
SQKSPGNGSLREALIGPLGKLMDPGSLPPLDSEDLFQDLSHFQETWLAEATMPPSPSFCL
FPFLKTLGDKYVPSTTSELVPSSTEAARAHSADFCQPEDKRLLHVYSCSQEITALSVYSN
PVLGAGASDDKKDKALALTELNFLVEEIAQVPDSDEQFVPDFHSENLAFHSPTTRIKKEP
QSPRTDPALSCSRKPPLPYHHGEQCLYSSAYDPPRQIAIKSPAPGALGQSPLQPFPRAEQ
RNFLRSSGTSQPHPGHGYLGEHSSVFQQPLDICHSFTSQGGGREPLPAPYQHQLSEPCPP
YPQQSFKQEYHDPLYEQAGQPAVDQGGVNGHRYPGAGVVIKQEQTDFAYDSDVTGCASMY
LHTEGFSGPSPGDGAMDLSNPQFPPQGYGYEKPLRPFPDDVCVVPEKFEGDIKQEGVGAF
REGPPYQRRGALQLWQFLVALLDDPTNAHFIAWTGRGMEFKLIEPEEVARLWGIQKNRPA
MNYDKLSRSLRYYYEKGIMQKVAGERYVYKFVCEPEALFSLAFPDNQRPALKAEFDRPVS
EEDTVPLSHLDESPAYLPELAGPAQPFGPKGGYSY

>gi568815581r:43428522_43645617|GENSCAN_predicted_CDS_4|1908_bp
atggctgagagggagctacactttcggaaatcatctaccctggggcttctggtttctgct
gcaatcagtagaaatcccagcggagtcctgggctgccccgcccctgagtcacccgaggac
cccaacctcgtcccccagactaagcgcctcagggtgactcgcgggcattctccccgcttc
tcgcagaaatcgcccggaaatgggagcttgcgcgaagcgctgatcggcccgctggggaag
ctcatggacccgggctccctgccgcccctcgactctgaagatctcttccaggatctaagt
cacttccaggagacgtggctcgctgaagctacaatgcctccttctccatctttctgtctc
ttccccttcctgaagactcttggggacaaatatgtcccctccacaactagtgaactggtt
ccttccagtacagaggctgccagggcccacagtgctgacttctgccagcctgaggacaaa
cgattattgcatgtttattcatgtagtcaagaaataacagcactgagcgtctactctaac
cctgtgctgggtgctggggcatcagatgacaaaaaagataaagctcttgccctcacagag
cttaacttcctggtggaggagatagctcaggtaccagacagtgatgagcagtttgttcct
gatttccattcagaaaacctagctttccacagccccaccaccaggatcaagaaggagccc
cagagtccccgcacagacccggccctgtcctgcagcaggaagccgccactcccctaccac
catggcgagcagtgcctttactccagtgcctatgacccccccagacaaatcgccatcaag
tcccctgcccctggtgcccttggacagtcgcccctacagccctttccccgggcagagcaa
cggaatttcctgagatcctctggcacctcccagccccaccctggccatgggtacctcggg
gaacatagctccgtcttccagcagcccctggacatttgccactccttcacatctcaggga
gggggccgggaacccctcccagccccctaccaacaccagctgtcggagccctgcccaccc
tatccccagcagagctttaagcaagaataccatgatcccctgtatgaacaggcgggccag
ccagccgtggaccagggtggggtcaatgggcacaggtacccaggggcgggggtggtgatc
aaacaggaacagacggacttcgcctacgactcagatgtcaccgggtgcgcatcaatgtac
ctccacacagagggcttctctgggccctctccaggtgacggggccatggatctgagcaac
ccccaatttcctccacaaggctatggctatgagaaacctctgcgaccattcccagatgat
gtctgcgttgtccctgagaaatttgaaggagacatcaagcaggaaggggtcggtgcattt
cgagaggggccgccctaccagcgccggggtgccctgcagctgtggcaatttctggtggcc
ttgctggatgacccaacaaatgcccatttcattgcctggacgggccggggaatggagttc
aagctcattgagcctgaggaggtcgccaggctctggggcatccagaagaaccggccagcc
atgaattacgacaagctgagccgctcgctccgatactattatgagaaaggcatcatgcag
aaggtggctggtgagcgttacgtgtacaagtttgtgtgtgagcccgaggccctcttctct
ttggccttcccggacaatcagcgtccagctctcaaggctgagtttgaccggcctgtcagt
gaggaggacacagtccctttgtcccacttggatgagagccccgcctacctcccagagctg
gctggccccgcccagccatttggccccaagggtggctactcttactag

>gi568815581r:43428522_43645617|GENSCAN_predicted_peptide_5|313_aa
MTFLIYEVVLAALIWVLWGGGFEFLGAQSAVAIGRKGPTFLGREVELPLKEAEGGCQKRE
GEPALSFSFVNNLSKELSTLTASTSSPPMDSSTRFPPFHGNDLWFNEVQTLVNFEIALAK
VINGQLVFKLRLGSCPHALLALGWKDSENSKDIKQIHNGLSSINMSQVQTEMPQAPTWFN
ENIHQLCLCSHFTNKQGGEMDSEIVPGEPARFASSVKRGNNPRPVSPHDRKGSLRQGSSA
LMLYLGLIARESGLDGSPEHIKHCPLPRCSDCSSCHGDKDIFFLFQRWASEQSRQIETFS
AGQLASAGRSDPS

>gi568815581r:43428522_43645617|GENSCAN_predicted_CDS_5|942_bp
atgacctttctgatttacgaggtggtgctggctgcactcatttgggttctgtggggaggt
ggctttgagttcctcggggcccagtctgcagtggctattgggagaaaaggccccaccttc
ctgggcagagaagtggagcttccacttaaagaggcagaaggaggctgtcagaagagggaa
ggggaaccagcgctctccttcagctttgtcaataacttatcgaaggagctgtccacactc
actgcctccacttcctcacctcccatggattcctcaactcgctttccacccttccatgga
aatgatctgtggtttaatgaggtgcagactttggtcaattttgaaattgctcttgctaaa
gtcatcaatggccaacttgtcttcaaactacggcttggttcttgtcctcatgctcttctc
gctctgggatggaaggactcggagaattcgaaggatattaaacagattcataatggactc
agcagtataaacatgtctcaagtgcagacagaaatgccacaagctcctacttggtttaat
gaaaacatacaccagttgtgtctgtgttcccattttacaaataaacaaggtggggagatg
gactcggaaattgtccctggggagccagccaggtttgcctcatctgtgaaacggggtaac
aatcccagacctgtgtcgccgcatgatcgcaagggctctctgagacaagggagctcggct
ctaatgctgtacctgggcctcattgccagagagtctggtttagatgggtccccagagcac
attaagcactgtcctctgccaaggtgctcagactgcagctcctgtcatggggacaaggat
atcttcttcctgttccagagatgggccagcgagcagtctagacagatagagaccttcagt
gctggccagctggcttctgctggcaggtcggacccgtcgtga

>gi568815581r:43428522_43645617|GENSCAN_predicted_peptide_6|98_aa
XNQENRGKPEGSSKARKERTAFTKEQLRELEAEFAHHNYLTRLRRYEIAVNLDLSERQVK
VWFQNRRMKWKRVKGGQPISPNGQDPEDGDSTASPSSE

>gi568815581r:43428522_43645617|GENSCAN_predicted_CDS_6|297_bp
nacaaccaggagaacagagggaagccggagggcagcagcaaagcccgcaaggagaggacg
gccttcaccaaggagcagctgcgagagctggaggcagagtttgcccatcataactacctg
actcggctccgcagatatgagattgcggtaaacctggacctctctgagcgccaggtcaaa
gtgtggttccagaaccgaaggatgaagtggaagcgtgtgaagggaggtcagcccatctcc
cccaatgggcaggaccctgaggatggggactccacagcctctccaagttcagagtga