GENSCAN 1.0 Date run: 5-Nov-116 Time: 06:45:24 Sequence gi568815594r:25648428_25947567 : 299140 bp : 44.23% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 7279 7463 185 0 2 90 105 69 0.780 7.43 1.02 Intr + 13709 13856 148 2 1 79 92 22 0.913 1.94 1.03 Intr + 14071 14185 115 0 1 118 106 72 0.961 12.02 1.04 Intr + 14278 14415 138 2 0 95 98 119 0.881 14.04 1.05 Intr + 15775 15903 129 2 0 100 94 36 0.974 6.07 1.06 Intr + 17701 17844 144 2 0 92 84 176 0.999 17.85 1.07 Intr + 19453 19564 112 2 1 87 116 79 0.998 10.04 1.08 Intr + 21220 21415 196 1 1 86 93 266 0.999 26.32 1.09 Intr + 22311 22406 96 2 0 83 102 87 0.998 9.81 1.10 Intr + 23174 23294 121 1 1 54 82 152 0.579 11.27 1.11 Intr + 24660 24827 168 1 0 103 82 157 0.998 16.52 1.12 Intr + 25869 25985 117 1 0 116 97 88 0.997 12.94 1.13 Intr + 26078 26202 125 0 2 119 71 141 0.999 15.80 1.14 Term + 27708 28322 615 2 0 130 50 894 0.985 84.26 1.15 PlyA + 29235 29240 6 1.05 2.00 Prom + 30369 30408 40 -5.06 2.01 Init + 30423 30556 134 2 2 83 25 93 0.678 2.11 2.02 Term + 33819 34002 184 0 1 55 38 153 0.676 4.02 2.03 PlyA + 34183 34188 6 1.05 3.00 Prom + 38686 38725 40 -0.56 3.01 Init + 45973 46035 63 0 0 42 96 58 0.527 3.15 3.02 Term + 51068 51088 21 1 0 124 48 16 0.530 -0.49 3.03 PlyA + 52000 52005 6 1.05 4.28 PlyA - 52513 52508 6 1.05 4.27 Term - 65085 65000 86 1 2 90 37 102 0.883 3.12 4.26 Intr - 73503 73398 106 0 1 21 106 104 0.682 5.29 4.25 Intr - 77716 77632 85 0 1 91 93 13 0.126 1.92 4.24 Intr - 100503 100472 32 0 2 100 52 50 0.002 -0.47 4.23 Intr - 109363 109261 103 0 1 84 82 28 0.545 1.98 4.22 Intr - 110641 110514 128 1 2 69 105 57 0.876 4.98 4.21 Intr - 117008 116899 110 0 2 60 97 160 0.715 14.10 4.20 Intr - 118409 118238 172 2 1 100 52 -12 0.342 -4.08 4.19 Intr - 119403 119313 91 2 1 54 100 56 0.772 3.40 4.18 Intr - 127933 127850 84 0 0 121 103 126 0.993 16.34 4.17 Intr - 130776 130649 128 0 2 99 38 37 0.994 -0.72 4.16 Intr - 133991 133815 177 2 0 60 89 141 0.995 11.52 4.15 Intr - 139937 139797 141 2 0 24 116 179 0.900 14.85 4.14 Intr - 142147 142028 120 1 0 132 98 85 0.999 14.59 4.13 Intr - 154035 153856 180 0 0 99 80 111 0.832 11.46 4.12 Intr - 156325 156114 212 2 2 75 80 95 0.983 6.03 4.11 Intr - 162984 162851 134 2 2 92 62 68 0.704 4.89 4.10 Intr - 168266 168221 46 0 1 79 36 48 0.133 -3.83 4.09 Intr - 169851 169711 141 1 0 78 48 103 0.509 5.62 4.08 Intr - 171513 171381 133 0 1 102 103 53 0.996 8.42 4.07 Intr - 173701 173569 133 0 1 109 109 100 0.999 14.85 4.06 Intr - 181729 181671 59 1 2 103 97 41 0.949 4.18 4.05 Intr - 184683 184463 221 1 2 92 44 119 0.673 5.92 4.04 Intr - 185142 185021 122 2 2 72 96 2 0.978 -0.46 4.03 Intr - 186859 186770 90 0 0 87 56 110 0.646 6.81 4.02 Intr - 195410 195241 170 2 2 69 33 112 0.183 2.54 4.01 Init - 214322 214248 75 2 0 102 93 157 0.660 16.69 4.00 Prom - 215514 215475 40 -5.36 5.03 PlyA - 216363 216358 6 1.05 5.02 Term - 216979 216793 187 0 1 118 49 50 0.807 1.06 5.01 Init - 218653 218541 113 1 2 58 110 61 0.653 4.99 5.00 Prom - 219072 219033 40 -1.26 6.00 Prom + 231869 231908 40 -3.76 6.01 Init + 234591 234714 124 2 1 81 100 115 0.682 12.45 6.02 Term + 243646 243758 113 2 2 65 38 106 0.263 2.02 6.03 PlyA + 244297 244302 6 1.05 7.00 Prom + 250667 250706 40 -4.56 7.01 Init + 260508 260533 26 2 2 86 86 35 0.368 2.14 7.02 Intr + 262784 262885 102 2 0 53 108 40 0.361 1.79 7.03 Intr + 265769 265995 227 2 2 90 79 253 0.809 22.23 7.04 Intr + 279886 279942 57 2 0 78 97 40 0.080 2.76 7.05 Term + 298671 298834 164 1 2 127 51 45 0.012 2.80 7.06 PlyA + 298924 298929 6 -0.45 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init + 93946 94013 68 0 2 86 96 43 0.900 3.45 S.002 Term + 94426 94489 64 1 1 89 47 76 0.870 0.96 S.003 Term - 213771 213604 168 0 0 71 47 74 0.904 -0.52 S.004 Init + 224009 224106 98 1 2 93 75 59 0.869 4.88 S.005 Term + 224987 225197 211 2 1 75 46 84 0.809 -0.33 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815594r:25648428_25947567|GENSCAN_predicted_peptide_1|802_aa MGSLRRQVGSAPAAAAAGGPGMCEGRDDSGQPLCATPSPYIPGALRSTWPPPPAQHLRRE RCAGITGVSHGARCTLLLTEGESQDKNKFTRLKVVHAANSTLYEVVCFSDWTMAPWPELG DAQPNPDKYLEGAAGQQPTAPDKSKETNKTDNTEAPVTKIELLPSYSTATLIDEPTEVDD PWNLPTLQDSGIKWSERDTKGKILCFFQGIGRLILLLGFLYFFVCSLDILSSAFQLVGGK MAGQFFSNSSIMSNPLLGLVIGVLVTVLVQSSSTSTSIVVSMVSSSLLTVRAAIPIIMGA NIGTSITNTIVALMQVGDRSEFRRAFAGATVHDFFNWLSVLVLLPVEVATHYLEIITQLI VESFHFKNGEDAPDLLKVITKPFTKLIVQLDKKVISQIAMNDEKAKNKSLVKIWCKTFTN KTQINVTVPSTANCTSPSLCWTDGIQNWTMKNVTYKENIAKCQHIFVNFHLPDLAVGTIL LILSLLVLCGCLIMIVKILGSVLKGQVATVIKKTINTDFPFPFAWLTGYLAILVGAGMTF IVQSSSVFTSALTPLIGIGVITIERAYPLTLGSNIGTTTTAILAALASPGNALRSSLQIA LCHFFFNISGILLWYPIPFTRLPIRMAKGLGNISAKYRWFAVFYLIIFFFLIPLTVFGLS LAGWRVLVGVGVPVVFIIILVLCLRLLQSRCPRVLPKKLQNWNFLPLWMRSLKPWDAVVS KFTGCFQMRCCCCCRVCCRACCLLCDCPKCCRCSKCCEDLEEAQEGQDVPVKAPETFDNI TISREAQGEVPASDSKTECTAL >gi568815594r:25648428_25947567|GENSCAN_predicted_CDS_1|2409_bp atgggttcattaaggcggcaggtaggcagtgccccggcggcggctgcggcaggcggtcct ggaatgtgcgaggggcgtgatgacagcggccagcctctttgcgcaacaccttcgccatat atacccggggcgctgcgctccacctggccgccgcctccagcccagcacctgcggagggag cgctgtgctgggattacaggcgtgagccacggtgcccgttgcaccctgcttttgactgaa ggagagagtcaagacaaaaacaagtttacgagattaaaggtggtacatgcagctaactca acactgtacgaggtagtttgctttagcgactggaccatggctccctggcctgaattggga gatgcccagcccaaccccgataagtacctcgaaggggccgcaggtcagcagcccactgcc cctgataaaagcaaagagaccaacaaaacagataacactgaggcacctgtaaccaagatt gaacttctgccgtcctactccacggctacactgatagatgagcccactgaggtggatgac ccctggaacctacccactcttcaggactcggggatcaagtggtcagagagagacaccaaa gggaagattctctgtttcttccaagggattgggagattgattttacttctcggatttctc tactttttcgtgtgctccctggatattcttagtagcgccttccagctggttggaggaaaa atggcaggacagttcttcagcaacagctctattatgtccaaccctttgttggggctggtg atcggggtgctggtgaccgtcttggtgcagagctccagcacctcaacgtccatcgttgtc agcatggtgtcctcttcattgctcactgttcgggctgccatccccattatcatgggggcc aacattggaacgtcaatcaccaacactattgttgcgctcatgcaggtgggagatcggagt gagttcagaagagcttttgcaggagccactgtccatgacttcttcaactggctgtccgtg ttggtgctcttgcccgtggaggtggccacccattacctcgagatcataacccagcttata gtggagagcttccacttcaagaatggagaagatgccccagatcttctgaaagtcatcact aagcccttcacaaagctcattgtccagctggataaaaaagttatcagccaaattgcaatg aacgatgaaaaagcgaaaaacaagagtcttgtcaagatttggtgcaaaacttttaccaac aagacccagattaacgtcactgttccctcgactgctaactgcacctccccttccctctgt tggacggatggcatccaaaactggaccatgaagaatgtgacctacaaggagaacatcgcc aaatgccagcatatctttgtgaatttccacctcccggatcttgctgtgggcaccatcttg ctcatactctccctgctggtcctctgtggttgcctgatcatgattgtcaagatcctgggc tctgtgctcaaggggcaggtcgccactgtcatcaagaagaccatcaacactgatttcccc tttccctttgcatggttgactggctacctggccatcctcgtcggggcaggcatgaccttc atcgtacagagcagctctgtgttcacgtcggccttgacccccctgattggaatcggcgtg ataaccattgagagggcttatccactcacgctgggctccaacatcggcaccaccaccacc gccatcctggccgccttagccagccctggcaatgcattgaggagttcactccagatcgcc ctgtgccactttttcttcaacatctccggcatcttgctgtggtacccgatcccgttcact cgcctgcccatccgcatggccaaggggctgggcaacatctctgccaagtatcgctggttc gccgtcttctacctgatcatcttcttcttcctgatcccgctgacggtgtttggcctctcg ctggccggctggcgggtgctggttggtgtcggggttcccgtcgtcttcatcatcatcctg gtactgtgcctccgactcctgcagtctcgctgcccacgcgtcctgccgaagaaactccag aactggaacttcctgccgctgtggatgcgctcgctgaagccctgggatgccgtcgtctcc aagttcaccggctgcttccagatgcgctgctgctgctgctgccgcgtgtgctgccgcgcg tgctgcttgctgtgtgactgccccaagtgctgccgctgcagcaagtgctgcgaggacttg gaggaggcgcaggaggggcaggatgtccctgtcaaggctcctgagacctttgataacata accattagcagagaggctcagggtgaggtccctgcctcggactcaaagaccgaatgcacg gccttgtag >gi568815594r:25648428_25947567|GENSCAN_predicted_peptide_2|105_aa MGHQQLYWSHPKKFGQGSRSCRVYSNQHGLIRKYGLNKCRQCFCQILWIRDLEQKQAQHP ASTVTRLANEATIKDTASILPDWWGQQNNPCQKRPALIADHGVNK >gi568815594r:25648428_25947567|GENSCAN_predicted_CDS_2|318_bp atgggtcaccagcagctgtactggagccacccaaaaaaattcggccagggttctcgttct tgtcgtgtctattcaaaccagcacggtctgatccggaaatatggcctcaataagtgccgc caatgtttctgtcaaatcttgtggataagagacctggagcagaaacaggcccagcatcca gccagcactgtcaccagacttgcaaatgaagccaccataaaggatacagccagcatcctg cctgactggtggggccagcagaacaacccctgccaaaaacgcccagccctgattgctgac catggagtaaacaaataa >gi568815594r:25648428_25947567|GENSCAN_predicted_peptide_3|27_aa MYNTKSEPDVSYALWVIMMCQIQEIIS >gi568815594r:25648428_25947567|GENSCAN_predicted_CDS_3|84_bp atgtacaacaccaagagtgaacctgatgtcagctatgctctttgggtgataatgatgtgt cagatccaggagataatcagctaa >gi568815594r:25648428_25947567|GENSCAN_predicted_peptide_4|1092_aa MVPSGGVPQGLGGRSACALLLLCYLARPGQAMRLIFGGDAADVYPALQGKPNPDEKSVAI GNTCVPRLGHMPPVHVLLLHSGGENTGIVKKFPRFRNRELEATRRQRMDYPVFTVSLWLY LLHYCKANLCGILYFVDSNEMYGTPSVFLTEEGYLHIQMHLVKGEDLAVKTKFIIPLKEW FRLDISFNGGQVSVCTLHHTTYEHFRLKLSIAIKPGSNKICKSKFVIVVTTSIGQDLKSY HNQTISFREDFHYNDTAGYFIIGGSRYVAGIEGFFGPLKYYRLRSLHPAQIFNPLLEKQL AEQIKLYYERCAEVQEIVSVYASAAKHGGERQEACHLHNSYLDLQRRYGRPSMCRAFPWE KELKDKHPSLFQALLEMDLLTGDRDLLKGFKQEEETYLFPLRSGCSFSVDTSVKHRSVSQ VYQDAVCQLSAVNSSRPESCSVPRNQNESVSEIGGKIFEKAVKRLSSIDGLHQISSIVPF LTDSSCCGYHKASYYLAVFYETGLNVPRDQLQGMLYSLVGGQGSERLSSMNLGYKHYQGI DNYPLDWELSYAYYSNIATKTPLDQHTLQGDQAYVETIRLKDDEILKVQTKEDGDVFMWL KHEATRGNAAAQQRLAQMLFWGQQGVAKNPEAAIEWYAKGALETEDPALIYDYAIVLFKG LHQAVNGLGWYYHKFKKNYAKAAKYWLKAEEMGNPDASYNLGVLHLDGIFPGVPGRNQTL AGEYFHKAAQGGHMEGTLWCSLYYITGNLETFPRDPEKAVVWAKHVAEKNGYLGHVIRKG LNAYLEGSWHEALLYYVLAAETGIEVSQTNLAHICEERPAIRGTGRRLQDGRRSPSTSTF VSLLWTVSLAVATFPWFQPLPGSSTLFDLSTRLWVSAYLKMGDLYYYGHQNQSQDLELSV QMYAQAALDGDSQGFFNLALLIEEGTIIPHHILDFLEIDSTLHSNNISILQELYERCWSH SNEESFSPCSLAWLYLHLRLLWGAILHSALKSVGIHDLFCRKWMRTDTLTWYFHQRRSNQ GDSILNRGKQSHGKKARKHNCANTFEDVHHVTSTEIPLAKASHMESGEHNDEDDKIIIII IIHHHDHHLDGK >gi568815594r:25648428_25947567|GENSCAN_predicted_CDS_4|3279_bp atggtcccgagtggcggcgtcccccagggcctcggcggccgctctgcctgcgcgctgctc ctgctctgctacctggccagacccggccaggccatgaggctgatttttggtggagacgca gcagacgtttacccagccctgcagggaaaacctaacccagatgagaagtcggtagccatt gggaacacgtgtgtgccacgactcggccacatgcccccggtgcacgtgttactgctgcat tctggtggagaaaacacaggcattgtcaagaagttcccgaggtttcggaaccgagagctg gaggccactcgacgccagaggatggattacccagtgtttactgtttcattgtggctttat ttactccattattgcaaggccaacctctgtgggattctgtactttgttgactctaatgag atgtacggcacaccttctgtatttcttacggaagagggctatttgcatattcagatgcat cttgtcaaaggggaagaccttgctgtaaaaactaaattcatcatacctttgaaggagtgg tttcgactggatatctcttttaacggaggccaggtatctgtatgcacgcttcatcacacg acatacgagcattttcgcttaaagttaagcattgctataaagccggggagcaacaaaatc tgcaaaagcaaatttgtgatagtagtaaccactagcattggacaggatttgaaaagctac cacaatcagaccattagcttccgggaggatttccattataatgacacagctgggtacttc attattggagggagcaggtatgtggctggcattgaagggttttttggacccctgaagtac tatcgccttcgcagtctgcaccccgcccagatttttaatcccctccttgagaagcaactt gctgaacaaatcaagttatattatgaaaggtgtgctgaggttcaagaaatagtatctgtg tatgcatctgcagcaaagcacgggggcgagagacaagaagcatgccacctccacaactcc tacctggacctccagcgcaggtatgggagaccctcgatgtgcagagccttcccctgggag aaggagctgaaagacaaacaccccagcttgttccaggcattgctggagatggatctgctg accggtgacagggacctgctgaagggtttcaagcaagaagaagagacatacctgtttcct ctgaggagcggctgctctttctcggtagacacttctgtaaaacatcgttctgtttctcag gtctaccaggacgctgtttgccagctttccgctgtcaacagcagtagaccagaatcctgc tcagtgccaaggaaccaaaatgaatctgtatcagaaatcggtgggaagatatttgagaag gctgtaaagagactctctagcattgatggtcttcaccaaattagctctatcgtccccttt ctgacggattccagctgctgtggataccataaagcatcctactaccttgcagtcttttat gagactggattaaatgttcctcgggatcagctgcagggcatgttgtatagtttggttgga ggccaggggagtgagaggctgtcttcaatgaatcttgggtataaacactaccagggtatt gacaactaccccctggactgggaactgtcgtatgcctactacagcaacattgccaccaag acaccccttgaccagcacacactgcaaggagatcaggcatatgttgaaacaattagacta aaagatgatgaaatactcaaggtacaaaccaaagaagatggagatgtctttatgtggttg aagcatgaagctacccgaggcaatgcagcagctcagcaacgattggcccagatgctgttc tgggggcagcaaggtgtggccaagaatcccgaagcagcaattgagtggtacgccaagggc gccctggagacggaggatcctgcgttaatctatgactatgccattgtgctattcaaggga ttgcatcaggcagtcaatggcctgggatggtattaccacaaattcaagaaaaattacgcc aaagcagcaaagtactggttaaaagcagaagaaatggggaacccagatgcgtcatacaat cttggagtcctgcatttggatggcatcttccctggagttcctggaaggaatcaaacttta gctggtgaatatttccataaggctgcgcaaggtggacacatggaagggaccttgtggtgt tctctctactatatcacaggcaacctggagacattccctagagatcctgagaaagctgtt gtatgggcaaaacatgtagctgagaaaaatggctacttgggccatgtcatccgcaaaggc ctcaatgcctacctggaaggttcatggcatgaagctttgctgtattatgttttagcagca gaaactggaattgaagtgtcacagacaaatttagcacacatctgtgaggagaggccagca ataagaggcactggcagaagattgcaggatgggagaagaagtccaagtacttctaccttt gtctctctgctttggactgtgtctctggcagtggctacatttccatggttccagcccctg cctggcagctccaccctctttgatctcagcaccaggctctgggtctctgcatatttgaag atgggagacctttactactatggccaccaaaaccagtcacaagacctggagttgtctgtg cagatgtacgcccaagccgccctggatggagactcccagggattttttaacctggccctg ctaatcgaggaaggtacgataatcccacaccatatcttggatttcttggaaattgactca actctccattctaataacatctccattctccaggaactgtacgaaaggtgctggagccac agtaacgaggagtccttcagcccctgctccttggcctggctttacctgcacttgcggctt ctctggggtgctatcctgcactcagccctgaagagtgttggaatccacgacctcttctgt aggaagtggatgagaactgatactctgacctggtactttcatcagaggcgttcaaaccag ggggactccatcttgaataggggcaagcaatctcatggcaaaaaggctcgaaagcacaac tgtgcaaacacatttgaagacgtccatcatgtcacttccactgagatcccactggcgaaa gcaagtcacatggagtcaggagaacataatgatgaagatgataaaataatcatcatcatc atcattcatcatcatgatcatcatcttgatggcaaatag >gi568815594r:25648428_25947567|GENSCAN_predicted_peptide_5|99_aa MPAGGTENRPIAFRPEAMREDSLKLAAFASASKIYVGGMKQLIPPPLAQDERATHTSLLE DQTLAGFLHAWCSLQKQSLPAYLWNPGASGGAQSTVGTQ >gi568815594r:25648428_25947567|GENSCAN_predicted_CDS_5|300_bp atgcctgctgggggcacagagaaccgacccattgcgtttcgacctgaggccatgagggaa gattccctaaaactggcagcatttgcctcggcctcaaagatttatgtgggagggatgaag cagttaatcccaccacctctagctcaggatgagagagcaactcacacatccctcttagaa gaccaaactctggccggttttctgcatgcctggtgctccttgcagaaacagagccttcct gcctacctctggaatcctggggcctcaggtggtgcccagtccacagtaggaacacaatag >gi568815594r:25648428_25947567|GENSCAN_predicted_peptide_6|78_aa MDSDVSMPLLAILHPVAMGKLRIEKHGFWFMYASKNAKQDRKKHHPAKKLLLNAGLMIAR VIECSDLPSSHFGTGCIS >gi568815594r:25648428_25947567|GENSCAN_predicted_CDS_6|237_bp atggattctgacgtatcaatgccacttttagccatcctccatcctgtcgctatgggcaaa ctccgcattgagaaacatggtttctggttcatgtatgccagtaagaatgccaaacaggac agaaaaaagcatcaccctgcaaagaagctgctcttgaatgcaggcctcatgattgccaga gtcatcgaatgttcggacttgcctagttctcactttggaactgggtgtatcagctag >gi568815594r:25648428_25947567|GENSCAN_predicted_peptide_7|191_aa MDQIYVFRRSEGKEKLEECGFLRNGENTGIKEMSLHLAGLSGKPGSESLSTSSERGHGPA VGNLVSESAGRSAGQGSPGPDAMSRNLRTALIFGGFISLIGAAFYPIYFRPLMRLEEYKK EQAINRAGIVQEDVQPPGNSSHRKCVLKGRTQVAPFYTQTVAWRMVNGPPYSSIASRITS ATTLFPNKRSL >gi568815594r:25648428_25947567|GENSCAN_predicted_CDS_7|576_bp atggaccagatctatgtcttcagaaggtcagagggaaaagaaaaactggaggagtgtggt tttctcagaaatggggaaaacacgggaatcaaagagatgagcttgcatttggcaggactc agtggcaagcccggaagcgaaagcctctccacctcttccgagcggggtcacggcccggcc gtcggtaacctggtttccgagagtgccgggcggtcggcgggtcagggcagcccggggcct gacgccatgtcccggaacctgcgcaccgcgctcattttcggcggcttcatctccctgatc ggcgccgccttctatcccatctacttccggcccctaatgagattggaggagtacaagaag gaacaagctataaatcgggctggaattgttcaagaggatgtgcagccaccaggaaactcc tctcacagaaagtgtgtgctgaagggacggacccaggtggcaccattctacacccaaact gtggcttggcgtatggttaatggcccaccctattccagtatagcttcacgaattacatct gcaaccaccctgtttccaaataagcgctcactctga