GENSCAN 1.0 Date run: 6-Nov-116 Time: 22:46:27 Sequence gi568815589r:120664562_120888258 : 223697 bp : 43.06% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.06 PlyA - 1240 1235 6 1.05 1.05 Term - 24069 24052 18 0 0 126 38 16 0.297 -1.28 1.04 Intr - 32422 32326 97 0 1 78 88 7 0.440 -0.29 1.03 Intr - 33884 33738 147 1 0 63 105 64 0.133 4.95 1.02 Intr - 49591 49197 395 1 2 68 109 468 0.706 40.35 1.01 Init - 49773 49648 126 0 0 94 43 281 0.999 22.36 1.00 Prom - 54430 54391 40 -3.66 2.00 Prom + 55991 56030 40 -8.56 2.01 Init + 56213 56535 323 1 2 65 70 282 0.297 20.91 2.02 Intr + 56717 57040 324 2 0 12 51 495 0.220 33.09 2.03 Intr + 62579 62665 87 2 0 74 97 56 0.124 4.19 2.04 Intr + 81807 81950 144 0 0 -16 45 158 0.034 0.50 2.05 Term + 96566 96821 256 2 1 89 49 120 0.377 3.26 2.06 PlyA + 97077 97082 6 1.05 3.07 PlyA - 98997 98992 6 1.05 3.06 Term - 100286 99998 289 1 1 96 47 337 0.998 25.25 3.05 Intr - 106956 106787 170 0 2 106 95 22 0.946 3.44 3.04 Intr - 108279 108193 87 0 0 70 113 45 0.976 5.37 3.03 Intr - 111665 111532 134 0 2 33 97 162 0.999 11.96 3.02 Intr - 113984 113790 195 0 0 81 63 152 0.992 11.39 3.01 Init - 123310 123208 103 1 1 49 110 69 0.706 5.60 3.00 Prom - 124363 124324 40 -8.46 4.00 Prom + 124518 124557 40 -4.26 4.01 Init + 126959 126986 28 1 1 66 65 6 0.424 -3.92 4.02 Intr + 128791 128928 138 2 0 74 90 191 0.776 18.34 4.03 Term + 134141 135117 977 0 2 58 45 340 0.359 18.93 4.04 PlyA + 137188 137193 6 1.05 5.10 PlyA - 137576 137571 6 1.05 5.09 Term - 153602 153345 258 2 0 83 34 199 0.989 9.55 5.08 Intr - 156418 156278 141 1 0 84 90 56 0.949 5.95 5.07 Intr - 156903 156794 110 1 2 27 100 11 0.751 -3.80 5.06 Intr - 160124 159933 192 0 0 84 77 193 0.764 17.26 5.05 Intr - 162346 162204 143 0 2 60 110 -4 0.491 -1.00 5.04 Intr - 164647 164538 110 1 2 104 89 39 0.508 4.68 5.03 Intr - 167384 167271 114 2 0 65 101 30 0.827 2.64 5.02 Intr - 168351 168273 79 2 1 69 51 69 0.602 0.85 5.01 Init - 178348 178176 173 1 2 95 66 318 0.973 27.21 5.00 Prom - 184266 184227 40 -1.96 6.17 PlyA - 184633 184628 6 1.05 6.16 Term - 193725 193383 343 2 1 73 42 373 0.932 25.18 6.15 Intr - 195624 195529 96 2 0 62 99 118 0.901 9.42 6.14 Intr - 196613 196528 86 2 2 134 64 67 0.917 7.52 6.13 Intr - 197444 197357 88 1 1 124 8 48 0.871 0.37 6.12 Intr - 198188 198027 162 1 0 89 52 299 0.955 25.49 6.11 Intr - 199555 199488 68 1 2 103 90 57 0.992 5.10 6.10 Intr - 201269 201149 121 1 1 76 95 145 0.999 14.50 6.09 Intr - 201535 201467 69 0 0 97 81 119 0.998 10.40 6.08 Intr - 202404 202309 96 2 0 129 94 137 0.999 17.52 6.07 Intr - 204769 204621 149 1 2 100 61 341 0.999 31.63 6.06 Intr - 205384 205284 101 2 2 107 88 115 0.999 13.23 6.05 Intr - 205977 205882 96 1 0 73 92 130 0.918 11.88 6.04 Intr - 209499 209418 82 0 1 110 101 49 0.991 7.61 6.03 Intr - 210195 209995 201 0 0 48 59 242 0.717 16.78 6.02 Intr - 211568 211391 178 1 1 69 15 41 0.219 -5.18 6.01 Init - 212906 212791 116 2 2 58 -14 250 0.787 9.48 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815589r:120664562_120888258|GENSCAN_predicted_peptide_1|260_aa MRSLPSLGGLALLCCAAAAAAAAVASAASAGNVTGGGGAAGQGDGSHGPGPEDRAPARHR PPTPGCDFSSPVPGDHPSLGDCWTLFHHLSGAARPLADHPSGGGTHFDHLSGADQTRADH PFDDHWPGADHPCSDHRTGAHDSPDPDPRSPQQQQQQRPPHPTCHRGPLFASSRMAQVKL ELSCVMLPVTAVMVLLRTGLNRRHFKKANAKMEIPKTAIVQFWLVISQIRQRSSCYSTID PTRFEYVFQLDYILQDLTTF >gi568815589r:120664562_120888258|GENSCAN_predicted_CDS_1|783_bp atgaggagcctgccgagcctgggcggcctcgccctgttgtgctgcgccgccgccgccgcc gccgccgccgtcgcctcagccgcctcggcggggaatgtcaccggtggcggcggggccgcg gggcagggcgacggctcccacggcccaggccccgaggaccgggcccccgcgcgccaccgt ccaccgacccctggctgcgacttctccagcccagtccccggagaccacccctctttgggc gactgctggaccctcttccaccacctttcaggcgccgctcggcccctcgccgaccacccc tccggcggcggaacgcacttcgaccacctctcaggcgccgaccagacccgcgccgaccac cctttcgacgaccactggcccggcgccgaccacccctgtagcgaccaccgtaccggcgcc cacgactccccggaccccgacccccgatctccccagcagcagcaacagcagcgtcctccc caccccacctgccaccgaggccccctcttcgcctcctccagaatggcccaggtcaagcta gaattatcctgtgtcatgttgccagttacagcagtaatggtactgctcagaactggacta aatagaagacattttaaaaaggcaaatgcaaagatggaaatccccaaaacagccattgtc cagttctggttagttatctctcagatcagacaaaggagcagctgttattcaactatagat ccaactagattcgaatatgttttccaactagattacatcctacaggatttaactaccttc taa >gi568815589r:120664562_120888258|GENSCAN_predicted_peptide_2|377_aa MQELYSASRPLKGACIADCLQITVETAILIETLFSLGVQEQWSSCSIFSTQEHAVAVFAE AGMPVFTWKGKMKEGYPWCIEQTLYFKDGPLNMILDDGGDLTNLIHTKWHQVDQDVMIAS KVAVVAGYGGVGKGCAQALQGFGACIIITETDPISALQAAMEGYEVTTMDEACQEGNIFV TTTTCVNIILGRHFEQMKDDAILCNTEQFDVEIDVRHGVGDKDGDEDHSFVREMDQKIGK YNAFKEVWRERRRREPGLPTVLVGQREFRVGVGLADPAFRAAGRLAGPAGPRQSKMSGYS QIWQCVLLQMIELEETTFLWNDDRIYQKLHTSTQLSWIPVHTMKPEVVMMAFMNLKEAAG FHHTTVSPKKSNPCKKG >gi568815589r:120664562_120888258|GENSCAN_predicted_CDS_2|1134_bp atgcaggagctttactcggcctccaggccactgaagggtgcctgcattgctgactgcctg caaataactgtggagacggccatcctcattgagacccttttctccctgggtgttcaggag cagtggtctagctgcagcatcttctccacccaggaacatgcagtggctgtctttgccgag gctggcatgccagtgttcacctggaagggcaaaatgaaagaggggtacccgtggtgcatt gaacagacactgtacttcaaggacgggcccctcaacatgattctggatgatgggggtgac cttaccaacctcatccacaccaaatggcaccaagtggaccaagacgtgatgattgccagc aaggtagcagtggtagcaggctatggtggtgtgggcaagggctgtgcccaggccttgcag ggttttggggcctgcataatcatcaccgagactgaccccatcagtgcactgcaggctgcc atggaaggctatgaggtgaccaccatggacgaggcctgtcaggagggcaacatctttgtc accaccacaacctgtgtcaatatcatccttggccggcactttgagcagatgaaggatgat gccatcttatgtaatactgaacaatttgacgtggagatcgatgtcaggcatggtgttggg gataaagatggagatgaagatcacagctttgtcagggaaatggaccagaaaataggcaaa tataatgcatttaaggaggtgtggagggagaggcgccggcgggaaccagggctgcccact gtgcttgtgggccagcgcgagttccgcgtgggcgtgggcttggcagaccccgcattcaga gcggctggccggctggccggccctgccggccccaggcagtctaaaatgtcaggttacagt cagatttggcaatgtgtattacttcaaatgattgagctggaggaaactacattcctttgg aatgatgacagaatctatcagaaacttcacacatccactcagctctcctggatccctgtc cacaccatgaagcctgaagttgttatgatggcctttatgaacctgaaagaagccgctggc ttccaccacaccacagtgtcacccaagaaatccaatccttgtaagaaaggatag >gi568815589r:120664562_120888258|GENSCAN_predicted_peptide_3|325_aa MKQLEDHEAFETSSLIGHSARVYALYYKDGLLCTGSDDLSAKLWDVSTGQCVYGIQTHTC AAVKFDEQKLVTGSFDNTVACWEWSSGARTQHFRGHTGAVFSVDYNDELDILVSGSADFT VKVWALSAGTCLNTLTGHTEWVTKVVLQKCKVKSLLHSPGDYILLSADKYEIKIWPIGRE INCKCLKTLSVSEDRSICLQPRLHFDGKYIVCSSALGLYQWDFASYDILRVIKTPEIANL ALLGFGDIFALLFDNRYLYIMDLRTESLISRWPLPEYRKSKRGSSFLAGEASWLNGLDGH NDTGLVFATSMPDHSIHLVLWKEHG >gi568815589r:120664562_120888258|GENSCAN_predicted_CDS_3|978_bp atgaagcaactggaggaccatgaagcctttgaaacctcgtcattaattggacacagtgcc agagtgtatgcactttactacaaagatggacttctctgtacagggtcagatgacttgtct gcaaagctgtgggatgtgagcacagggcagtgcgtttatggcatccagacccacacttgt gcagcggtgaagtttgatgaacagaagcttgtgacaggctcctttgacaacactgtggct tgctgggaatggagttccggagccaggacccagcactttcgggggcacacgggggcggta tttagcgtggactacaatgatgaactggatatcttggtgagcggctctgcagacttcact gtgaaagtatgggctttatctgctgggacatgcctgaacacactcaccgggcacacggaa tgggtcaccaaggtagttttgcagaagtgcaaagtcaagtctctcttgcacagtcctgga gactacatcctcttaagtgcagacaaatatgagattaagatttggccaattgggagagaa atcaactgtaagtgcttaaagacattgtctgtctctgaggatagaagtatctgcctgcag ccaagacttcattttgatggcaaatacattgtctgtagttcagcacttggtctctaccag tgggactttgccagttatgatattctcagggtcatcaagactcctgagatagcaaacttg gccttgcttggctttggagatatctttgccctgctgtttgacaaccgctacctgtacatc atggacttgcggacagagagcctgattagtcgctggcctctgccagagtacaggaaatca aagagaggctcaagcttcctggcaggcgaagcatcctggctgaatggactggatgggcac aatgacacgggcttggtctttgccaccagcatgcctgaccacagtattcacctggtgttg tggaaggagcacggctga >gi568815589r:120664562_120888258|GENSCAN_predicted_peptide_4|380_aa MLSTVYSLADPDLRPLLPVRSLTGERLPVLPASSPRVPQLRGSRRKRLHFRSPPGDVKVL EIKNKARKLNIEPLRSNLSKYYVLSQSEICKGKNIFLLSLIFSSPGNGTRRDLIRKTWGN VTSVQGHPILTLFALGMPVSVTTQKEINKESCKNNDIIEGIFLDSSENQTLKIIAMIQWA VAFCPNALFILKVDEETFVNLPSLVDYLLNLKEHLEDIYVGRVLHQVTPNRDPQNRDFVP LSEYPEKYYPDYCSGEAFIMSQDVARMMYVVFKEVPMMVPADVFVGICAKFIGLIPIHSS RFSGKRHIRYNRCCYKFIFTSSEIADPEMPLAWKEINDGKECTLFETSYELISCKLLTYL DSFKRFHMGTIKNNLMYFAD >gi568815589r:120664562_120888258|GENSCAN_predicted_CDS_4|1143_bp atgcttagcacagtgtacagtctagcggacccggacctgcggccgctgctcccggtccgc agcctcacaggggagcggcttccggtgctgcctgcgtcatctccgcgcgtccctcagctc cgcggctcccggcggaagcggctgcacttccggtccccgcccggagatgtgaaagttctt gaaattaagaataaggcaagaaaattgaacatcgaacccctaagaagtaatctctccaaa tattatgtcctgagccagtcagaaatatgtaaagggaagaacatttttttgctgtctctt atcttcagtagcccaggaaatggaacaagacgggacctcattaggaaaacttggggcaat gtgaccagtgtccaagggcatcccattctcacactgtttgctctgggaatgcctgtttcg gtaactacccagaaagagatcaacaaagaatcctgtaagaataatgatataattgaagga atcttcttggacagttctgagaaccaaaccctgaagatcattgcaatgatacagtgggct gtggctttctgccctaatgccctgttcattctcaaggtggatgaagagacgtttgtcaat ctaccaagcttggtagactatcttctcaatctgaaagaacacctagaagatatctatgta ggaagagttcttcatcaggttacacccaatagagatcctcagaacagagactttgtccct cttagtgagtacccagaaaaatactacccagattactgcagtggtgaggcctttataatg tcccaagatgtggctcgaatgatgtatgtggttttcaaagaagtacccatgatggtgcca gctgatgtgtttgtaggaatttgtgctaagttcattggccttatacccatccacagctca aggttttctgggaaaaggcacattagatacaacagatgttgctataagttcatttttaca tcctcagaaattgcagatcctgaaatgcccctagcatggaaggaaattaatgatggaaaa gaatgtacactgtttgagacatcctatgagctcatttcctgcaaacttctgacgtacctt gacagctttaaacgttttcacatggggaccataaaaaacaatctcatgtattttgctgat tag >gi568815589r:120664562_120888258|GENSCAN_predicted_peptide_5|439_aa MAAQALALLREVARLEAPLEELRALHSVLQAVPLNELRQQAAELRLGPLFSLLNENHRFF TLDEPNAIVKVTKPTIKTVTLIILIGRIVENSDAVTEILNNAELLKQIVYCIGGENLSVA KALIIEISSVSPESLNYCTTSGLVTQLLRELTGEDVLVRATCIEMVTSLAYTHHGRQYLA QEGVIDQISNIIVGADSDPFSSFYLPGFVKFFGNLAVMDSPQQICERYPIFVEKVFEMIE SQDPTMIGVAVDTVGILGSNVEGKQVLQKTGTRFERLLMRIGHQSKNAPVELKIRCLDAI SSLLYLPPEQQTDDLLRMTESWFSSLSRDPLELFRGISSQPFPELHCAALKVFTAIANQP WAQKLMFNSPGFVEYVVDRSVEHDKASKDAKYELVKALANSKTIAEIFGNPNYLRLRTYL SEGPYYVKPVSTTAVEGAE >gi568815589r:120664562_120888258|GENSCAN_predicted_CDS_5|1320_bp atggcagcccaggctttggcgctgctgagagaggtagcgaggctggaagcgccgctggag gagctacgcgcgcttcactccgtgctgcaggcagtgccgctcaacgagcttcgccagcaa gcggcggagctgcgcctcggcccgctcttctccctgcttaacgagaaccatagattcttt acactggatgaaccaaatgctattgtaaaggtcactaaaccaacaattaagacagtaact cttatcatcctgattggaagaattgttgaaaattctgatgctgttactgagattctaaat aatgctgaattactaaaacaaattgtttattgcattggtggagagaatctatctgtagca aaagcgctaattatagagatttcttccgtgtcaccagaatctttaaactactgtaccaca agtggattggtaacccagctcctgagagagctgactggtgaggatgtgttggtcagagcc acctgtatagaaatggtgacatcactggcatatactcatcatgggcgacaatatcttgct caagaaggagtaattgaccaaatttctaatataattgttggggcagattcagaccctttc tctagcttctatctgccaggattcgtgaagttttttggaaacctggctgtcatggatagt cctcaacagatctgtgagcgttatcctatctttgtggaaaaagtctttgaaatgatagaa agtcaggaccccactatgattggtgtagctgtagacacagttggaatcttgggatccaat gttgaaggaaaacaggttttacagaaaacaggaactcgctttgaacgcttgcttatgaga ataggacatcaatcaaagaatgccccagtggagctaaaaattagatgtttggatgcaatt tcatctcttctgtacttaccacctgagcagcagactgatgaccttctgaggatgacagaa tcctggttttcttctttatctcgggatccactggagctcttccgtggcattagtagtcag cccttccctgaactacactgtgctgccttaaaagtgtttacggccattgcaaaccaaccc tgggctcagaaacttatgtttaacagtccaggttttgtagaatatgtggtggaccggtct gtggagcatgacaaagcttcaaaggatgccaaatatgaactagtgaaagcacttgccaat tccaagacaattgcagaaatctttgggaacccaaattatttgaggctcagaacttacctg agtgaagggccatactatgtgaaacctgtttccacgacagcagtagaaggagccgaatga >gi568815589r:120664562_120888258|GENSCAN_predicted_peptide_6|683_aa MRARGAGGGGRGLAALRSPAAAGSNGEAGAAPANYISGTAPHILSFTHSLAQCRRHHPAQ SHRSTHTHHTETGTGLSPCEQIVTTPGAQRHGHKEETRCQGKLMENRALDPGTRDSYGAT SHLPNKGALAKVKNNFKDLMSKLTEGQYVLCRWTDGLYYLGKIKRVSSSKQSCLVTFEDN SKYWVLWKDIQHAGVPGEEPKCNICLGKTSGPLNEILICGKCGLGYHQQCHIPIAGSADQ PLLTPWFCRRCIFALAVRKGGALKKGAIARTLQAVKMVLSYQPEELEWDSPHRTNQQQCY CYCGGPGEWYLRMLQCYRCRQWFHEACTQCLNEPMMFGDRFYLFFCSVCNQGPEYIERLP LRWVDVVHLALYNLGVQSKKKYFDFEEILAFVNHHWELLQLGKLTSTPVTDRGPHLLNAL NSYKSRFLCGKEIKKKKCIFRLRIRVPPNPPGKLLPDKGLLPNENSASSELRKRGKSKPG LLPHEFQQQKRRVYRRKRSKFLLEDAIPSSDFTSAWSTNHHLASIFDFTLDEIQSLKSAS SGQTFFSDVDSTDAASTSGSASTSLSYDSRWTVGSRKRKLAAKAYMPLRAKRWAAELDGR CPSDSSAEGASVPERPDEGIDSHTFESISEDDSSLSHLKSSITNYFGAAGRLACGEKYQV LARRVTPEGKVQYLVEWEGTTPY >gi568815589r:120664562_120888258|GENSCAN_predicted_CDS_6|2052_bp atgcgcgcccgcggggcgggcgggggcgggcgggggctggcggcgctgcggagcccggcg gccgcgggctccaatggcgaggccggcgcggcccccgctaattacataagcgggacggcc ccgcacatacttagcttcacacactccctggcacaatgccgtagacatcatcctgcccag tctcacaggagtacacacacgcaccacacagaaacaggcactggactctctccctgtgaa cagatagtcacaacaccaggtgcccagcgccatggccacaaggaggagacccgatgtcag gggaagctgatggagaatcgagctctggatccagggactcgggactcctatggtgccacc agccacctccccaacaagggggccctggcgaaggtcaagaacaacttcaaagacttgatg tccaaactgacggagggccagtatgtgctgtgccggtggacagatggcctgtactacctc gggaagatcaagagggtcagcagctctaagcaaagctgcctcgtgactttcgaagataat tccaaatactgggtcctatggaaggacatacagcatgccggtgttccaggagaggagccc aagtgcaacatctgcctagggaagacatcagggccgctgaatgagatcctcatctgcggg aagtgtggcctgggttaccaccagcagtgccacatccccatagcgggcagtgctgaccag cccctgctcacaccttggttctgccgacgctgcatcttcgcactggctgtgcggaaaggc ggcgcgctgaagaagggcgccatcgccaggacgctgcaggccgtgaagatggtgctgtcc taccagcccgaggagctcgagtgggactcgccccatcgcaccaaccagcagcaatgctac tgctactgcggcgggcccggagaatggtacctgcggatgctgcaatgttaccggtgcagg cagtggttccacgaggcctgcacccagtgcctcaatgagcccatgatgtttggagaccgg ttttacctgttcttctgctccgtgtgtaaccagggcccagagtacatcgagaggctgccc ctgcgatgggtggatgtggttcacctggccctctataatctgggggtacagagcaagaag aagtactttgactttgaggagattctggcctttgtcaaccaccactgggagctcctgcag cttggcaagctcaccagcaccccagtgacagatcgaggaccacatctcctcaacgctctg aacagttataaaagccggttcctctgcggcaaggagatcaagaagaagaagtgcatcttc cgcctgcgcatccgcgtcccacccaacccgccagggaagctgctgcctgacaaaggactg ctgccaaatgagaacagcgcctcctctgagctgcgtaagagaggaaagagcaagcctggt ttgttgcctcacgaattccagcagcagaaaaggcgagtttatagaagaaaaagatcaaag tttttgctggaagatgctattcccagtagtgacttcacctcagcctggagcaccaaccac cacctggctagcatatttgacttcacgctggatgaaattcaaagtttaaaaagtgccagc tcaggccagaccttcttctcagatgtcgactccaccgacgctgccagcacctctggctct gcctccaccagcctctcctatgactccagatggacagtgggcagccgaaagaggaagctg gcagccaaggcatacatgcccctgcgggcaaagcggtgggcagctgagctggatggacgc tgcccctcggacagcagtgcagagggggcttcagtccccgagcggccagacgaaggcatt gacagccacacatttgagagcatcagtgaagatgactcatccctgtcccacctcaagtca tctatcaccaactactttggtgcagctgggcggttggcctgtggggagaagtaccaggtg ttggctcggagggtcacacctgagggcaaggttcagtacctggtggagtgggaagggacc accccttactga