GENSCAN 1.0 Date run: 4-Nov-116 Time: 03:19:46 Sequence gi568815587r:75166166_75387377 : 221212 bp : 50.03% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 3013 3241 229 0 1 46 85 366 0.974 28.63 1.02 Intr + 3516 3599 84 1 0 61 105 82 0.533 6.99 1.03 Intr + 6214 6404 191 2 2 130 121 136 0.968 20.40 1.04 Intr + 20945 20988 44 1 2 67 123 -32 0.086 -4.76 1.05 Intr + 21355 21562 208 1 1 -12 50 213 0.303 6.58 1.06 Intr + 21971 22073 103 1 1 123 115 110 0.996 16.95 1.07 Intr + 27053 27410 358 0 1 126 109 470 0.898 47.21 1.08 Intr + 30349 30514 166 1 1 92 109 131 0.999 15.56 1.09 Intr + 34059 34222 164 2 2 112 67 164 0.969 15.47 1.10 Intr + 36736 36800 65 1 2 58 38 81 0.702 -1.54 1.11 Intr + 37142 37262 121 0 1 51 97 185 0.893 15.25 1.12 Intr + 38235 38411 177 0 0 104 39 134 0.158 9.13 1.13 Intr + 46147 46237 91 1 1 80 55 64 0.247 2.30 1.14 Intr + 70845 70982 138 2 0 34 80 91 0.429 3.56 1.15 Term + 74720 76033 1314 1 0 59 53 2567 0.598 241.88 1.16 PlyA + 81307 81312 6 1.05 2.21 PlyA - 82350 82345 6 1.05 2.20 Term - 100109 99998 112 1 1 86 42 188 0.913 12.03 2.19 Intr - 101538 101487 52 1 1 55 80 47 0.680 -1.43 2.18 Intr - 103049 102724 326 1 2 67 105 216 0.853 16.62 2.17 Intr - 106813 106730 84 0 0 110 100 93 0.999 11.64 2.16 Intr - 108046 107909 138 0 0 83 48 155 0.982 10.58 2.15 Intr - 110746 110674 73 2 1 139 80 115 0.999 14.26 2.14 Intr - 111331 111199 133 1 1 92 68 247 0.493 23.32 2.13 Intr - 112579 112444 136 0 1 77 75 187 0.940 16.87 2.12 Intr - 114977 114910 68 2 2 74 101 132 0.961 10.80 2.11 Intr - 115856 115797 60 2 0 99 100 10 0.718 2.23 2.10 Intr - 117318 117122 197 1 2 101 55 420 0.972 39.13 2.09 Intr - 117658 117563 96 2 0 133 59 21 0.923 3.68 2.08 Intr - 118114 118070 45 2 0 93 80 61 0.945 4.18 2.07 Intr - 121210 121150 61 1 1 122 92 185 0.783 20.61 2.06 Intr - 123874 123844 31 0 1 122 113 36 0.901 7.63 2.05 Intr - 137480 137434 47 2 2 111 98 54 0.035 5.91 2.04 Intr - 146527 146498 30 1 0 134 45 39 0.291 2.53 2.03 Intr - 148911 148777 135 0 0 74 47 113 0.813 6.56 2.02 Intr - 154752 154697 56 1 2 89 28 41 0.176 -3.10 2.01 Init - 160503 160449 55 0 1 81 110 50 0.521 5.99 2.00 Prom - 169745 169706 40 0.44 3.05 PlyA - 170844 170839 6 1.05 3.04 Term - 174279 174174 106 2 1 116 39 80 0.500 3.78 3.03 Intr - 185486 185423 64 0 1 33 117 100 0.018 5.18 3.02 Intr - 186290 186145 146 1 2 108 23 71 0.016 2.53 3.01 Init - 211091 211072 20 2 2 114 103 -17 0.026 2.05 3.00 Prom - 220119 220080 40 -2.56 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term + 38235 38415 181 0 1 104 49 142 0.840 8.98 S.002 Term - 126483 126367 117 0 0 125 39 61 0.825 3.44 S.003 Init - 180834 180766 69 0 0 80 73 97 0.871 7.18 S.004 Term + 218369 218561 193 2 1 104 41 116 0.901 5.39 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815587r:75166166_75387377|GENSCAN_predicted_peptide_1|1150_aa MPQDFKASLCLPTTSAPASAPSNGNCSSYTETQHLSVVGIMFVAQTLLGVGGVPIQPFGI SYIDDFAHNSNSPLYLVTMMGPGLAFGLGSLMLRLYVDINQMPEGGISLTIKDPRWVGAW WLGFLIAAGAVALAAIPYFFFPKEMPKEKRELQFRRKVLAVTDSPARKLLVGTYILQEVV RFRGDGDEAGDTDLGEDSGSDDDVRDSDNDIDDSDHHHGGSDSEGSGDSDNDDCAGGGDH DRDNNNNHHIVMGKDSPSKQSPGESTKKQDGLVQIAPNLTVIQFIKVFPRVLLQTLRHPI FLLVVLSQVCLSSMAAGMATFLPKFLERQFSITASYANLLIGCLSFPSVIVGIVVGGVLV KRLHLGPVGCGALCLLGMLLCLFFSLPLFFIGCSSHQIAGITHQTSAHPGLELSPSCMEA CSCPLDGFNPVCDPSTRVEYITPCHAGCSSWVVQDALDNSQVFYTNCSCVVEGNPVLAGS CDSTCSHLVVPFLLLVSLGSALACLTHTPSFMLILRGVKKEDKTLAVGIQFMFLRILAWM PSPVIHGSAIDTTCVHWALSCGRRAVCRYYNNDLLRNRFIGLQFFFKTGSVICFALVLAV LRQQDKEARTKESRSSPAVEQQLLVSGPGKKPEDSRVDKETEAQKLEVQDPDYKAILPSS KRKYPDKSWRRGYLLGDLRDVITLCAPYQGLVELINSRVCASRKGPAKALEAQSASKCGS SDFRDSERTDRDCQPLRVKDSPHPCPPPGAPNSLVSAAARALDAGAAAMAPRAGQPGLQG LLLVAAALSQPAAPCPFQCYCFGGPKLLLRCASGAELRQPPRDVPPDARNLTIVGANLTV LRAAAFAGGDGDGDQAAGVRLPLLSALRLTHNHIEVVEDGAFDGLPSLAALDLSHNPLRA LGGGAFRGLPALRSLQLNHALVRGGPALLAALDAALAPLAELRLLGLAGNALSRLPPAAL RLARLEQLDVRLNALAGLDPDELRALERDGGLPGPRLLLADNPLRCGCAARPLLAWLRNA TERVPDSRRLRCAAPRALLDRPLLDLDGARLRCADSGADARGEEAEAAGPELEASYVFFG LVLALIGLIFLMVLYLNRRGIQRWMRNLREACRDQMEGYHYRYEQDADPRRAPAPAAPAG SRATSPGSGL >gi568815587r:75166166_75387377|GENSCAN_predicted_CDS_1|3453_bp atgccacaggacttcaaggcttccctgtgcctgcccacaacctcggccccagcctcggcc ccctccaatggcaactgctcaagctacacagaaacccagcatctgagtgtggtggggatc atgttcgtggcacagaccctgctgggcgtgggcggggtgcccattcagccctttggcatc tcctacatcgatgactttgcccacaacagcaactcgcccctctacctcgtgaccatgatg gggccaggcctggcctttgggctgggcagcctcatgctgcgcctttatgtggacattaac cagatgccagaaggtggtatcagcctgaccataaaggacccccgatgggtgggtgcctgg tggctgggtttcctcatcgctgccggtgcagtggccctggctgccatcccctacttcttc ttccccaaggaaatgcccaaggaaaaacgtgagcttcagtttcggcgaaaggtcttagca gtcacagactcacctgccaggaagttgttggtagggacgtatatacttcaggaggttgtg aggtttcgtggcgatggagatgaggctggtgatactgatcttggtgaagacagtggtagt gatgatgatgttagagacagtgataatgatattgatgattctgatcatcatcatggtggt agtgattctgaaggttcaggtgacagtgataatgatgattgtgctggtggtggtgatcat gacagagacaacaataacaatcatcacatcgtgatgggcaaggactctccctctaagcag agccctggggagtccacgaagaagcaggatggcctagtccagattgcaccaaacctgact gtgatccagttcattaaagtcttccccagggtgctgctgcagaccctacgccaccccatc ttcctgctggtggtcctgtcccaggtatgcttgtcatccatggctgcgggcatggccacc ttcctgcccaagttcctggagcgccagttttccatcacagcctcctacgccaacctgctc atcggctgcctctccttcccttcggtcatcgtgggcatcgtggtgggtggcgtcctggtc aagcggctccacctgggccctgtgggatgcggtgccctttgcctgctggggatgctgctg tgcctcttcttcagcctgccgctcttctttatcggctgctccagccaccagattgcgggc atcacacaccagaccagtgcccaccctgggctggagctgtctccaagctgcatggaggcc tgctcctgcccattggacggctttaaccctgtctgcgaccccagcactcgtgtggaatac atcacaccctgccacgcaggctgctcaagctgggtggtccaggatgctctggacaacagc caggttttctacaccaactgcagctgcgtggtggagggcaaccccgtgctggcaggatcc tgcgactcaacgtgcagccatctggtggtgcccttcctgctcctggtcagcctgggctcg gccctggcctgtctcacccacacaccctccttcatgctcatcctaagaggagtgaagaaa gaagacaagactttggctgtgggcatccagttcatgttcctgaggattttggcctggatg cccagccccgtgatccacggcagcgccatcgacaccacctgtgtgcactgggccctgagc tgtgggcgtcgagctgtctgtcgctactacaataatgacctgctccgaaaccggttcatc ggcctccagttcttcttcaaaacaggttctgtgatctgcttcgccttagttttggctgtc ctgaggcagcaggacaaagaggcaaggaccaaagagagcagatccagccctgccgtagag cagcaattgctagtgtcggggccagggaagaagccagaggattcccgagtggataaggag actgaggcccagaaacttgaggtccaggatcctgactataaagccatattgcctagcagt aaaaggaagtatccagacaagagctggagaaggggctacctgctgggtgatctcagggac gtcatcacactctgtgctccctaccaaggcttagtggagctgatcaacagcagggtctgt gccagcagaaaaggcccagcgaaggcactagaggctcagtcagccagtaagtgcggctcc tcagactttcgagacagcgaacggaccgaccgggactgccagccgctccgggtcaaggac tcgccccacccgtgccccccaccaggcgctcccaactcactggtgagcgcggcggcccgg gcgctggatgcgggggcggccgcgatggccccgcgcgcgggacagccggggctccagggg ctgctgctcgtggcggcggcgctgagccagcccgcggcaccctgccccttccagtgctac tgcttcggcggccccaagctgctgctgcgctgcgcgtcgggagccgagctccgccagcct ccgcgggacgtgccgcccgacgcgcgcaacctcaccatcgtaggcgccaacctgacggtg ctgcgcgcggccgccttcgccggcggggacggggacggcgaccaggcggcgggcgtgcgc ctgccgctcctgagcgcgctgcgcctcacgcacaaccacatcgaggtggtggaggacggc gccttcgacgggctgcccagcctggcggcgctcgacctcagccacaacccgctgcgcgcc ctgggcggcggcgccttccgcgggctgcccgcgctgcgctcgctgcagctcaaccacgcg ctggtgcgcggcggccccgcgctgctggccgcgctggacgctgcgctggcaccgctggcc gagcttcgcctgctgggcctagcgggcaacgcgctgagccgtctgccgccagccgccctg cgcctggcgcgcctggagcagctggacgtgcgcctcaacgcgctggccggcctggacccc gacgagctgcgcgcgctggagcgcgatggcggcctccccgggccgcgcctgctgctcgcc gacaaccccctgcgctgcggctgtgccgcacgccccctgctggcctggctgcgcaacgcc acggagcgcgtgcccgactcgcggcgcctgcgctgcgccgccccgcgggcgctgctagac cggccgctactggacctggacggggcgcggcttcgctgcgcggacagcggcgccgacgct cgcggagaggaggcggaggccgccggcccggagctggaagcctcctacgtgttcttcggg ctggtgctggcactcatcggcctcatcttcctcatggtgctctacctaaaccgccgcggc atccagcgctggatgcgcaacctgcgcgaggcgtgccgggaccagatggagggctaccac taccgctacgagcaggacgccgacccgcgccgcgcgcccgcgcccgccgcgcccgcgggc tcccgcgccacctccccgggctcggggctctga >gi568815587r:75166166_75387377|GENSCAN_predicted_peptide_2|644_aa MGVRVRLLETVCGREAVAASCPTRGKGPHEGFANGNERGVVVVVVEAALTGTQASRVQAL LSQWKEQLSSTTGLSVHRALNQDPCDDIGPTQVLWGNNTDTGACTFCRVFKKASPNGKLT VYLGKRDFVDHIDLVDPVDGVVLVDPEYLKERRGPLPPPQACQTTPAPAKPVRPFPYPST YALGRVYVTLTCAFRYGREDLDVLGLTFRKDLFVANVQSFPPAPEDKKPLTRLQERLIKK LGEHAYPFTFEIPPNLPCSVTLQPGPEDTGKACGVDYEVKAFCAENLEEKIHKRNSVRLV IRKVQYAPERPGPQPTAETTRQFLMSDKPLHLEASLDKENLAVLPANPVPSYAPQIYYHG EPISVNVHVTNNTNKTVKKIKISVRQYADICLFNTAQYKCPVAMEEADDTVAPSSTFCKV YTLTPFLANNREKRGLALDGKLKHEDTNLASSTLLREGANREILGIIVSYKVKVKLVVSR GGPWEPREPSQERGGFHFFLGPEEVSSSSSQADRGSEWQNWGWRASFGSTSSDGKTLCSP RTDRASTSSLTACESAQGSLGLGLCFSDVAVELPFTLMHPKPKEEPPHREVPENETPVDT NLIELDTNDDDIVFEDFARQRLKGMKDDKEEEEDGTGSPQLNNR >gi568815587r:75166166_75387377|GENSCAN_predicted_CDS_2|1935_bp atgggcgttagggtgagactgctggagacggtgtgtgggagagaggctgtggcagcctcc tgtcccaccagaggaaaaggcccccatgaaggctttgcgaatggaaatgaacgtggggtg gtggtggtggtggtggaagcagcactaactgggacccaggcgtcccgagtgcaggccctg ctctctcaatggaaagagcagctgtcatctaccacgggcctctctgttcaccgggcactg aaccaggacccttgtgatgacattgggcccacccaggtgctctggggaaacaacaccgac accggggcctgtaccttctgcagagtgttcaagaaggccagtccaaatggaaagctcacc gtctacctgggaaagcgggactttgtggaccacatcgacctcgtggaccctgtggatggt gtggtcctggtggatcctgagtatctcaaagagcggagaggtcccctacccccaccccag gcctgccagactactccagcccccgccaagcctgtcagaccgtttccatacccctccaca tatgccctggggagagtctatgtgacgctgacctgcgccttccgctatggccgggaggac ctggatgtcctgggcctgacctttcgcaaggacctgtttgtggccaacgtacagtcgttc ccaccggcccccgaggacaagaagcccctgacgcggctgcaggaacgcctcatcaagaag ctgggcgagcacgcttaccctttcacctttgagatccctccaaaccttccatgttctgtg acactgcagccggggcccgaagacacggggaaggcttgcggtgtggactatgaagtcaaa gccttctgcgcggagaatttggaggagaagatccacaagcggaattctgtgcgtctggtc atccggaaggttcagtatgccccagagaggcctggcccccagcccacagccgagaccacc aggcagttcctcatgtcggacaagcccttgcacctagaagcctctctggataaggagaac cttgctgtcctcccagccaaccccgtcccttcttatgccccccagatctattaccatgga gaacccatcagcgtcaacgtccacgtcaccaacaacaccaacaagacggtgaagaagatc aagatctcagtgcgccagtatgcagacatctgccttttcaacacagctcagtacaagtgc cctgttgccatggaagaggctgatgacactgtggcacccagctcgacgttctgcaaggtc tacacactgacccccttcctagccaataaccgagagaagcggggcctcgccttggacggg aagctcaagcacgaagacacgaacttggcctctagcaccctgttgagggaaggtgccaac cgtgagatcctggggatcattgtttcctacaaagtgaaagtgaagctggtggtgtctcgg ggcggcccgtgggaacccagagagccttctcaggaaagaggaggcttccatttctttttg ggaccagaggaagtctcctcgtccagctcccaggcagaccgaggctccgagtggcagaac tggggctggagggcatcttttggatctacctcctctgacggcaaaaccctctgcagtcct cggactgacagggcatcgacatcaagtctcacagcctgtgagtccgctcaaggctcactg ggtctgggtctctgtttcagcgacgtggccgtggaactgcccttcaccctaatgcacccc aagcccaaagaggaacccccgcatcgggaagttccagagaacgagacgccagtagatacc aatctcatagaacttgacacaaatgatgacgacattgtatttgaggactttgctcgccag agactgaaaggcatgaaggatgacaaggaggaagaggaggatggtaccggctctccacag ctcaacaacagatag >gi568815587r:75166166_75387377|GENSCAN_predicted_peptide_3|111_aa MPSQHLCFLFLGGLLQPLLAAGLRSVPQCQGRLLPPPPLLPVPVVWLGVISPVEGAWALT PRTSLRPSRTMGDKGTRGALEEVLHCSEFLHPEGRGEHPPALDSGSAAHTV >gi568815587r:75166166_75387377|GENSCAN_predicted_CDS_3|336_bp atgcccagccaacatctgtgcttcctcttcctgggcggcctcctgcagcccctgctggcc gcgggtctccgctcagttccccagtgtcaaggccgtctactgccccctccgccccttctt cctgtccctgttgtgtggctgggtgttatctctccagtagaaggggcctgggcgctgacg ccgcggacctccctgcgaccgtcgcggaccatgggcgacaaagggacccgaggggccttg gaggaggtgctgcattgctcagaattcctgcacccagaaggtcgtggggagcatccccct gccttggatagtggtagtgccgcccacactgtgtag