GENSCAN 1.0 Date run: 3-Nov-116 Time: 09:36:38 Sequence gi568815578r:37883459_38113816 : 230358 bp : 45.88% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 19893 20013 121 2 1 96 113 250 0.999 26.65 1.02 Intr + 24661 24842 182 2 2 8 72 166 0.730 6.59 1.03 Intr + 31202 31317 116 1 2 52 77 73 0.844 1.85 1.04 Intr + 39371 39425 55 2 1 54 68 62 0.010 -0.22 1.05 Intr + 41195 41357 163 1 1 24 64 52 0.000 -4.05 1.06 Intr + 48177 48346 170 1 2 97 95 211 0.742 22.37 1.07 Intr + 53301 53366 66 2 0 107 73 9 0.164 0.40 1.08 Term + 60523 60795 273 0 0 103 38 542 0.986 46.17 1.09 PlyA + 61872 61877 6 1.05 2.00 Prom + 75327 75366 40 -2.86 2.01 Init + 87640 87659 20 0 2 113 61 19 0.754 1.23 2.02 Intr + 88630 88676 47 1 2 122 95 14 0.575 3.55 2.03 Intr + 89080 89208 129 2 0 61 84 27 0.375 0.27 2.04 Intr + 93959 94068 110 0 2 53 94 64 0.615 3.50 2.05 Term + 95459 95578 120 1 0 39 40 102 0.472 -1.03 2.06 PlyA + 98571 98576 6 1.05 3.09 PlyA - 98690 98685 6 1.05 3.08 Term - 100181 99998 184 1 1 126 46 271 0.973 23.72 3.07 Intr - 113004 112917 88 1 1 128 94 69 0.811 10.53 3.06 Intr - 113495 113291 205 2 1 54 80 160 0.980 10.57 3.05 Intr - 115870 115730 141 1 0 109 79 32 0.915 4.95 3.04 Intr - 117061 116947 115 0 1 17 70 120 0.781 3.55 3.03 Intr - 119318 119123 196 0 1 69 70 151 0.582 9.87 3.02 Intr - 122939 122739 201 0 0 53 88 150 0.878 10.76 3.01 Init - 130358 128057 2302 2 1 77 56 970 0.967 80.24 3.00 Prom - 134604 134565 40 -5.36 4.00 Prom + 137744 137783 40 -4.16 4.01 Init + 150490 150640 151 0 1 102 92 283 0.536 30.30 4.02 Intr + 156977 157106 130 0 1 105 91 29 0.979 4.65 4.03 Intr + 164890 165023 134 1 2 64 70 131 0.876 9.19 4.04 Intr + 174074 174186 113 0 2 92 74 114 0.984 10.50 4.05 Intr + 175936 176062 127 0 1 81 94 79 0.918 8.05 4.06 Intr + 182623 182798 176 2 2 53 105 230 0.988 20.86 4.07 Intr + 195872 195903 32 1 2 99 91 8 0.476 -0.87 4.08 Intr + 200360 200474 115 2 1 58 82 100 0.621 6.85 4.09 Intr + 204849 204962 114 2 0 123 66 75 0.955 9.44 4.10 Term + 206268 206417 150 2 0 112 45 102 0.996 6.21 4.11 PlyA + 208883 208888 6 1.05 5.00 Prom + 223967 224006 40 -2.46 5.01 Init + 229522 229563 42 0 0 79 72 76 0.072 5.59 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Intr - 42107 42036 72 0 0 76 97 106 0.951 9.68 S.002 Init + 47338 47560 223 0 1 69 1 134 0.838 1.52 S.003 Intr - 102016 101972 45 1 0 88 107 55 0.871 6.01 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815578r:37883459_38113816|GENSCAN_predicted_peptide_1|381_aa MGAPLAVALGALHYLALFLQLGGATRPAGHAPWDNHVSGHAVFPKSFTPIGFDTLPISSS CSLFPKQEKPQAASRTQGIWLLAMVTARMVGKVACGKQVGRRSQKHMTCQSRIARPAAGI GLPFNTPTLQRGKLRLREAWSHGSTRGNEHREELEDRWRLSVHHHNQQQQQQHQNTHQSV YFPVSGAGAVSSLRVRRGRKTLLLIELPAFSGALFTETPHDMTARTGEDVEMACSFRGSG SPSYSLEIQWWYVRSHRDWTDKQAWASNQLAVVEGIYENALETVKCHGLGYVVKVVGSNI SHKLRLSRVKPTDEGTYECRVIDFSDGKARHHKVKAYLRVQPGENSVLHLPEAPPAAPAP PPPKPGKELRKRSVDQEACSL >gi568815578r:37883459_38113816|GENSCAN_predicted_CDS_1|1146_bp atgggggccccgctcgccgtagcgctgggcgccctccactacctggcacttttcctgcaa ctcggcggcgccacgcggcccgccggccacgcgccctgggacaaccacgtctccggccac gcggttttccccaagtccttcacaccaattggttttgacaccttgcccatcagcagctcc tgcagccttttccccaaacaagagaagccgcaggcggccagcaggacgcagggaatttgg ctccttgccatggtgacagcaaggatggtgggcaaagtcgcctgcggcaaacaagtgggc aggagaagccagaagcacatgacctgtcagagccgcatcgcacggcccgcagcaggcatc gggctcccattcaacacgcccaccttgcagaggggaaaactgaggcttagagaggcgtgg agccatgggagcaccagaggaaatgagcacagagaagagctagaggaccggtggagatta tcagtacatcatcataatcagcagcagcagcagcagcatcaaaatacacatcaatctgtg tattttccagtttctggtgctggtgcagtatcctcactcagagttaggagagggaggaag accctgttgctgattgagctgcctgctttcagtggggccctgttcacagagacaccccat gacatgacagcacggacgggcgaggacgtggagatggcctgctccttccgcggcagcggc tccccctcctactcgctggagatccagtggtggtatgtacggagccaccgggactggacc gacaagcaggcgtgggcctcgaaccagttagctgttgttgaaggcatatatgaaaacgct ttggaaactgtaaagtgccatggattaggctatgtggtcaaggtggtgggcagcaacatc tcccacaagctgcgcctgtcccgggtgaagcccacggacgaaggcacctacgagtgccgc gtcatcgacttcagcgacggcaaggcccggcaccacaaggtcaaggcctacctgcgggtg cagccaggggagaactccgtcctgcatctgcccgaagcccctcccgccgcgcccgccccg ccgccccccaagccaggcaaggagctgaggaagcgctcggtggaccaggaggcctgcagc ctctag >gi568815578r:37883459_38113816|GENSCAN_predicted_peptide_2|141_aa MVYRYTQARSGPPVAHCPSSLAYFPEISKAGAHKLEIPWVRTLVFAASGTLGTRPFSSGW EWPGKESTITQNYFLKFGLYSLLWRKGSTGTWLPIKPDAQDWSKTLFQQQQQQQQQQQQQ QQDCKKKKVKEDATETFMYGL >gi568815578r:37883459_38113816|GENSCAN_predicted_CDS_2|426_bp atggtctacagatacactcaggcaagatcaggtcctcctgttgcacattgtcccagctcc ctggcctactttccagaaatcagcaaagcaggtgctcacaagctggaaataccctgggtt aggaccctagtgtttgctgccagtgggaccttaggaaccaggcccttcagctctggctgg gagtggcctggcaaggagtccaccatcactcagaattattttctgaaatttggcctctat tccttactctggaggaaaggcagtacgggcacctggcttccaatcaagcccgacgcacag gactggagtaagaccctgtttcaacaacaacagcaacaacaacaacaacaacaacaacaa caacaagattgtaagaaaaaaaaagtcaaagaagatgccacagagacctttatgtatggc ctgtaa >gi568815578r:37883459_38113816|GENSCAN_predicted_peptide_3|1143_aa MAVFDTPEEAFGVLRPVCVQLTKTQTVENVEHLQTRLQAVSDSALQELQQYILFPLRFTL KTPGPKRERLIQSVVECLTFVLSSTCVKEQELLQELFSELSACLYSPSSQKPAAVSEELK LAVIQGLSTLMHSAYGDIILTFYEPSILPRLGFAVSLLLGLAEQEKSKQIKIAALKCLQV LLLQCDCQDHPRSLDELEQKQLGDLFASFLPGISTALTRLITGDFKQGHSIVVSSLKIFY KTVSFIMADEQLKRISKVQAKPAVEHRVAELMVYREADWVKKTGDKLTILIKKIIECVSV HPHWKVRLELVELVEDLLLKCSQSLVECAGPLLKALVGLVNDESPEIQAQCNKVLRHFAD QKVVVGNKALADILSESLHSLATSLPRLMNSQDDQGKFSTLSLLLGYLKLLGPKINFVLN SVAHLQRLSKALIQVLELDVADIKIVEERRWNSDDLNASPKTSATQPWNRIQRRYFRFFT DERIFMLLRQVCQLLGYYGNLYLLVDHFMELYHQSVVYRKQAAMILNELVTGAAGLEVED LHEKHIKTNPEELREIVTSILEEYTSQENWYLVTCLETEEMGEELMMEHPGLQAITSGEH TCQVTSFLAFSKPSPTICSMNSNIWQICIQLEGIGQFAYALGKDFCLLLMSALYPVLEKA GDQTLLISQVATSTMMDVCRACGYDSLQHLINQNSDYLVNGISLNLRHLALHPHTPKVLE VMLRNSDANLLPLVADVVQDVLATLDQFYDKRAASFVSVLHALMAALAQWFPDTGNLGHL QEQSLGEEGSHLNQRPAALEKSTTTAEDIEQFLLNYLKEKDVADGNVSDFDNEEEEQSVP PKVDENDTRPDVEPPLPLQIQIAMDVMERCIHLLSDKNLQIRLKVSAQLLLCIPRVAGLR LGVHVGGKPALAAHAGGKPALAVVLTAACDTDAGSLFSVLDVLDLCVVVLQSHKNQLLPL AHQAWPSLVHRLTRDAPLAVLRAFKVLRTLGSKCGDFLRSRFCKDVLPKLAGSLVTQAPI SARAGPVYSHTLAFKLQLAVLQGLGPLCERLDLGEGDLNKVADACLIYLSVKQPVKLQEA ARSVFLHLMKVDPDSTWFLLNELYCPVQFTPPHPSLHPVQLHGASGQQNPYTTNVLQLLK ELQ >gi568815578r:37883459_38113816|GENSCAN_predicted_CDS_3|3432_bp atggcagtttttgatactcctgaggaggcctttggtgtcttacgtccagtctgtgttcag ctcacaaagacccagacagtggagaatgtggagcatctgcagacacgactacaagctgtg agtgacagtgcccttcaggaacttcagcagtacatcctcttccctctgcgatttaccctg aagaccccaggtcccaaaagagagcgtttgatccaaagtgtggtggaatgcctcacattt gtcctttcttcaacatgtgtgaaagaacaggagcttctccaggaactcttttcagaactc tctgcttgtctgtattcacccagctcccaaaaacctgcggctgtgtccgaggagttgaaa ttggctgtgatccagggacttagcacattaatgcactcagcttatggggacatcattctg actttttatgagccctccattctgccacgtttaggatttgctgtatctttactgttaggc cttgcagaacaggagaaatcaaagcaaattaaaattgctgccttaaaatgtttacaggtt ctactcttgcagtgtgattgtcaggaccatccaaggtcattggatgaacttgaacaaaag cagctgggggatttgtttgcctcttttttacctggaatctcaactgcactgaccaggctt atcacaggagactttaaacaaggtcacagcattgtcgtatcttccctaaagatcttttac aagacagtgagcttcattatggctgatgaacagctcaaaagaatctcaaaggtccaagca aaacctgcagttgagcacagagtagcagagctgatggtttacagggaagcagattgggta aaaaagactggcgacaagttgactatccttattaaaaagataattgagtgtgtttctgtt cacccacactggaaggtgagactggaactggtagaacttgtggaggaccttcttttgaag tgcagtcaatcattggtcgaatgtgctggtccccttctgaaggccttagtgggactagta aatgatgagagtcctgaaatccaagcccagtgcaataaagttctgagacattttgcagat caaaaagtagtggtgggcaacaaagccctcgctgacatcttgtcagaaagcctgcattcc cttgccacatctcttcctcgcctaatgaactcccaagatgaccagggcaaattctctact ctttccttgttacttggttatctgaaactcttgggcccaaaaataaactttgtcctcaac tctgtggcccatctccagcggctttccaaagcactcatccaagttctagagctagacgtg gctgacatcaagattgttgaggaacggcgttggaactctgatgatctgaatgcttctcca aagacctcagccacacagccttggaaccgcatccagaggagatatttccgcttcttcact gatgagagaatcttcatgctcttgaggcaggtttgtcagctacttggttattatgggaat ctttatttgcttgtggatcactttatggaactttaccatcaatctgtggtttaccggaag caagctgccatgatccttaatgaactggttacaggggctgctgggctggaggttgaggat cttcacgaaaaacatattaaaacaaacccagaagaactgagagagattgtgacatctata cttgaagaatacacaagtcaagaaaattggtatttggttacctgtcttgaaactgaggaa atgggagaggagctgatgatggagcacccaggcctccaagccatcacgtctggtgaacac acctgccaagttacatcttttctagccttctcaaagccaagtcccactatttgctccatg aacagtaacatctggcaaatatgcattcagttggaaggaattggccagtttgcatatgca ctaggaaaagacttctgtttgctcttgatgtcagccctttatccagtactggagaaggct ggagaccaaaccctactcattagtcaggtggctaccagcaccatgatggacgtttgccgt gcttgtggctacgactccctgcagcacctgatcaatcaaaattcagactatttagtgaat gggatctctttaaatctgcgtcatctggctctgcatcctcataccccaaaggtcctggaa gtcatgctgcggaactcagatgctaacctgcttcctttggtggcagatgtggttcaagat gtcttggccaccctggaccaattttacgataagagagctgcttcctttgtcagcgttctg catgctctgatggcagcattagcccagtggttcccagacacaggtaatcttgggcacctc caagagcaaagtttaggagaagagggaagtcatttgaaccaaagaccagcagctcttgag aagagcaccaccacagctgaagacatcgaacagtttttgctgaactacctcaaagagaag gatgtggcagatggaaatgtctcggattttgataatgaagaagaggaacagtcagtccct cccaaagtggatgagaatgacacccgtccagatgtggagccaccactgccattgcagatc caaatagccatggacgtgatggaacgctgcatccacttgttgtcagataaaaatctgcaa atccgcctgaaggtcagtgcgcagctgcttctctgcattcccagagtagcaggactaagg ttgggcgtccatgtgggagggaagcctgcgctggccgcccacgcgggagggaagcctgcg ctggccgtcgtactgaccgctgcctgtgacactgatgcaggcagtctcttttcagtcttg gatgtgctggatctgtgtgtggttgttcttcagtcccacaaaaaccagctgcttcccttg gctcatcaggcctggccctcgctcgttcaccgactcacacgggacgcccccctggcagtg cttagagccttcaaggttttacgtaccctgggaagcaagtgtggtgactttcttcgcagc cggttctgcaaagatgtcctgccaaagctggctggctccctagtcacccaggcccccatc agtgccagggctggaccagtttactcgcacacgctggccttcaagttgcagctggctgtc ttacagggcctgggccccctctgtgagagactggacctaggtgagggtgacctgaataaa gtggctgatgcctgcttgatttacctcagtgtcaaacagcccgtgaaattacaagaggct gccaggagcgtcttcctccacttgatgaaggtggacccagactccacctggttcctcctg aacgagctttactgccccgtgcagttcacacctccccaccccagcctccaccctgtgcag ctgcacggggccagcgggcagcagaacccctacacgaccaacgtgctccagctgctcaag gagctgcagtga >gi568815578r:37883459_38113816|GENSCAN_predicted_peptide_4|413_aa MSSFSESALEKKLSELSNSQQSVQTLSLWLIHHRKHAGPIVSVWHRELRKAKSNRKLTFL YLANDVIQNSKRKGPEFTREFESVLVDAFSHVAREADEGCKKPLERLLNIWQERSVYGGE FIQQLKLSMEDSKSPPPKATEEKKSLKRTFQQIQEEEDDDYPGSYSPQDPSAGPLLTEEL IKALQDLENAASGDATVRQKIASLPQEVQDVSLLEKITDKEAAERLSKTVDEACLLLAEY NGRLAAELEDRRQLARMLVEYTQNQKDVLSEKEKKLEVFGVQIILSSSVLLSSSRRAVTP QNVSFRNVTAATRSYGSVELAGGEVQVCIRDLVSRNLELEKKCYGPVPSLSDMEEGNTST VTQQEYKQKLARVTQVRKELKSHIQSLPDLSLLPNVTGGLAPLPSAGDLFSTD >gi568815578r:37883459_38113816|GENSCAN_predicted_CDS_4|1242_bp atgtcctccttctctgagtcggcgctggagaagaagctctcggagctgagcaactctcag cagagcgtgcagaccctgtccctttggctcatccaccaccgcaagcacgcgggacccatc gtctccgtgtggcaccgcgagctccgcaaagccaaatcaaatagaaagcttacttttctg tatttagcgaatgatgtcatccaaaacagtaaaaggaaaggacctgaattcactagagaa tttgaatctgtccttgtggatgctttttctcatgttgccagagaggcagatgaaggctgt aaaaaacctttagaaagattgctgaacatctggcaagaacgaagtgtgtatggcggcgag ttcatacagcagctgaagctgtctatggaggactccaagagccctccccccaaagcaaca gaagagaagaaatctctgaaacgaacttttcagcaaattcaggaggaggaggatgacgac taccctggcagctactctcctcaggatccttctgcaggacccctcttgactgaggaacta atcaaagctttgcaggatctggaaaatgccgcatcaggggatgctactgtccgacagaaa attgcttctctgccccaggaagtgcaagatgtttctctattggaaaaaataacagacaaa gaggcagctgaacgtctttcaaaaacagtagatgaagcatgtctgttactagcagaatat aacgggcgcctggcagcagaactggaggaccgtcgccagctggctcggatgttggtggag tatacccagaatcagaaagatgttttgtcggagaaggagaaaaaactagaggtatttggc gtacagataattttgtcatccagtgtcttactcagcagctcccgccgtgctgtgacaccc cagaatgtgtcatttcgaaacgtcacagctgccactaggtcatatgggagcgtggagctg gctgggggagaggtgcaggtgtgcatcagggatctggtatcaaggaatttagaacttgaa aagaagtgttatggtccagttccctcactttcagatatggaagaagggaacacatccacg gtcacacagcaggaatacaaacagaagcttgcacgagtaacccaggtccgcaaggaactg aaatcccatattcagagcttgccagacctctcactgctgcccaacgtcacagggggctta gcccccctgccctctgctggggacctgttttcaactgactag >gi568815578r:37883459_38113816|GENSCAN_predicted_peptide_5|14_aa MPISAMLQEETEAQ >gi568815578r:37883459_38113816|GENSCAN_predicted_CDS_5|42_bp atgcccatcagcgccatgctgcaggaggaaaccgaggcccag