GENSCAN 1.0 Date run: 6-Nov-116 Time: 15:30:31 Sequence gi568815588r:59692920_60006424 : 313505 bp : 40.61% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.04 PlyA - 700 695 6 1.05 1.03 Term - 6646 6510 137 2 2 81 39 125 0.800 4.10 1.02 Intr - 7518 7370 149 2 2 15 80 57 0.662 -3.44 1.01 Init - 9622 9480 143 1 2 41 38 165 0.657 6.45 1.00 Prom - 10049 10010 40 -7.15 2.06 PlyA - 12090 12085 6 1.05 2.05 Term - 15864 15719 146 1 2 96 38 114 0.801 4.29 2.04 Intr - 16339 16108 232 1 1 15 49 180 0.278 3.22 2.03 Intr - 16944 16560 385 2 1 51 103 175 0.227 9.23 2.02 Intr - 17457 17165 293 0 2 80 24 120 0.072 -0.19 2.01 Init - 36106 35990 117 1 0 44 75 66 0.170 1.15 2.00 Prom - 36473 36434 40 -3.15 3.00 Prom + 46449 46488 40 -5.05 3.01 Init + 57825 57954 130 2 1 56 28 157 0.347 6.87 3.02 Intr + 65036 65131 96 0 0 47 100 53 0.267 1.46 3.03 Term + 84497 84858 362 0 2 70 37 241 0.071 10.91 3.04 PlyA + 85098 85103 6 -0.45 4.02 PlyA - 85238 85233 6 1.05 4.01 Sngl - 91369 90002 1368 1 0 42 41 413 0.889 27.86 4.00 Prom - 92149 92110 40 -5.65 5.22 PlyA - 92413 92408 6 1.05 5.21 Term - 100192 99998 195 1 0 113 38 64 0.592 0.33 5.20 Intr - 101678 101554 125 0 2 94 98 42 0.675 5.18 5.19 Intr - 111601 111501 101 0 2 121 93 53 0.723 7.93 5.18 Intr - 114212 114003 210 1 0 73 93 236 0.907 19.71 5.17 Intr - 116812 116656 157 2 1 13 94 101 0.567 1.35 5.16 Intr - 119825 119716 110 1 2 38 56 160 0.021 6.81 5.15 Intr - 121836 121733 104 0 2 96 14 88 0.011 0.25 5.14 Intr - 139734 139606 129 0 0 76 115 129 0.339 14.37 5.13 Intr - 159783 159634 150 0 0 84 99 153 0.996 15.44 5.12 Intr - 164239 164159 81 1 0 78 85 75 0.925 5.12 5.11 Intr - 172349 172233 117 2 0 100 80 7 0.005 0.84 5.10 Intr - 178048 177869 180 1 0 94 32 82 0.002 2.34 5.09 Intr - 195662 195522 141 2 0 43 39 116 0.002 1.83 5.08 Intr - 197478 197323 156 0 0 38 70 89 0.210 1.39 5.07 Intr - 199565 199419 147 2 0 92 82 66 0.279 5.91 5.06 Intr - 213517 213203 315 1 0 105 78 446 0.441 40.64 5.05 Intr - 229001 228871 131 0 2 83 -17 94 0.006 -2.11 5.04 Intr - 234159 234033 127 0 1 46 121 63 0.403 4.73 5.03 Intr - 271851 271823 29 1 2 84 78 51 0.012 0.62 5.02 Intr - 277734 277682 53 2 2 93 60 46 0.001 -0.07 5.01 Init - 294155 294019 137 2 2 63 98 107 0.836 8.86 5.00 Prom - 296181 296142 40 -5.85 6.03 PlyA - 296304 296299 6 1.05 6.02 Term - 296995 296811 185 2 2 92 41 110 0.955 3.42 6.01 Init - 298906 298843 64 1 1 33 97 55 0.653 2.26 6.00 Prom - 301353 301314 40 -3.15 7.03 PlyA - 301997 301992 6 1.05 7.02 Term - 311595 311430 166 2 1 44 53 123 0.759 0.81 7.01 Init - 313269 313226 44 0 2 89 116 27 0.850 5.54 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init + 276423 276627 205 2 1 65 84 147 0.875 11.06 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815588r:59692920_60006424|GENSCAN_predicted_peptide_1|142_aa MLRETTYLTVTELTTEDKQEALNDSFEPRFALVLAPVMSQPPPFHISEEILGVFGIRKGS LNWGNRSKLGYEECKDELDCAVLMRQSQHLSAFILPTDEEKEAESYLSNLLKVTLLAGTR AAMYTRVIQDTSQCAYSLCEIF >gi568815588r:59692920_60006424|GENSCAN_predicted_CDS_1|429_bp atgttgagagaaaccacataccttactgtgactgagctgactacagaggacaaacaagag gctctgaatgacagttttgagccacgatttgcccttgtcctagccccagtaatgagtcag cccccaccctttcacatcagcgaggagatcctaggagtctttgggataaggaaggggtct ctgaactggggaaaccgcagcaaactggggtatgaagaatgcaaagatgaacttgactgt gcagtgcttatgagacaaagtcagcatttgtcagctttcattctaccaacagatgaagaa aaagaggcagagagctatttaagtaacttgctgaaggtcacactcttggctggtaccaga gctgcgatgtacactcgggttatccaagacaccagccagtgtgcttattccttatgcgaa attttctag >gi568815588r:59692920_60006424|GENSCAN_predicted_peptide_2|390_aa MRKLIAGLIFLKIWTCTVRTSTDFPQIEDCSQCIHQVTEDALLSRGAVTKDCTWHPHSSL PRITDSTKFWCPFLRDHCQLSIQLRPGGRQRPREPGRPRNTRIHLYTPAPPKRARPLAGL ARARGGRRGHSGAWGRAHNRFRLAGVPRKERQGRKRQQGKPRGGEGEREGGAQRRGPAPQ LHVAAAAAGLWGQRAAAPGAARSGTDCLILRQGPGPQLARINSRPAGPELQGRAAAAIPT TPPARPHGALSRVLSTGGPAEAGTQWASFPGNPKAAFCTELKEREGVLGVVRWPPRELPL DSRELPGLLVEEQGEVHVMQVGLHFPEDRKASNCLGPEREQPGNLLWLQQRPEAFQRADN HEQTWLSKANQCFTLFSGCLVYPTQTIPEV >gi568815588r:59692920_60006424|GENSCAN_predicted_CDS_2|1173_bp atgagaaaactcatcgcgggactcattttccttaaaatttggacttgtacagtaaggact tcaactgactttcctcagattgaggactgttcccagtgtatacatcaagtcactgaggac gccctcctttcccgcggagctgtcacaaaggactgtacttggcatcctcactcctcactt cccaggatcacggactccacaaagttctggtgcccgtttctccgggaccactgccagctg agcatacagctccggccaggagggcgccagcggccccgggagccagggagaccaaggaac actcgcattcatttgtacaccccggctccccccaaacgcgcacggcccttggcgggcttg gccagggcacgtggtgggcggcgggggcacagcggcgcctggggacgcgctcacaaccgg ttcagactggccggggtgccccgcaaagagagacaagggagaaaaagacagcagggaaaa ccccggggaggagaaggcgaaagagaaggtggagctcagagaagggggccggctccccag ctccatgtggccgccgccgctgcgggtctgtgggggcagagggcggcggctcccggggca gcgcgtagcgggaccgattgcctaatactccggcaggggccggggccgcagctggctcgg ataaatagccgcccggctggcccggagctgcaggggagagcggcggccgcgatccccacc acaccaccagcccggccgcacggggcactgagccgggtgctgagcaccggaggccccgcc gaggccgggactcagtgggcaagttttcccgggaatccgaaggcggcattttgtacggag ttgaaagagagggagggggtcctgggggtggtacggtggccccccagagagctgcctctg gactccagagagcttccagggttgctggtggaagagcagggagaggtgcatgtgatgcag gtgggcctccatttcccagaggaccgaaaggcctccaactgccttggtccggagcgcgag caaccaggtaacctgctctggcttcagcagcgcccagaggctttccagagagctgataac cacgaacaaacttggctgtcaaaagcaaatcaatgctttaccttgttttctgggtgccta gtctatccaacacaaactatccccgaagtttag >gi568815588r:59692920_60006424|GENSCAN_predicted_peptide_3|195_aa MPGALAGDQLCEASSESPNAAVPADTCPLSGARMQKWVQDAETDDVIGNIGYTTKLILNK ALFRMSYLRMNQDMASSQCKLPVDLPFWNLEDSGPLLTSPLGSAPGGTLCGGSNPTFPSR LPQQRFSMRAPPLQQTFAWASRCFHTFSEIQTEVPKLQFLTSVYLRAQQHLEAAKAWGVH PLKPQSELYVGPFQP >gi568815588r:59692920_60006424|GENSCAN_predicted_CDS_3|588_bp atgcccggagccctggctggggatcagctgtgcgaggcttcctcagagagccccaacgca gcggtgccagcagacacttgtcctttgagtggtgcgaggatgcagaaatgggtgcaggat gcagaaacggacgatgtaattggaaacattggttatacaacaaagcttattctgaataag gctttgttcagaatgtcatatctgagaatgaatcaagatatggccagctcacagtgcaag ctgccagtggatctaccattctggaatctagaggacagtggccctcttctcacatctcca ctgggcagtgccccaggagggactctgtgtgggggctccaaccccacatttccctccaga ctgccccagcagaggttctccatgagggctccgcccctgcagcaaacttttgcctgggca tccaggtgtttccatacattttctgaaatccagacagaggttcccaaacttcagttcttg acttctgtgtacctgcgggctcaacagcacttggaagctgccaaagcttggggtgtccac cctctgaagccacagtctgagctctacgttggcccctttcagccatag >gi568815588r:59692920_60006424|GENSCAN_predicted_peptide_4|455_aa MIISIDAEEAFDKIQQLFIVKTLNKLGIDGTCFKIIRASYDKPTANIILNGQKLEAFPLK TGTRQGCPLSPLLFNIVLEVLARAIRQEKEIKGIQLGKEEVKLSLFADDMIVYLENPIVS AQNLLKLISNFSKVSGYKINVQKSQALLYTNNRQTESQIMSELPFTIASKRIKYLGIQLT RDVKDLFKDNYKPLLKEIKEDTNKWKNIPCSWVGRINIVKMAILPKIIYRFNAIPIKLPM TFFTELEKTTLKFIWNQKRACITKAILSQKNKAGGITLPDFKLYYKATVTKTAWYWYQNR DIDQWNRTEPSEITPHVYNYLIFDKPEKNKQWGKDSLFNKWCWENWLAICRKLKLDPFLT PYTKINSRWIKDLNVRPKTIKTLEENLGITIQDIGMGKDFISKTPKSMATKAKIDKWDLI KLKSFCTAKETTIRVNRQPTKWEKIFATYSSDKYS >gi568815588r:59692920_60006424|GENSCAN_predicted_CDS_4|1368_bp atgattatctcaatagatgcagaagaggcctttgacaaaattcaacaactcttcatcgta aaaactctcaataaattaggtattgatgggacgtgtttcaaaataataagagctagctat gacaaacccacagccaatatcatactgaatgggcaaaaactggaagcattccctttgaaa actggcacaagacagggatgccctctctcaccactcctattcaacatagtgttggaagtt ctggccagggcaattaggcaggagaaggaaataaagggtattcaattaggaaaagaggaa gtcaaattgtccctgtttgcagacgacatgattgtatatctagaaaaccctattgtctca gcccaaaatctccttaagctgataagcaacttcagcaaagtctcaggatacaaaatcaat gtacaaaaatcacaagcattattatacaccaacaacagacaaacagagagccaaatcatg agtgaactcccattcacaattgcttcaaagagaataaaatacctaggaatccaacttaca agggatgtgaaggacctcttcaaggacaactacaaaccactgctcaaggaaataaaagag gatacaaacaaatggaagaacattccatgctcatgggtaggaagaatcaatatcgtgaaa atggccatactgcccaagataatttacagattcaatgccatccccatcaagctaccaatg actttcttcacagaattggaaaaaactactttaaagttcatatggaaccaaaaaagagcc tgcatcaccaaggcaatcctaagccaaaagaacaaagctggaggcatcacactacctgac ttcaaactatactacaaggctacagtaaccaaaacagcatggtactggtaccaaaataga gatatagatcaatggaacagaacagagccctcagaaataacgccgcatgtctacaactat ctgatctttgacaaacctgagaaaaacaagcaatggggaaaggattccctatttaataaa tggtgctgggaaaactggctagccatatgtagaaagctgaaactggatcccttccttaca ccttatacaaaaatcaattcaagatggattaaagacttaaacgttagacctaaaaccata aaaaccctagaagaaaacctaggcattaccattcaggacataggcatgggcaaggacttc atatccaaaacaccaaaatcaatggcaacaaaagccaaaattgacaaatgggatctaatt aaactaaagagcttctgcacagcaaaagaaactaccatcagagtgaacaggcaacctaca aaatgggagaaaattttcgcaacctactcatctgacaaatactcttaa >gi568815588r:59692920_60006424|GENSCAN_predicted_peptide_5|964_aa MGKISEQTFLRRRHTNAKQVYEKVLIITDHQRNANRNYNEISSHPSEPDKYYHLQCVHDV EKSIEELEAQYDELPIFVGSLECKRLSPSHENTSIVSAYVNGTGNILCSETGYCKVIKSS NHQSEHKSSICGKQNAFCSSWLLQAESKLLQEHMHTCLGPAAAMADSASESDTDGAGGNS SSSAAMQSSCSSTSGGGGGGGGGGGGGKSGGIVISPFRLEELTNRLASLQQENKVLKIEL ETYKLKCKALQEENRDLRKASVTIVGEPRNLASNDPCFSGAWGCLGTEWVSKGLVSQDYT SLCGCVKVTGLQEHPLLLRGSLESSQDGASTSSSAALSLLSQFLFFHTSFLVSSSLKAIV DWMSQPCALRLLAPALTFLPKAVDSRLGDLLPAALFPGPQLPSIPVLLVLPSELCMAEAK LLMGRPPNTAAQFMVLSFNSPGLVVDSSMRLFTGVFLFHSCTILRDVSLAIVVQLRNTAG FPSMYSCWNVISRSSGVSHCPWARVGNLGCQILVGWEKNDEETLCGLVLTLTDSDYCSQA RAEQEEEFISNTLFKKIQALQKEKETLAVNYEKEEEFLTNELSRKLMQLQHEKAELEQHL EQEQEFQVNKLMKKIKKLENDTISKQLTLEQLRREKIDLENTLEQEQEALVNRLWKRMDK LEAEKRDISMEIDSPENMMRHIRFLKNEVERLKKQLRAAQLQLTVSKDLWMMLSEGLLYW EMAVDLGKYKLWPLSTREFCVQEEGGGLIGTNLVRFGYDCMKHGLIENSDFSDSEKMAQY LEEERHMREENLRLQRKLQREMERREALCRQLSESESSLEMDDERYFNEMSAQGLRPRTV SSPIPYTPSPSSSRPISPGLSYASHTVGFTPPTSLTRAGMSYYNSPGLHVQHMGTSHGIT RPSPRRSNSPDKFKRPTPPPSPNTQTPVQPPPPPPPPPMQPTVPSAATSQPTPSQHSAHP SSQP >gi568815588r:59692920_60006424|GENSCAN_predicted_CDS_5|2895_bp atgggcaaaatatctgaacagacatttctcagaagaagacatacaaatgccaaacaggta tacgaaaaggtgctcatcatcactgatcatcagagaaatgcaaatcgaaactacaatgag atatcatctcaccccagtgagccagacaaatactatcatcttcagtgtgtgcatgatgtg gaaaagagtattgaagaacttgaagcacagtatgatgagctaccaatatttgttggatca ctggagtgcaagaggctaagtccaagccatgaaaacacctctatagtttctgcttatgtc aatgggacaggaaatatactctgctcagagacagggtattgcaaagtgatcaagagcagc aatcatcaatcagaacacaaatcctcaatatgtggaaaacagaatgctttttgttcatcc tggctcctacaagctgagagcaagctgctgcaggaacacatgcacacttgcctgggtccc gccgcggccatggcggacagcgccagcgagagcgacacggacggggcggggggcaacagc agcagctcggccgccatgcagtcgtcctgctcgtcgacctcgggcggcggcggtggcggc gggggaggcggcggcggtgggaagtcggggggcattgtcatctcgccgttccgcctggag gagctcaccaaccgcctggcctcgctgcagcaagagaacaaggtgctgaagatagagctg gagacctacaaactgaagtgcaaggcactgcaggaggagaaccgcgacctgcgcaaagcc agcgtgaccatcgttggggagcctcgcaacctggcttctaacgacccatgtttctctggt gcttggggatgcctgggaactgagtgggtttccaagggtttggtgagccaggattatacc agtctgtgtggctgtgtaaaagtaacaggactgcaggagcatcccctccttttgcggggc tctttggagtcttctcaggatggtgcctccacttcttcctcagcggctctcagtttgctt tcacagttcctgttcttccacaccagcttcctggtctcctcctctctgaaagccatcgtt gactggatgtctcagccctgtgcccttaggctgctggccccagctctaaccttcctccct aaagctgttgacagcaggcttggagaccttttacctgctgccctgtttcctggtccccag cttccgagcatcccagttttgctggttctgccctcagaattgtgtatggcagaggctaaa ctgctgatggggagacccccaaatacagctgctcagttcatggtgctctcatttaatagt cccgggctggtggttgactcctccatgaggttgttcactggggtcttcctatttcatagc tgtaccatcttgcgggatgttagccttgccattgtggtgcaactgagaaacaccgcaggt tttccaagcatgtatagctgttggaatgttatatcccggtcttcaggtgttagtcattgc ccctgggccagggttggaaacctcgggtgtcagattttagtaggatgggaaaagaatgat gaagagacgctttgtggtctagtcctcacattaacagacagtgattactgctcgcaagcc agggctgagcaggaagaagaattcattagtaacactttattcaagaaaattcaggctttg cagaaggagaaagaaacccttgctgtaaattatgagaaagaagaagaattcctcactaat gagctctccagaaaattgatgcagttgcagcatgagaaagccgaactagaacagcatctt gaacaagagcaggaatttcaggtcaacaaactgatgaagaaaattaaaaaactggagaat gacaccatttctaagcaacttacattagaacagttgagacgggagaagattgaccttgaa aatacattggaacaagaacaagaagcactagttaatcgcctctggaaaaggatggataag cttgaagctgaaaagcgagatatctccatggagattgattctccagaaaatatgatgcgt cacatcaggtttttaaagaatgaagtggaacggctgaagaagcaactgagagctgctcag ttacagctcacagtttccaaggatctgtggatgatgttaagtgagggcctactatattgg gagatggctgtagacctggggaaatataagctgtggcccctaagtactagggagttctgt gtgcaagaggaaggaggaggactcataggaacaaatctggtcagatttggatatgattgc atgaaacatggtttaattgaaaactctgacttttcagattcagagaaaatggcacagtat ctggaggaggaacgtcacatgagagaagagaacttgaggctccagaggaagctgcagagg gagatggagagaagagaagccctctgtcgacagctctccgagagtgagtccagcttagaa atggacgacgaaaggtattttaatgagatgtctgcacaaggattaagacctcgcactgtg tccagcccgatcccttacacaccttctccgagttcaagcaggcctatatcacctggtcta tcatatgcaagtcacacggttggtttcacgccaccaacttcactgactagagctggaatg tcttattacaattccccgggtcttcacgtgcagcacatgggaacatcccatggtatcaca aggccttcaccacggagaagcaacagtcctgacaaattcaaacggcccacgccgcctcca tctcccaacacacagaccccagtccagccacctccgcctccacctccgccacccatgcag cccacggtcccctcagcagccacctcgcagcctactccttcgcaacattcggcgcacccc tcctcccagccttaa >gi568815588r:59692920_60006424|GENSCAN_predicted_peptide_6|82_aa MTQTHTTAPPHQTAVAMGTTQGSCQSTMSDGCTGADKDSNSSPGKVEWACDPDLANHDVS CPLSISDTLKERHISNLARVLL >gi568815588r:59692920_60006424|GENSCAN_predicted_CDS_6|249_bp atgacccagacacacactactgctccaccccaccagactgctgttgccatgggaaccact caaggaagctgccagtccaccatgtcagatggttgtactggggctgacaaggatagcaac tcatctccaggaaaagtggaatgggcctgtgacccagacctggctaatcatgatgtttca tgccccttgtcaataagtgacacacttaaggaaaggcatatctcaaatctagccagagtt ctcctctga >gi568815588r:59692920_60006424|GENSCAN_predicted_peptide_7|69_aa MVLKVAKLTKDYNESIIEIAGHKPGGAGNHQMQKACLGIRPAHRKVELRDGKEGRGGEER KWLGSNDII >gi568815588r:59692920_60006424|GENSCAN_predicted_CDS_7|210_bp atggtgctcaaagtggctaaattgacaaaagactataatgaaagtattattgaaattgca ggacataagcctggaggtgctggaaaccaccagatgcagaaggcttgcctgggaataagg ccagcgcataggaaagtagaactgagagatggaaaagagggcagaggaggagaagagaga aagtggctaggttctaatgacatcatttga