GENSCAN 1.0 Date run: 5-Nov-116 Time: 01:41:04

Sequence gi568815590f:95054045_95254791 : 200747 bp : 42.92% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.11 PlyA -    986    981    6                               1.05
 1.10 Term -   8984   8763  222  2  0  106   39    39 0.080  -3.27
 1.09 Intr -  13215  13121   95  1  2  110   84    43 0.145   4.86
 1.08 Intr -  15992  15930   63  0  0  110   32    57 0.089   0.07
 1.07 Intr -  17561  17440  122  1  2   55   89    58 0.287   1.82
 1.06 Intr -  17863  17702  162  0  0   72   38   102 0.266   1.77
 1.05 Intr -  19284  18673  612  2  0   80   94   417 0.307  32.07
 1.04 Intr -  30796  30750   47  1  2   50   80    53 0.016  -3.01
 1.03 Intr -  32119  31991  129  1  0   68   94    15 0.429   0.07
 1.02 Intr -  32234  32196   39  2  0   78   97    42 0.684   1.60
 1.01 Init -  34198  33920  279  1  0   93   57   235 0.604  16.21
 1.00 Prom -  36983  36944   40                              -6.85

 2.00 Prom +  37292  37331   40                              -5.45
 2.01 Init +  41247  41464  218  2  2   86   66   138 0.454   9.61
 2.02 Intr +  48490  48628  139  1  1   12  107    48 0.281  -1.35
 2.03 Intr +  48932  48967   36  1  0  118   64    34 0.113   1.54
 2.04 Intr +  55700  55809  110  1  2   95  109    54 0.948   6.36
 2.05 Term +  60291  60471  181  0  1   50   32   158 0.548   2.50
 2.06 PlyA +  62768  62773    6                               1.05

 3.06 PlyA -  64391  64386    6                               1.05
 3.05 Term -  69815  69690  126  2  0   98   44   115 0.779   5.40
 3.04 Intr -  79915  79796  120  1  0   42   62   107 0.011   3.27
 3.03 Intr -  80307  80026  282  0  0   88    3   210 0.008   9.19
 3.02 Intr -  83207  83054  154  1  1   67   10   171 0.086   6.35
 3.01 Init -  90231  90176   56  0  2   45   75    59 0.512   1.11
 3.00 Prom -  93508  93469   40                              -5.65

 4.00 Prom +  98727  98766   40                              -5.75
 4.01 Sngl + 100001 100750  750  1  0   74   49   530 0.999  43.62
 4.02 PlyA + 101423 101428    6                               1.05

 5.00 Prom + 109090 109129   40                              -5.85
 5.01 Init + 110461 110515   55  0  1   53   54    57 0.125   0.30
 5.02 Term + 120489 120604  116  1  2    5   37   231 0.440   7.45
 5.03 PlyA + 120705 120710    6                               1.05

 6.00 Prom + 134766 134805   40                              -3.65
 6.01 Init + 136757 136962  206  1  2   85   38   227 0.551  15.86
 6.02 Intr + 138577 138684  108  1  0   61   94    65 0.853   2.88
 6.03 Intr + 139360 139490  131  1  2   49   36    90 0.598  -0.68
 6.04 Term + 143607 143731  125  1  2  125   43   280 0.992  24.77
 6.05 PlyA + 145139 145144    6                               1.05

 7.13 PlyA - 145209 145204    6                               1.05
 7.12 Term - 145346 145327   20  0  2   81   51    24 0.322  -4.50
 7.11 Intr - 146318 146210  109  2  1   81   93    36 0.595   2.34
 7.10 Intr - 146737 146570  168  1  0  115   72   130 0.884  13.32
 7.09 Intr - 147419 147164  256  1  1  -45   70   263 0.473   7.72
 7.08 Intr - 151671 151547  125  0  2   74   45    67 0.243  -0.54
 7.07 Intr - 152716 152476  241  0  1   41   75   143 0.575   4.93
 7.06 Intr - 153264 153192   73  1  1  117   33    45 0.343  -0.55
 7.05 Intr - 153907 153712  196  1  1   68   64   151 0.559   8.77
 7.04 Intr - 154534 154375  160  0  1   47   94    79 0.335   3.47
 7.03 Intr - 156015 155793  223  1  1   94    0   167 0.273   4.76
 7.02 Intr - 159222 159052  171  1  0   89   37   140 0.621   7.99
 7.01 Init - 167419 167227  193  1  1   78   84   116 0.123   9.38
 7.00 Prom - 174504 174465   40                              -3.45

 8.07 PlyA - 175596 175591    6                               1.05
 8.06 Term - 176183 176051  133  1  1   59   42   106 0.068  -0.32
 8.05 Intr - 176930 176805  126  1  0  104   80    26 0.078   2.27
 8.04 Intr - 177772 177629  144  0  0   77  116    51 0.158   5.28
 8.03 Intr - 179202 179085  118  1  1  114   27    54 0.145   0.50
 8.02 Intr - 188805 188728   78  1  0  102   87    19 0.203   1.80
 8.01 Init - 196004 195938   67  2  1   93   83    43 0.461   5.59

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init + 53289 53438 150 2 0 74 41 82 0.872 2.19 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815590f:95054045_95254791|GENSCAN_predicted_peptide_1|589_aa MGIAVAEVRGSLWEAVSPGGHLVLIAMRFAGSASSITAAAVQAPGCGQPHGGGEQVAQGA RASMASAGSASLQCCQEEFMSCLSSATRKQPDGLALPVAILTARLKSPVLAMGRQDVILD SHHNPGWSRGSYENIAYRCKEVCRSSCCWFYLWESLVAKAKSYDSGFWHPQRKKRGDPLW PQEAQKHCPGCARKGENLGDPPKAPLGLSTELAQLRAATRDAASFRPVGALLALSASIPS FGPRSHISNDTLADSPASLAASLQRADASPEVVRKGGKAGQPRGSPQPWRSGAEEIVEVG LLPLTPRALLFPQVPTALPAVYHFLAACEGICPHSCAAPSMIELWDFLIQQHQPEKAVAC TTTHIPGLRISALNKHRAWGGTDDDVAASEMPLGAAGAEPWDWNDFCPLPSKFPQATPLP PLGVQVPEALREAAKVTGVGSGATADLLAPTAGVGTSRTACGGGTQFMSEGAPGVMNNGS PDWSSTQFMSEGAPGVMNNGSPDWSSNLSQPQPMHLLYLTTWFLERPMCCPDTTALTQTL PVPSFSYPVCTSHIRQNPHLHFLRAPSPHYTTACSFEPQRFLEQQQDLL >gi568815590f:95054045_95254791|GENSCAN_predicted_CDS_1|1770_bp atgggcatagctgtggcagaggtgagaggcagcctgtgggaggctgtgagcccaggtggc cacctggtccttatagccatgaggtttgcaggcagtgcctcaagcatcacagcagctgca gtgcaggcacctggctgtggacaacctcatgggggtggcgagcaggtggcccaaggtgcc agggccagcatggcatccgcagggtcagctagtcttcagtgctgccaagaagagtttatg agctgcctctcctctgctaccaggaaacagcctgatgggttggcattacctgtcgctatc ctcacagcccgattaaagtcccctgttttggccatgggtagacaggacgttattctggat tctcaccataacccaggctggtctagaggcagttatgagaacatagcctatagatgcaag gaagtttgtcgctcttcctgttgctggttctatttgtgggaaagtttggttgcaaaagct aaatcctacgacagtggcttttggcatccccagaggaaaaaacggggagacccgctttgg cctcaggaagcgcagaaacactgtcctggctgcgcgcgcaagggcgagaacctaggggac ccgcctaaagcgccgttggggctgagcaccgaactcgcacagctccgggcagcgacccgc gatgcggcatccttccgtccggtcggagccctcctagccctcagcgcgtctattcctagc ttcggtccccgcagccatataagcaatgacaccctcgccgactcgcctgccagcttggcc gcctccctgcagagggctgacgcttctccagaagttgtaagaaaaggagggaaagcaggc caacctcgaggatctccccagccttggcgttcaggtgctgaggagatcgtcgaggttggc ctgcttcccctcactcctcgggccttgctcttcccccaggtgcccactgccctccctgct gtgtaccactttcttgctgcctgtgaaggtatttgtcctcactcttgtgctgctccttca 
atgatagaactttgggactttctcatccagcagcaccagcctgaaaaagcagtggcttgc accaccacacacatacctggtctcaggatctcagccctaaacaagcacagggcgtggggt ggtacagatgatgatgtagctgcttctgagatgcccctgggcgctgctggagctgagccc tgggactggaatgacttttgcccgctgccctcaaaattcccacaggccacaccgctacca cctcttggggtgcaggtccctgaagccctcagggaagctgcaaaggttactggggtcggc tctggggcaacagctgacctcttagctcccactgcaggcgtaggaacctccaggacagcc tgtgggggtggtacacagttcatgagtgaaggagctcctggagtcatgaataatggaagt ccagattggtcaagtacacagttcatgagtgaaggagctcctggagtcatgaataatgga agtccagattggtcaagtaacttgtcccagccccagcccatgcatctgctgtacttaaca acctggtttcttgaacgtcccatgtgttgcccagacaccacagctttgacacagactctt ccagttccatctttctcatatccagtttgtacttcacacatcagacagaatccccattta catttcctgagagcccccagcccacattatactacagcatgttcattcgaaccacaaaga tttttggagcagcaacaggatctactttag >gi568815590f:95054045_95254791|GENSCAN_predicted_peptide_2|227_aa MGVKLAEKIVTLVLLEKEGCEGLVDTVIRMGSRSLILGRSLCPVASSQPRCKLKAIHPEP QGGVQGNPAHVGRGVAVMQCQWNLLEIYSLELAGNSSLGCSGKQFMGKYFPWVPEGGALA LQSHVPYDDKMDLCDYIGSPRLAPACFADKETEIQIGKVKMTYPRSCRQVEVNVREAHRT AKAAGAVSSPVSDFPVISPLIPKAMFAESQWHFLELPMEVYFTGTFK >gi568815590f:95054045_95254791|GENSCAN_predicted_CDS_2|684_bp atgggggtgaagctagctgagaagatagtcaccttagtactgctggagaaggaaggatgt gagggtttggtggacacagtcatccgaatgggatccaggtcattaatccttggccgctct ctgtgcccagtggcctcctctcaaccaaggtgcaaactaaaggcaattcaccctgaaccc cagggtggagtgcaggggaatccagcacacgtgggaagaggagttgctgttatgcagtgt cagtggaacttgctagaaatctattccctggaacttgctggaaattcatctctaggatgc tcaggaaagcaattcatgggaaagtatttcccatgggtacctgagggaggtgccctggct ctacagtctcatgtaccttatgacgacaagatggacctttgtgattacattgggtcacct agactagcccctgcctgctttgctgataaggaaactgaaattcagattggtaaagtgaaa atgacttatccgagatcatgcaggcaggtggaggtgaatgtgagggaggcccacaggacc gcaaaagcagctggtgctgtgagttcaccagtaagtgactttcctgtgatttccccgctt ataccaaaagccatgtttgctgaaagtcaatggcattttctggaactgcctatggaggtg tacttcacaggaacttttaaataa >gi568815590f:95054045_95254791|GENSCAN_predicted_peptide_3|245_aa MSLENQNSHTGGLKTSLERPFHKHRILGDPHANTEANISVPISQMSPVKLTEVKRLVQES 
QASKELSQDKTRGPDPTRRPPSPRGGGGVLRAGAARSRAGGTAAATGPRAGPGARGAAGS APFLTDPNMASAATARDDDTSRRCDGSGTAGGSARKVSRLPSRPGADSARSPPGPSRVCA VRSPAQPARSRRGGRAGRQRSRSQAEHHMEAATAWGFHPLKPQPELYIGPFQPWLEWLEY KAPSP >gi568815590f:95054045_95254791|GENSCAN_predicted_CDS_3|738_bp atgagtttggaaaatcagaacagtcacactggaggtttaaaaacatctttagagaggccc ttccacaaacatcgaatcttgggtgaccctcacgctaacactgaggcaaatatttctgtg cccatttcacaaatgagtccagtgaagcttacagaagtgaagcggcttgttcaagagtca caagccagcaaggagctgagtcaggacaagacccgtggaccggaccctacccggcggccc ccgtccccccgcggcggcggcggcgtcctacgggcgggcgcggctcgctcccgggctggc gggacggcggcggcaactggcccgcgggccggccccggcgcgaggggagcggccgggagc gccccgtttctcacagaccccaacatggcgtcggccgccaccgcccgcgacgacgacacc tcccggcgctgcgacggctccgggaccgcgggaggaagcgcccggaaagtttcgcgtctc ccctcccggcccggcgcggactctgcccgttccccgccggggccttccagggtgtgtgct gtccgcagccccgcgcagccggcgcgatccagacgcggtgggcgggctggccgccagcgc agccgctcgcaggctgaacaccacatggaagctgccacagcttggggcttccaccctctg aagccacagcccgagctctacattggcccctttcagccatggctggagtggctggaatac aaggcaccaagtccctag >gi568815590f:95054045_95254791|GENSCAN_predicted_peptide_4|249_aa MVDRLANSEANTRRISIVENCFGAAGQPLTIPGRVLIGEGVLTKLCRKKPKARQFFLFND ILVYGNIVIQKKKYNKQHIIPLENVTIDSIKDEGDLRNGWLIKTPTKSFAVYAATATEKS EWMNHINKCVTDLLSKSGKTPSNEHAAVWVPDSEATVCMRCQKAKFTPVNRRHHCRKCGF VVCGPCSEKRFLLPSQSSKPVRICDFCYDLLSAGDMATCQPARSDSYSQSLKSPLNDMSD DDDDDDSSD >gi568815590f:95054045_95254791|GENSCAN_predicted_CDS_4|750_bp atggtggatcgcttggcaaacagtgaagcaaatactagacgtataagtatagtggaaaac tgttttggagcagctggtcaacctttaactatacctggacgagttcttattggagaagga gtattgactaagttgtgcaggaaaaagcccaaagcaaggcagtttttcttgtttaatgat attcttgtatatggcaatattgtcatccagaagaaaaaatataacaaacaacatattatt cccctggaaaatgtcactattgattccatcaaagatgagggagacttaaggaatggatgg ctaatcaagacaccaactaaatcttttgcagtttatgctgccactgctacggagaaatca gaatggatgaatcatataaataaatgtgttactgatttactctccaaaagtgggaagaca cccagtaatgaacatgctgctgtctgggttcctgactctgaggcaactgtatgtatgcgt tgtcagaaagcaaaattcacacctgttaatcgtcgccaccattgccgcaaatgtggtttt gttgtctgtgggccctgctctgaaaagagatttcttcttcccagccagtcctctaagcct 
gtgcggatttgtgacttctgctatgacctgctttctgctggggacatggccacatgccag cctgctagatcagactcttacagccagtcattgaagtctcctttaaatgatatgtctgat gatgatgacgatgatgatagcagtgactaa >gi568815590f:95054045_95254791|GENSCAN_predicted_peptide_5|56_aa MAVRILKEKRDHSNLWVGGPQNAELQQEPDNGVGDKASFRDASVYAEGTWAESADK >gi568815590f:95054045_95254791|GENSCAN_predicted_CDS_5|171_bp atggcagtaagaatcttgaaagaaaaacgggaccattccaatttatgggttggtggacct caaaatgcagaactgcagcaggaacccgacaatggggtgggagacaaagccagcttcaga gatgccagtgtttacgctgaagggacgtgggcagaatccgccgacaaataa >gi568815590f:95054045_95254791|GENSCAN_predicted_peptide_6|189_aa MAFWGLQNYKGRGQLAALAHTFRCALGAASAVSPVSPEQGEGLQKPALSTRYLLDPAGLI LWLWSPPAGSLHGLRVGMQTKPVGHPSTIHAAFHLALLGSLCQLRPQGLESTPLKISVCR GEETPAERAPVRGTYKQELRNCRKSAGEGHPGPWMGLDAKSCSVRTFSDDDDDDDDDDDD SGESKDHMA >gi568815590f:95054045_95254791|GENSCAN_predicted_CDS_6|570_bp atggctttctggggccttcagaactataaagggagagggcagctggcagccctggcccac accttcagatgtgccctgggtgctgcctctgctgtgtctcctgtgtctcctgagcaggga gagggactgcagaaacctgcactgtccactcgctacctcctggatcctgccggtctcatt ctgtggctgtggtccccgccggctggcagcctgcatggactaagagtgggcatgcaaacc aagcccgtgggccatcccagcaccattcatgctgccttccacctggcattgttgggctct ttgtgtcagctcaggccccagggacttgaaagcactccactcaagatttctgtctgccga ggagaggagaccccagcagaaagagccccggtgaggggtacctacaagcaggagctgagg aattgtaggaagtcagctggagaggggcatcctggtccgtggatgggcttggatgccaag agctgctctgtaagaaccttctcagatgatgatgatgatgatgatgatgatgatgatgat tctggagaaagcaaagaccacatggcctga >gi568815590f:95054045_95254791|GENSCAN_predicted_peptide_7|644_aa MGPAKRRAMGCGNAGKTAFPTHPDERGWPGRQQSGETVLPGKLEEQRIAKGWGVADGVSC PRGVCLLSSHEDSRGCGGPRGTEILLHQKDDKMSTKPAHRAEYAQGCARVRTQCMHGSQG HEDPGGGGHVLSAPGLSLLQTGQPCCGHLSIEPDLEIQFLRGTRRGAPTVFSQAIPEKWK KRAFHRPSGKHSTIRVPPPSSLSSVSSMRLKKNAVVVPCLLTQRVKQEFSAPVCHGGPWE EVTYHSHPQGQQDGLGNSCTPTAASPAQYLDNMNPPPSSISVFQSCQLPLGKALSTRSLR PKADSGLIITVFPAGTGALRTLLDLLDQKGRGNAGQKSSQSLARRMERPPLASIMRIRSL GWEHPYTYVKENRYLLGRRKEGQALGRQPKDPAAVSVTTSLPGASATLIIFSWCPLSLKQ PLTGHFTQPPEQYFQTTGHIMPLLTHIPQLLKTSTGPSMSRRGEGDVTTGAETATTAKKR 
CSRQRLEDMDSPLEPPEGAQPASTLISASDTDCGLPAFGTVREEMPAVISHQVYSNMLQE PQETKSGVQPPAEEPGTAIAIAAAAPQKWPNPPGKAPKGKRPERVAFQCLHRDCMEFTPN FRDSLCLNNLGGGVRVWQGEGSSQDPPHAAPAPSSHDPDNWHQK >gi568815590f:95054045_95254791|GENSCAN_predicted_CDS_7|1935_bp atgggccctgcgaagagaagagccatgggctgcgggaacgctgggaaaacagcattccca acacacccagatgaaaggggctggcctgggagacagcagagtggagaaacagtcctgcca ggaaaactagaggagcagaggattgccaaggggtggggtgtggctgatggtgtctcctgc cccaggggagtctgcctcctctccagccatgaggacagcaggggatgtggagggccaaga gggacagaaatcctcctgcaccaaaaggatgataaaatgagcaccaaaccagctcaccga gctgagtatgcccagggttgtgccagagtcagaacccagtgcatgcacgggagccaaggc catgaagacccaggtggaggtggccacgtgctctcagcacctggactatcacttctgcag actggacaaccatgctgtgggcacttgtccatagagccagacctggaaatacagtttctc agaggtacccggagaggggcaccaactgtgttctcccaggccatcccagaaaagtggaag aagagagcatttcacagaccatcaggaaaacactcaaccataagggtaccccctccctct tccctgagctctgtttcttccatgaggttgaagaaaaatgccgtcgtggtcccctgcctg cttactcagcgtgtcaagcaggagttctcagctccagtgtgccatggagggccttgggaa gaggtcacataccattcccatccccagggccagcaggatgggctggggaattcctgcacc ccaacagctgcgtccccagctcagtacctggacaacatgaatccacccccatcgtcaata agcgtctttcagagctgccagctccctctggggaaagccctttccactcgttctttgagg cccaaagctgattctggtctaattataactgtattcccagcaggcacaggagcactgagg acccttcttgacctcctggatcagaaaggcagaggaaatgctgggcagaagtcaagccag tccctggcaagaagaatggaacgaccacctttggcttccattatgaggattcgctctctg gggtgggaacatccatacacttatgtcaaggagaacagatacctgttgggaaggaggaaa gagggacaagcgttgggcaggcaacccaaggaccctgcagcagtctccgtgacaacttct cttcctggtgcctctgcgactctgatcattttttcttggtgtcctctttctctgaagcag cctctcaccggccactttacacagccaccagaacaatatttccaaaccacaggtcacatt atgcccctccttacccacatcccccagcttctgaagacatccactggaccttccatgagc aggagaggagaaggtgatgtgaccacgggggcagagactgcgaccacagccaagaaacgc tgcagccgtcagaggctggaagacatggattctcccctagagcctccagagggagcgcag cctgccagcaccttgatttcagccagtgacactgattgtggacttccagccttcggaact gtaagagaagaaatgcctgctgttataagccaccaggtttacagtaatatgttacaggag ccacaggaaactaagtcaggggttcagcctcctgcagaggaaccaggtacagccattgcc attgcagctgcagccccacagaaatggcctaaccccccagggaaggcccctaaaggaaag 
cgccctgaaagagtggctttccaatgtttacacagagattgcatggaatttacaccaaac ttcagggactcgctctgtctgaacaacctgggaggaggagtcagggtgtggcagggtgag ggaagtagccaggaccctccacatgctgcccctgctccctcatcccatgaccctgataat tggcatcagaagtga >gi568815590f:95054045_95254791|GENSCAN_predicted_peptide_8|221_aa MGGDKHVSSKVIETIGSRALMGDILPHLILSKSNSFFKASILYNSSSQTEVSQPTPFTLP PPRAVDKFVNHHLAIREAFYQGLLGRRGPTDQPGCSTITRTRSMSFQPSSNAAMPPLLPP GHCLTTTEAVTTLFLSDLPSPGLSRWSPGFCIPLSPPFSSTKTNHSCPVYQGLPTETRHQ ALSQVLETQIQPLCSWQKEDNIGRGDCSVVTHQIQLRQVYI >gi568815590f:95054045_95254791|GENSCAN_predicted_CDS_8|666_bp atgggtggggacaagcatgtgagcagcaaggtcatagagacaatagggtcaagagcactg atgggagatatactccctcatcttatcctttcaaaatccaactcatttttcaaagccagt atcttgtacaacagcagttctcaaactgaagtatctcagcccactccctttaccttgcca cccccaagggctgtagacaagtttgtgaaccatcaccttgcaatcagggaagccttctac caagggcttcttggaagaaggggccccacagaccagcctggctgctccaccatcacccgc accaggtctatgtccttccagccatcctcaaatgctgccatgccccctttgctgcctcct ggccactgcctgacaaccacagaagctgtcaccactttgttcctaagcgacctgcccagt ccaggcctctccagatggtcccctggcttctgcatccctctttcacctcctttttcctca accaagacaaaccactcctgtccagtctatcagggcctccccactgaaaccaggcaccag gcattgagccaggtcctggagacacaaatacagcccctgtgctcatggcagaaggaagac aacataggtcgaggagactgttcagttgtgactcaccagatccaactcaggcaggtctat atttaa