GENSCAN 1.0    Date run: 4-Nov-116    Time: 21:33:06

Sequence gi568815590r:42075996_42293185 : 217190 bp : 45.88% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Init +  25546  25704  159  0  0   67   53   104 0.443   4.62
 1.02 Intr +  37600  37656   57  0  0   95   95     4 0.010   0.88
 1.03 Intr +  46563  46646   84  2  0   38  106    40 0.005   0.82
 1.04 Intr +  53092  53265  174  0  0   62  109   115 0.089  11.14
 1.05 Intr +  68627  68678   52  1  1   98   77    28 0.005   1.18
 1.06 Intr +  76795  76981  187  2  1  -32    2   198 0.003  -2.05
 1.07 Intr +  78866  78965  100  2  1   45   79   124 0.500   7.31
 1.08 Intr +  81946  82117  172  0  1   79   96   151 0.998  14.52
 1.09 Intr +  86286  86423  138  1  0   21   96    96 0.930   4.14
 1.10 Intr +  89076  89161   86  1  2   82   53    74 0.997   2.84
 1.11 Intr +  89432  89565  134  1  2   52   74   106 0.998   5.04
 1.12 Intr +  91169  91376  208  2  1   88   97   157 0.996  15.68
 1.13 Intr +  91671  91815  145  2  1   56  119    60 0.975   5.76
 1.14 Intr +  93197  93290   94  0  1  110   93    73 0.688   9.02
 1.15 Term +  95534  95630   97  2  1   93   42    83 0.647   1.64
 1.16 PlyA +  96738  96743    6                               1.05

 2.13 PlyA -  97004  96999    6                               1.05
 2.12 Term - 100156  99998  159  1  0  106   47   172 0.981  12.84
 2.11 Intr - 103068 102902  167  1  2  112   91   110 0.999  13.38
 2.10 Intr - 104071 103931  141  2  0  132   95   287 0.999  34.12
 2.09 Intr - 104347 104247  101  0  2   41   91   231 0.609  18.45
 2.08 Intr - 104690 104495  196  0  1  140   75   238 0.999  26.17
 2.07 Intr - 106027 105942   86  0  2   42   59   133 0.999   5.26
 2.06 Intr - 106895 106724  172  0  1   94   94   109 0.999  11.10
 2.05 Intr - 109177 109086   92  0  2   77  110    80 0.999   8.74
 2.04 Intr - 111577 111403  175  2  1   63   94   234 0.999  20.50
 2.03 Intr - 112021 111911  111  2  0   77   95   149 0.999  14.85
 2.02 Intr - 113076 112939  138  1  0   89   53    49 0.770   1.94
 2.01 Init - 117190 117031  160  1  1   95   31   164 0.539   9.39
 2.00 Prom - 119742 119703   40                              -4.26

 3.00 Prom + 121749 121788   40                              -2.26
 3.01 Init + 125711 125845  135  1  0   80   47   109 0.488   6.14
 3.02 Term + 126522 126644  123  2  0   81   44    60 0.471  -0.72
 3.03 PlyA + 128923 128928    6                               1.05

 4.03 PlyA - 129009 129004    6                               1.05
 4.02 Term - 133142 132844  299  0  2   30   48   234 0.632   9.03
 4.01 Init - 139207 139114   94  1  1   65   90    81 0.766   6.55
 4.00 Prom - 140453 140414   40                              -4.16

 5.00 Prom + 152924 152963   40                              -4.66
 5.01 Init + 163305 163420  116  2  2   86   80   109 0.962   7.58
 5.02 Intr + 166800 166970  171  0  0   85   35   108 0.154   4.26
 5.03 Intr + 167264 167321   58  2  1   76   41    53 0.051  -1.71
 5.04 Intr + 177737 177953  217  1  1   61   93   114 0.151   7.28
 5.05 Intr + 195475 195603  129  2  0  110    4    83 0.001   2.77
 5.06 Intr + 201633 201730   98  1  2   80  117   -20 0.000  -0.17
 5.07 Intr + 212639 212733   95  1  2   67  100   152 0.966  13.06
 5.08 Intr + 214161 214278  118  0  1  105  100   196 0.908  22.97
 5.09 Intr + 214864 215015  152  0  2  112   80     8 0.481   1.36

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Init +  38561  38633   73  1  1  106  100    24 0.964   6.63
S.002 Intr - 189447 189296  152  0  2  121   41    93 0.874   6.86

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815590r:42075996_42293185|GENSCAN_predicted_peptide_1|628_aa
MQGQWPKKSDELLLSQDAESNADLKNLYVDSLVIEHMKANKAPKMPIKLRELTACSRYLG
CYKGWLVNSFSLERSPDPDPKRGFLDLVPKRIQGKSTKMKRGQAFFVNEAITALGKPVED
LLKLPDWGWSNITKTLDVAHCGFSVVWPWPACWTAWLQGRDTPHSPTVEGDQNSPSRTVF
DLQSGLGVSELYQSEMTVASTGPIGGLEPDNRASRQQRSIRGEERLPDTQSAAFRRRSVY
RHKIFFVAVIQTEVPPLFVIEFLHRVVDTFQDYFGVCSEPVIKDNVVVVYEVLEEMLDNG
FPLATESNILKELIKPPTILRTVVNTITGSTNVGDQLPTGQLSVVPWRRTGVKYTNNEAY
FDVIEEIDAIIDKSGSTITAEIQGVIDACVKLTGMPDLTLSFMNPRLLDDVSFHPCVRFK
RWESERILSFIPPDGNFRLLSYHVSAQNLVAIPVYVKHNISFRDSSSLGRFEITVGPKQT
MGKTIEGVTVTSQMPKGVLNMSLTPSQGTHTFDPVTKMLSWDVGKINPQKLPSLKGTMSL
QAGASKPDENPTINLQFKIQQLAISAFSAMELIKGGKGHQGELDRTETQQHQFSRTRHET
VSQYSGVFIPGEASYKIRHCESIVTPHL

>gi568815590r:42075996_42293185|GENSCAN_predicted_CDS_1|1887_bp
atgcagggtcagtggcccaaaaagagtgatgaattattgctgtcccaagatgcagagagt
aatgctgaccttaagaatttgtatgtagattctctagtcattgagcacatgaaggcgaac
aaagcccccaagatgcctatcaaactgagagagctcacggcctgctctaggtacctgggc
tgctacaaaggctggctagtaaactccttcagtctggaaaggagtcctgatccagacccc
aagagagggttcttagatctcgtgccaaaaagaattcagggcaagtccacaaaaatgaaa
cgaggccaggccttctttgtcaatgaagccatcacagctctgggcaaacccgtggaagac
ctccttaaactcccagattggggctggtcaaacatcacgaagacattggatgtggcccac
tgtggcttctcggtggtctggccatggcctgcctgctggacagcttggcttcagggcagg
gatacccctcattcaccaacagtggaaggtgatcagaacagcccatcacgcactgtcttc
gacctccaatcaggactcggggtctcggagctgtaccaatcggagatgaccgtggcctca
acagggccaatcgggggactcgagcccgacaacagggcaagccgccaacagcgatcaatc
agaggggaggagcggctcccagatacccaatcagcggccttcaggcgcagatctgtttac
cgccacaagatcttttttgtggccgtgatccagacggaggtcccccctctgtttgtcatt
gagtttcttcaccgagtggtggacacatttcaggattattttggagtctgttcagagcca
gtgatcaaagacaatgtagttgtggtttatgaggtattggaagagatgcttgacaatggt
tttccattggctaccgagtcgaacattcttaaagaactcataaagcctcctaccatcctt
cgaacggttgtcaacaccatcacaggaagcacgaatgtgggtgaccagcttcccactggg
cagctgtcagtggtgccttggcgacggactggggtgaaatataccaacaatgaggcctat
tttgatgtgattgaagagattgatgcaattattgataaatcaggctccacaattactgct
gagatccagggggtgattgatgcctgtgtcaagctgactggcatgccagaccttacactt
tccttcatgaaccctaggttgttggatgatgtcagcttccatccttgtgttcgtttcaaa
cgctgggaatctgagcgcatcctctccttcatccctcctgatggaaacttccgcctgctg
tcttaccatgtcagtgcacagaatctggttgcaatcccagtgtatgtcaaacataacatc
agtttccgggacagtagttcccttggacgctttgaaataacggtgggacccaagcagacg
atggggaagaccattgagggagtgactgtcaccagccagatgcccaagggggtcctgaac
atgagccttactccatcacaggggacacacacattcgacccagtcacaaagatgctgtct
tgggatgtaggaaaaataaatccacaaaagctaccaagtttgaaggggaccatgagtctt
caggctggagcttccaaaccagatgaaaaccccacaattaacctgcagtttaagatccag
cagctggccatttctgccttcagtgccatggaactaatcaagggaggaaaaggtcaccag
ggagaactggacagaactgaaacacagcaacaccagttctcaaggacaaggcatgaaacg
gtgtcgcagtactctggcgtctttattccaggagaggcttcttacaagataaggcactgc
gagtcgattgttacgccacacctgtaa

>gi568815590r:42075996_42293185|GENSCAN_predicted_peptide_2|565_aa
MDAMKRGLCCVLLLCGAVFVSPSQVGVQDPCVPPLKAVMLPEAPGVCVHLGHGVICRDEK
TQMIYQQHQSWLRPVLRSNRVEYCWCNSGRAQCHSVPVKSCSEPRCFNGGTCQQALYFSD
FVCQCPEGFAGKCCEIDTRATCYEDQGISYRGTWSTAESGAECTNWNSSALAQKPYSGRR
PDAIRLGLGNHNYCRNPDRDSKPWCYVFKAGKYSSEFCSTPACSEGNSDCYFGNGSAYRG
THSLTESGASCLPWNSMILIGKVYTAQNPSAQALGLGKHNYCRNPDGDAKPWCHVLKNRR
LTWEYCDVPSCSTCGLRQYSQPQFRIKGGLFADIASHPWQAAIFAKHRRSPGERFLCGGI
LISSCWILSAAHCFQERTYRVVPGEEEQKFEVEKYIVHKEFDDDTYDNDIALLQLKSDSS
RCAQESSVVRTVCLPPADLQLPDWTECELSGYGKHEALSPFYSERLKEAHVRLYPSSRCT
SQHLLNRTVTDNMLCAGDTRSGGPQANLHDACQGDSGGPLVCLNDGRMTLVGIISWGLGC
GQKDVPGVYTKVTNYLDWIRDNMRP

>gi568815590r:42075996_42293185|GENSCAN_predicted_CDS_2|1698_bp
atggatgcaatgaagagagggctctgctgtgtgctgctgctgtgtggagcagtcttcgtt
tcgcccagccaggttggtgtgcaggatccctgtgtcccgcccctcaaggctgtgatgctt
ccggaggctccaggggtctgtgtccatctgggccacggagtgatctgcagagatgaaaaa
acgcagatgatataccagcaacatcagtcatggctgcgccctgtgctcagaagcaaccgg
gtggaatattgctggtgcaacagtggcagggcacagtgccactcagtgcctgtcaaaagt
tgcagcgagccaaggtgtttcaacgggggcacctgccagcaggccctgtacttctcagat
ttcgtgtgccagtgccccgaaggatttgctgggaagtgctgtgaaatagataccagggcc
acgtgctacgaggaccagggcatcagctacaggggcacgtggagcacagcggagagtggc
gccgagtgcaccaactggaacagcagcgcgttggcccagaagccctacagcgggcggagg
ccagacgccatcaggctgggcctggggaaccacaactactgcagaaacccagatcgagac
tcaaagccctggtgctacgtctttaaggcggggaagtacagctcagagttctgcagcacc
cctgcctgctctgagggaaacagtgactgctactttgggaatgggtcagcctaccgtggc
acgcacagcctcaccgagtcgggtgcctcctgcctcccgtggaattccatgatcctgata
ggcaaggtttacacagcacagaaccccagtgcccaggcactgggcctgggcaaacataat
tactgccggaatcctgatggggatgccaagccctggtgccacgtgctgaagaaccgcagg
ctgacgtgggagtactgtgatgtgccctcctgctccacctgcggcctgagacagtacagc
cagcctcagtttcgcatcaaaggagggctcttcgccgacatcgcctcccacccctggcag
gctgccatctttgccaagcacaggaggtcgcccggagagcggttcctgtgcgggggcata
ctcatcagctcctgctggattctctctgccgcccactgcttccaggagagaacataccgg
gtggtccctggcgaggaggagcagaaatttgaagtcgaaaaatacattgtccataaggaa
ttcgatgatgacacttacgacaatgacattgcgctgctgcagctgaaatcggattcgtcc
cgctgtgcccaggagagcagcgtggtccgcactgtgtgccttcccccggcggacctgcag
ctgccggactggacggagtgtgagctctccggctacggcaagcatgaggccttgtctcct
ttctattcggagcggctgaaggaggctcatgtcagactgtacccatccagccgctgcaca
tcacaacatttacttaacagaacagtcaccgacaacatgctgtgtgctggagacactcgg
agcggcgggccccaggcaaacttgcacgacgcctgccagggcgattcgggaggccccctg
gtgtgtctgaacgatggccgcatgactttggtgggcatcatcagctggggcctgggctgt
ggacagaaggatgtcccgggtgtgtacaccaaggttaccaactacctagactggattcgt
gacaacatgcgaccgtga

>gi568815590r:42075996_42293185|GENSCAN_predicted_peptide_3|85_aa
MHAEATVDSQQKPELAAEMTQLRHQAISIDKAHLTKPHYESQRHHPSSTHMSCFSVKHSQ
TMIYPCYIFTINVYGAPAVPWHRLW

>gi568815590r:42075996_42293185|GENSCAN_predicted_CDS_3|258_bp
atgcatgcagaggccactgtggactcccagcagaagccagaactagcagctgagatgaca
cagctgagacatcaggccatctctattgataaggcccatctgaccaagccacactatgaa
agtcagaggcatcatccctcaagcacccacatgagctgcttctctgttaagcattcacag
acaatgatctacccatgctatatattcaccatcaacgtgtatggagcaccagccgtgccc
tggcacaggctctggtga

>gi568815590r:42075996_42293185|GENSCAN_predicted_peptide_4|130_aa
MESYTKAVFTGQKCLTQEGGRSGESCDAMAGEVQDPGFKCNHSVENHPLKSSNESPHDQH
SLLNGNAPPPKKASILTILDHIAGKQPHAHQTGPQLLLLMGGLPPVLSRQPTPVPLFYND
HQQGCSFIKM

>gi568815590r:42075996_42293185|GENSCAN_predicted_CDS_4|393_bp
atggaaagctacaccaaagctgtattcactggacaaaaatgcttgactcaggaaggaggc
cggagcggcgagtcctgtgatgccatggcgggagaggttcaggatccaggatttaagtgc
aaccattcagtggaaaaccatcctctcaagagttcaaatgagagtcctcatgaccagcac
tccctgctgaacggcaatgccccaccccccaaaaaagcctccatcctcaccatcttagac
cacattgcaggaaagcagccccatgcccaccagacaggcccacagctgctgctgctcatg
ggaggtctacctcccgtgctcagcaggcaacccaccccagtgccccttttctacaatgac
catcagcagggctgcagcttcataaaaatgtga

>gi568815590r:42075996_42293185|GENSCAN_predicted_peptide_5|385_aa
MALIPLPISFSLPLAALWATASAIPQVAWPDLLGPHPLRELRDTGSGHTQCDFHMMAAGP
DFVVMSRPLNEGPDHNQNAGSVSCYLQNPIHKREFCLLVGERSALEHLSTELDEFKGYTR
QQFCHESTPNKGVRVIYNPTHPPYCCAQFPLAATGPHILYSSNSELFKRGKGRGEQRKEE
VTWNAEKGESPSWVRPGCHLQAPPPRCLQGPEDPSVPLGSRSLRTGERFTCLPLLHIPPG
PTLGHLLLICVWDGFVLWQLVLPETGEQIAIKQCRQELSPRNRERWCLEIQIMRRLTHPN
VVAARDVPEGMQNLAPNDLPLLAMEYCQGGDLRKNVHFWWVDGSPSWSQGQLGALLPIVP
YDSTTQPASTCYLIRFPKAHTWVHS

>gi568815590r:42075996_42293185|GENSCAN_predicted_CDS_5|1155_bp
atggccctcatcccgcttcccatctccttctcgctccctttagccgccctctgggccact
gcgtctgccattcctcaggtggcctggcccgacctgcttggcccccatcccctccgggag
ctgagggacactggctctggccatacccagtgcgatttccacatgatggctgcaggaccg
gactttgtagtaatgagcaggccactcaacgaaggcccagatcataaccaaaatgcaggg
tctgtcagttgctacttgcagaatccaattcacaagcgcgagttctgcctcctggtgggt
gagcgttctgctctggagcatctaagcacagagttagatgaatttaaagggtacactcgc
cagcagttttgccatgagagtacaccgaacaaaggagtcagggtcatttataacccgacg
catccaccctactgctgtgcccagtttccattggctgcaacaggacctcacattctgtat
tctagcaactcagaactttttaaaagaggcaaaggcagaggagaacaaaggaaggaggag
gtaacgtggaatgctgagaaaggtgagtccccctcgtgggtgcggcccgggtgccacctg
caggccccgccgccccgctgcctgcaaggcccggaagacccctctgtgccgctgggaagt
cgcagcttgcggactggggagcgtttcacttgcctccccctcctccacattcctccagga
ccaactctaggccatctcctcctcatatgcgtctgggatggctttgttctgtggcaactg
gttctcccggaaacaggtgagcagattgccatcaagcagtgccggcaggagctcagcccc
cggaaccgagagcggtggtgcctggagatccagatcatgagaaggctgacccaccccaat
gtggtggctgcccgagatgtccctgaggggatgcagaacttggcgcccaatgacctgccc
ctgctggccatggagtactgccaaggaggagatctccggaagaatgttcacttctggtgg
gtggatggcagcccctcctggtctcaggggcagttgggtgccctcctccccattgttccc
tatgacagtaccacacagcctgctagtacatgttacctcattcgatttccaaaagctcac
acatgggtgcacagn