GENSCAN 1.0	Date run: 19-Feb-121	Time: 20:39:11

Sequence gi568815586r:128694199_128923827 : 229629 bp : 46.01% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Intr +   1632   1905  274  0  1   48   91   345 0.990  28.34
 1.02 Intr +   3026   3217  192  1  0   91   91   248 0.995  25.09
 1.03 Term +  10892  12097 1206  1  0   94   47  1680 0.985 156.17
 1.04 PlyA +  12306  12311    6                               1.05

 2.05 PlyA -  12661  12656    6                               1.05
 2.04 Term -  20001  19851  151  2  1   51   49    90 0.506  -1.22
 2.03 Intr -  20318  20080  239  2  2  127   87    71 0.540   7.31
 2.02 Intr -  20570  20488   83  0  2   56   96    58 0.801   2.76
 2.01 Init -  24125  24011  115  2  1   86   73    24 0.337   1.08
 2.00 Prom -  28875  28836   40                              -1.66

 3.09 PlyA -  32322  32317    6                               1.05
 3.08 Term -  38070  37870  201  0  0   88   44    66 0.614  -0.41
 3.07 Intr -  50917  50854   64  0  1  126  110    14 0.869   6.12
 3.06 Intr -  53720  53529  192  1  0   97   54    89 0.056   5.01
 3.05 Intr -  82477  82309  169  2  1   48   16    99 0.105  -2.30
 3.04 Intr -  87228  87127  102  1  0   67   41    89 0.646   2.25
 3.03 Intr -  87464  87372   93  0  0   84   47    83 0.722   3.74
 3.02 Intr -  91254  91159   96  1  0   26   81    77 0.417   0.78
 3.01 Init -  93548  93494   55  2  1   46   99    69 0.776   5.25
 3.00 Prom - 101857 101818   40                              -3.46

 4.10 PlyA - 102400 102395    6                               1.05
 4.09 Term - 104922 104720  203  1  2  108   54    65 0.693   2.65
 4.08 Intr - 105219 105061  159  1  0  140   91    98 0.986  15.26
 4.07 Intr - 106811 106656  156  0  0   61   94   251 0.998  22.98
 4.06 Intr - 114758 114590  169  2  1  103  110   160 0.925  19.22
 4.05 Intr - 115830 115745   86  1  2   39   89    68 0.638   1.54
 4.04 Intr - 118563 118486   78  1  0   68   88    40 0.509   1.52
 4.03 Intr - 120118 119988  131  0  2   57   62    69 0.665   1.54
 4.02 Intr - 120872 120577  296  2  2   96   94   226 0.838  19.81
 4.01 Init - 129745 129200  546  1  0   44   99  1004 0.997  90.01
 4.00 Prom - 133102 133063   40                              -3.46

 5.00 Prom + 142354 142393   40                              -5.86
 5.01 Init + 144812 144907   96  1  0   56   75   110 0.904   6.81
 5.02 Intr + 149216 149329  114  1  0   73   60    86 0.957   4.94
 5.03 Intr + 151115 151252  138  1  0   57   58    77 0.549   2.26
 5.04 Intr + 151497 151700  204  2  0   85   33   111 0.504   4.70
 5.05 Term + 152261 152554  294  1  0   73   47   177 0.667   7.41
 5.06 PlyA + 154980 154985    6                               1.05

 6.00 Prom + 155757 155796   40                              -7.96
 6.01 Init + 159144 159451  308  2  2   65   86   317 0.273  23.97
 6.02 Intr + 169889 169981   93  2  0   60   93    48 0.005   1.58
 6.03 Intr + 181716 181864  149  0  2   46   84    76 0.071   2.88
 6.04 Intr + 194441 194546  106  0  1  103   90   110 0.818  11.97
 6.05 Intr + 220688 220791  104  2  2   53   85    24 0.025  -1.58
 6.06 Term + 223826 224091  266  0  2   52   41   157 0.072   3.07
 6.07 PlyA + 224275 224280    6                               1.05

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815586r:128694199_128923827|GENSCAN_predicted_peptide_1|557_aa
XPTRESEDEDEEERRGRGCALQYQHATVRVLTQFVSEGAGPWGQPNYLLSPNWQFDITHL
VADFMKLEEPHVATLQDSRVLVGREVGMTTIQVLSPLSDSILAEKTITVLDDKVSVTDLA
IQLVAGLSVALYPNAENSKAVTAVVTAEEVLRTPKQEAVFSTWLQFSDGSVTPLDIYDTK
DFSLAATSQDEAVVSVPQPRSPRWPVVVAEGEGQGPLIRVDMTIAEACQKSKRKSILAVG
VGNVRVKFGQNDADSSPGGDYEEDEIKNHASDRRQKGQHHERTGQDGHLYGSSPVEREEG
ALRRATTTARSLLDNKVVKNSRADGGRLAGEGQLQNIPIDFTNFPAHVDLPKAGSGLEEN
DLVQTPRGLSDLEIGMYALLGVFCLAILVFLINCATFALKYRHKQVPLEGQASMTHSHDW
VWLGNEAELLESMGDAPPPQDEHTTIIDRGPGACEESNHLLLNGGSHKHVQSQIHRSADS
GGRQGREQKQDPLHSPTSKRKKVKFTTFTTIPPDDSCPTVNSIVSSNDEDIKWVCQDVAV
GAPKELRNYLEKLKDKA

>gi568815586r:128694199_128923827|GENSCAN_predicted_CDS_1|1674_bp
nngcccactcgtgagagcgaggatgaggacgaggaggagcggcggggccggggctgcgca
ctgcaataccagcacgccaccgtgcgggtcctcacccagtttgtgtctgagggcgccggt
ccatggggccagccgaactacctgcttagtcctaactggcagttcgacatcactcacctg
gtggcagacttcatgaagctggaggaacctcacgtggccaccctccaggacagccgggtc
ctggttgggcgagaggttgggatgacgaccatccaggtgttgtctccactgtctgactcc
atcctggcagagaagacaataaccgtgctagatgacaaagtatcggtgacagacttggcc
atccagctcgtggctgggctgtctgtcgccctttaccccaacgcagaaaacagcaaggcc
gtaacagctgtggtcacagctgaggaggtgctgcggacccccaaacaggaggctgtattc
agcacgtggctgcagttcagtgatggctctgtgacgcccctggacatctacgacaccaag
gacttctccctggcagccacctcccaggacgaggctgtcgtgtcagtcccccagccccgc
tctcccaggtggcccgttgtggtggccgaaggggaaggccagggcccactgatccgagtg
gacatgacgatcgccgaggcctgccagaaatctaaacgcaagagcatcctggctgtgggc
gtcggcaacgtcagggtcaagttcggacagaacgatgctgactccagccccggcggggac
tatgaggaagatgagatcaagaaccacgccagcgaccgccggcagaagggccagcaccat
gagcgcacaggccaagatgggcacctctatggcagctctcccgtggagcgtgaggaaggg
gctctccgaagagccactaccacggccaggtccctgctggacaacaaagtggtgaagaac
agtcgggcagacgggggcaggctggcaggagaggggcagctgcagaacatccccattgac
ttcaccaacttccctgcccacgtggacctccccaaggccgggagtgggctggaggaaaac
gacctggtgcagactccgcggggcctgagtgatctggagatagggatgtacgccctcctg
ggggtgttctgcctggccatcctcgtcttcctgatcaactgcgccacctttgccctgaag
tacaggcacaagcaagtgcccctggaaggtcaggcctccatgacccactctcacgactgg
gtgtggcttggcaatgaggccgaactcctggagagcatgggggatgcgccgccgccccag
gacgagcacaccaccatcatagaccgcggaccgggggcctgcgaggagagcaaccatctc
ctgctcaatggtggctcccacaagcacgtgcagagccagattcacaggtcagccgactcc
ggggggcggcagggcagagaacagaagcaggaccccctgcactcgcccacctccaagagg
aagaaggtgaaatttaccacctttaccaccatccccccggacgacagctgccccacggtg
aactccatcgtcagcagcaatgatgaggacatcaaatgggtgtgtcaagacgtggctgtg
ggtgcccccaaggaacttagaaactatctggagaaactcaaagataaggcttag

>gi568815586r:128694199_128923827|GENSCAN_predicted_peptide_2|195_aa
MGERKKPGDPASKGIPPSSTCLVLVILATDWMVPTHIEEAKKSLGAHIVPGQLSTQEDAC
GPGQDQNNPGKAGWSGCSAEHHADQVYSWTTLIGSDEQEVARILSDLEYMLQRVRDDLCR
MFESCHVGKMMSGLRHVYAVETFPTRGSLAFLATLSWPYIERLNRLLAFSGTQSENKNVF
IKSRLQMNGLVTWAI

>gi568815586r:128694199_128923827|GENSCAN_predicted_CDS_2|588_bp
atgggagaaagaaagaagccaggagacccagcaagcaagggcatcccaccttcttccacc
tgccttgttctggtcatactggcaactgattggatggtgcccacacatattgaggaggcc
aagaagagcctgggagctcacattgtcccagggcagctctcaacccaggaggacgcatgt
ggccctggacaggaccagaataatccagggaaagctgggtggtctggatgttctgcagag
catcatgctgaccaagtgtattcatggaccacgttaattggatctgatgagcaggaagtg
gcaagaattttgagtgacctggaatacatgctccagagggtgagagacgacctctgcaga
atgttcgaatcctgccacgttggtaaaatgatgagtggtctaaggcacgtctacgctgtt
gaaactttccctacaagaggcagcctagctttcttggccacgctttcttggccgtatatt
gagagactcaacaggttgctagcttttagtgggactcagagtgaaaacaaaaatgtcttc
atcaagtccaggctgcagatgaatgggcttgtcacttgggccatttga

>gi568815586r:128694199_128923827|GENSCAN_predicted_peptide_3|323_aa
MLSQEKLIVNQQCLPHKAEYYWDGFNNRNRFFRVLGAAASKVKALADWMAGTSVGPCEFQ
EPAFPKLLAVLEEIFILSLGPGLLQDQISTDFVAYPTGIYAPCSGGQKSEITVLAGPQEE
GILTDPGVGRLRSVHSPSNASCWSLYKDPPFVSRFCSEYRTCQWRNLATTSSGQSPHVQT
PPPCAVAETLAVGEGPQPHRGEERELGPNVNVEAGQQVAKNRQQAGIRARRLSLTGSLRV
EFLMSWNQKSPDSYRNQGLWGIKTPTTDCTGPCDKWPSNSEKSLGRKRRGREHWSCTESR
LDEAVPCRTALQLQLLPEAGEGP

>gi568815586r:128694199_128923827|GENSCAN_predicted_CDS_3|972_bp
atgttgtcacaggagaagctcatcgtcaatcaacaatgccttcctcacaaagctgaatat
tattgggacggcttcaacaacaggaatagattcttcagagtcctgggggctgcggcgtcc
aaggtcaaggcactggctgattggatggctgggactagtgttggcccctgtgagttccag
gagccggcgtttccaaagctgcttgctgtgctggaggagattttcatcctgagccttgga
ccagggctgctgcaggaccagatttccactgacttcgtggcttacccaactggaatttat
gctccctgttctggaggccagaaatctgaaatcacagtattggcagggcctcaggaagag
gggatcctcactgaccctggagtggggcggctgaggtcagtccacagcccctccaatgcc
agctgctggagcctgtacaaagacccccctttcgtctcccgattctgcagcgagtatagg
acctgccagtggcgcaacctggcaaccacctcttcgggtcagagcccacatgtccagact
cccccaccctgtgctgttgctgagacactggccgtcggggaagggccccagcctcacaga
ggagaggagcgggaactaggccccaatgtcaatgtggaggctggccagcaagttgccaaa
aatagacaacaagctggaatcagagcaaggcgtttgtcactcacagggtcacttcgagtt
gaatttcttatgtcttggaaccaaaagagtcccgactcatatagaaatcagggcctctgg
ggcatcaagacccccactacagattgcactggtccttgtgataagtggccttctaactca
gagaagtctcttggaaggaagaggagaggcagggagcactggagctgcacagagtcaagg
ctggatgaggctgtgccgtgcagaactgccctgcagctgcagctgcttccagaggcaggc
gagggcccctga

>gi568815586r:128694199_128923827|GENSCAN_predicted_peptide_4|607_aa
MEGSGGGAGERAPLLGARRAAAAAAAAGAFAGRRAACGAVLLTELLERAAFYGITSNLVL
FLNGAPFCWEGAQASEALLLFMGLTYLGSPFGGWLADARLGRARAILLSLALYLLGMLAF
PLLAAPATRAALCGSARLLNCTAPGPDAAARCCSPATFAGLVLVGLGVATVKANITPFGA
DQVKDRGPEATRRFFNWFYWSINLGAILSLGGIAYIQQNVSFVTGYAIPTVCVGLAFVVF
LCGQSVFITKPPDGSAFTDMFKILTYSCCSQKRSGERQSNGSAWAPSYSGQVHHTAVRSI
IQRSGPSYGGQVHHTAVRSIIRRSAEMDSSHGLTVGPEHSDVCKALSRMPEEKVEDVKAL
VKIVPVFLALIPYWTVYFQLPAAWLTMFDAVLILLLIPLKDKLVDPILRRHGLLPSSLKR
IAVGMFFVMCSAFAAGILESKRLNLVKEKTINQTIGNVVYHAADLSLWWQVPQYLLIGIS
EIFASIAGLEFAYSAAPKSMQSAIMGLFFFFSGVGSFVGSGLLALVSIKAIGWMSSHTDF
ALLCAACITVFPYIMNGFQETSRPLFSSQGPSLPWPQPSEQLLQPPLLTGTSLTDKHLLH
ARPAQSS

>gi568815586r:128694199_128923827|GENSCAN_predicted_CDS_4|1824_bp
atggagggctctgggggcggtgcgggcgagcgggcgccgctgctgggcgcgcggcgggcg
gcggcggccgcggcggcggctggggcgttcgcgggccggcgcgcggcgtgcggggccgtg
ctgctgacggagctgctggagcgcgccgctttctacggcatcacgtccaacctggtgcta
ttcctgaacggggcgccgttctgctgggagggcgcgcaggccagcgaggcgctgctgctc
ttcatgggcctcacctacctgggctcgccgttcggaggctggctggccgacgcgcggctg
ggccgggcgcgcgccatcctgctgagcctggcgctctacctgctgggcatgctggccttc
ccgctgctggccgcgcccgccacgcgagccgcgctctgcggttccgcgcgcctgctcaac
tgcacggcgcctggtcccgacgccgccgcccgctgctgctcaccggccaccttcgcgggg
ctggtgctggtgggcctgggcgtggccaccgtcaaggccaacatcacgcccttcggcgcc
gaccaggttaaagatcgaggtccggaagccactaggagattttttaattggttttattgg
agcattaacctgggagcgatcctgtcgttaggtggcattgcctatattcagcagaacgtc
agctttgtcactggttatgcgatccccactgtctgcgtcggccttgcttttgtggtcttc
ctctgtggccagagcgttttcatcaccaagcctcctgatggcagtgccttcaccgacatg
ttcaagatactgacgtattcctgctgttcccagaagcgaagtggagagcgccagagtaat
gggtcagcatgggctccatcatacagcggtcaggtccatcatacggcggtcaggtccatc
atacagcggtcaggtccatcatacggcggtcaggtccatcatacggcggtcaggtccatc
atacggcggtcagcggagatggattcatctcatgggctcacggtggggcctgagcacagt
gatgtgtgcaaggccctcagcaggatgccagaagagaaagtggaagatgtgaaagctctg
gtcaagattgtccctgttttcttggctttgataccttactggacagtgtatttccaactc
cctgcagcctggctgaccatgtttgatgctgtgctcatcctcctgctcatccctctgaag
gacaaactggtcgatcccattttgagaagacatggcctgctcccatcctccctgaagagg
atcgccgtgggcatgttctttgtcatgtgctcggcctttgctgcaggaattttggagagt
aaaaggctgaaccttgttaaagagaaaaccattaatcagaccatcggcaacgtcgtctac
catgctgccgatctgtcgctgtggtggcaggtgccgcagtacttgctgattgggatcagc
gagatctttgcaagtatcgcaggcctggaatttgcatactcagctgcccccaagtccatg
cagagtgccataatgggcttgttctttttcttctctggcgtcgggtcgttcgtgggttct
ggactgctggcactggtgtctatcaaagccatcggatggatgagcagtcacacagacttt
gcactgctctgtgctgcttgcatcactgtgtttccttacatcatgaacggttttcaggag
acaagcaggcctctcttctcatcccaggggccaagcctgccctggccacagcccagcgag
cagctgctgcagccaccgctccttacggggacgtccttgacagacaagcaccttctccac
gcccgtccagctcagtcttcgtga

>gi568815586r:128694199_128923827|GENSCAN_predicted_peptide_5|281_aa
MAKTYNNKAEIKSVLVYAYTGMLRRYSNDTAEHLPEGRNTVSSIAEEVEGMSSPPKTFEG
MNAASSIAEESRRKGHSADTVSPDVCLPEPEEGPLCDTASPEVCPPEPEEGPLCDTSRRK
GPSVTPRARTSVLQSRRKGHSADTGIPDVCPPEAEEGPLCDTAIPDVCPPEAEEGPLCDT
TSPDSRRKGPSADTASPDVCPPEPEEGPLCDTASPDVCPPEPEEGPLCDTASPDVCPPEP
EEGPLCDTASPDVCPPEPEEGPLCDTVSRTSVLRSRRKGHL

>gi568815586r:128694199_128923827|GENSCAN_predicted_CDS_5|846_bp
atggcaaagacgtacaacaacaaggcagagatcaaatcagttctagtgtatgcatatact
ggaatgctacgtcgttattcaaatgatactgcagagcatcttccagaggggaggaacact
gtgtcctccatagcagaggaggtagaaggaatgagctcacccccgaagacctttgagggg
atgaacgctgcatcctccatagcagaggagagccggaggaagggccactctgctgacacc
gtgagcccggacgtctgtcttccagagccggaggaagggccactctgtgacaccgcgagc
ccggaagtctgtcctccagagccggaggaagggccactctgtgacaccagccggaggaag
ggcccctctgtgacaccgcgagcccggacgtctgtcctccagagccggaggaagggccac
tctgctgacaccgggatcccggacgtctgtcctccagaggcggaggaagggccactctgt
gacaccgcgatcccggacgtctgtcctccagaggcagaggaagggccactctgtgacacc
acgagcccggacagccggaggaagggaccctctgctgacaccgcgagcccggacgtctgt
cctccagagccggaggaagggccactctgtgacaccgcgagcccggacgtctgtcctcca
gagccggaggaagggcccctctgtgacaccgcgagcccggacgtctgtcctccagagccg
gaggaagggcccctctgtgacaccgcgagcccggacgtctgtcctccagagccggaggaa
gggccactctgtgacaccgtgagccggacgtctgtcctccggagccggaggaagggccac
ttgtga

>gi568815586r:128694199_128923827|GENSCAN_predicted_peptide_6|341_aa
MDPTGHTRATPHPGLAPPEAPGSCPPPRLAPEPASWGAASGRGGRDRPSRPGSPAVRVCA
GPGAWSAAAGPVDGPGGGGGMRLLFLAVLRPHTGNAVTAQRVRSLHVECWLMPARLGQES
LNMLTARCLSDDTLAHLEAAGHVCVLKDAFDFESRSEIANLILAENCEAALALHLYRGGR
LLQGHRIPFGVIFGGTDVNEDANQAEKNTVMGRVLEEASSRFKVNLPFFCFFDPGIATTP
NAAFNWNTFLQRSALFTIAKTWNQPKCPSVIDWIKKMWYIDTVEYYAAIKTNKIMSFAGT
WMKLEAVILSKLRQEQKTKHRMFSTYKWELNNENMWTQGGE

>gi568815586r:128694199_128923827|GENSCAN_predicted_CDS_6|1026_bp
atggaccccacggggcacacccgcgccaccccgcacccagggctcgcgccgcccgaggcc
cccggttcctgtccccctccccgcctggcgccggaacctgcgagctggggcgcggcctcg
gggaggggcgggcgggacagacccagccgccccggctcccccgccgtccgcgtctgcgcc
ggccccggggcctggtcggcggcggcggggccggtcgatggcccgggcggcggcggcggc
atgcggctcctgttcctggcggtgctgcggccacacaccggcaacgcggtcacggcccag
cgcgttcggtccctgcacgtggaatgctggctgatgccagctcggctggggcaggagagt
cttaacatgctgactgcgaggtgcctgtcagacgacacactggcccatctagaggctgca
gggcacgtgtgcgttttgaaggatgcctttgactttgaaagccgatctgagattgcaaac
ctcatcttggctgagaactgcgaggctgccctggctcttcatctctataggggaggcagg
cttttgcaaggccaccgaatcccttttggagtcatctttggtggaactgatgtaaatgaa
gatgccaaccaggcggaaaaaaacacagtcatgggcagagttcttgaggaagccagcagt
cgcttcaaagtgaacttgccctttttttgtttctttgacccaggaattgcaacaacacca
aacgccgcttttaactggaatacctttcttcaacgctctgcactattcacaatagcaaag
acatggaatcaacccaaatgcccatcagtgatagactggataaagaaaatgtggtacata
gacaccgtggaatactatgcagccataaaaacgaacaagatcatgtcctttgcaggaaca
tggatgaagctggaagccgttatcctcagcaaactaaggcaggaacagaaaaccaaacac
cgcatgttctcaacttataagtgggagctgaacaatgagaacatgtggacacagggaggg
gaataa