GENSCAN 1.0 Date run: 4-Nov-116 Time: 15:12:01 Sequence gi568815587r:107607838_107807930 : 200093 bp : 40.48% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 10858 10928 71 2 2 84 76 51 0.062 1.48 1.02 Intr + 13420 13550 131 0 2 104 67 23 0.036 0.37 1.03 Intr + 22580 22725 146 2 2 67 111 12 0.042 0.41 1.04 Intr + 22863 22891 29 1 2 98 115 8 0.817 1.42 1.05 Intr + 26047 26154 108 0 0 13 86 115 0.591 3.36 1.06 Intr + 27818 27928 111 1 0 56 96 94 0.838 6.66 1.07 Intr + 39631 39764 134 0 2 87 92 135 0.996 12.32 1.08 Intr + 42498 42566 69 0 0 83 91 106 0.963 7.78 1.09 Intr + 48096 48229 134 0 2 56 84 39 0.690 -0.33 1.10 Intr + 57188 57343 156 0 0 86 9 229 0.275 13.86 1.11 Intr + 57484 57605 122 2 2 28 80 47 0.208 -2.81 1.12 Term + 61428 61595 168 2 0 118 41 123 0.592 7.50 1.13 PlyA + 62132 62137 6 -0.45 2.09 PlyA - 63304 63299 6 1.05 2.08 Term - 64212 64093 120 0 0 82 53 67 0.490 0.09 2.07 Intr - 68251 68080 172 0 1 -5 89 123 0.308 2.12 2.06 Intr - 68723 68439 285 1 0 52 49 173 0.035 5.33 2.05 Intr - 75624 75587 38 0 2 110 71 64 0.015 3.04 2.04 Intr - 94699 94634 66 1 0 86 99 18 0.017 0.88 2.03 Intr - 107202 107067 136 2 1 46 28 131 0.006 2.55 2.02 Intr - 115881 115817 65 0 2 105 -7 111 0.006 -0.10 2.01 Init - 125003 124950 54 2 0 81 103 5 0.453 2.80 2.00 Prom - 126339 126300 40 -8.55 3.02 PlyA - 126860 126855 6 1.05 3.01 Sngl - 129103 128609 495 1 0 53 55 354 0.484 24.40 3.00 Prom - 129680 129641 40 -3.65 4.03 PlyA - 130071 130066 6 -0.45 4.02 Term - 130835 130329 507 2 0 75 35 379 0.636 24.72 4.01 Init - 136681 136670 12 1 0 71 87 16 0.184 -0.48 4.00 Prom - 137651 137612 40 -6.95 5.00 Prom + 139332 139371 40 -2.45 5.01 Init + 142143 142353 211 2 1 72 24 114 0.126 2.48 5.02 Intr + 142455 142702 248 1 2 39 62 165 0.146 5.26 5.03 Intr + 149957 150103 147 1 0 81 76 73 0.084 4.91 5.04 Intr + 152412 152634 223 2 1 15 53 179 0.038 3.88 5.05 Term + 156123 156514 392 1 2 38 49 191 0.119 4.36 5.06 PlyA + 156799 156804 6 1.05 6.06 PlyA - 160656 160651 6 1.05 6.05 Term - 184963 184778 186 1 0 97 37 172 0.946 9.51 6.04 Intr - 195318 195164 155 1 2 93 92 105 0.924 10.27 6.03 Intr - 196933 196881 53 0 2 76 76 52 0.692 0.43 6.02 Intr - 197678 197522 157 0 1 116 50 108 0.726 7.95 6.01 Intr - 199039 198880 160 1 1 95 94 81 0.863 8.04 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815587r:107607838_107807930|GENSCAN_predicted_peptide_1|459_aa XMSLKFVSIVGSLKIVLDRVCHKLQKRGKENICQAQTSVGGTEQYSRLIPCPNFEAIKGK NYFSSMIRMLIQVCLYFYCKFLWRCLKFVMRKLTGRCELQRICYNTKPGASRTMKIETSL RDSKSKLFGAMVTSNDESHSLNMTLLKFSIPRLSDCPVRLEQACLLQIVGYRNLIADVEK LRREAYDSDNPQHEEMLLKLWKFLKPNTPLESRISKQWCEIGFQGDDPKTDFRGMGLLGL YNLQYFAERDATAAQQVLSDSLHPKCRYSFAIVGINITDLAYNLLVSGALKTHFYNIAPE APTLSHFQQTFCYLMHEFHKFWIEEDPMDIMEFNRVREKFRKRIIKQLQNPDMALCPHFA ASEVFSSLGHNSEIPRDHCFWSICHPVTAHIVALCSVTSCLDTQERKAANLLSAPLQGKP QQGGFAGLVRWQAQAGSYTAFISVEGETRRALGMCALHK >gi568815587r:107607838_107807930|GENSCAN_predicted_CDS_1|1380_bp ntgatgtctctgaaatttgtatcaattgttggttccctgaagattgtgctggacagagta tgccataaattgcagaaaagaggaaaggaaaatatctgccaagctcaaacctctgtggga ggaactgagcaatacagcagacttatcccatgtcccaactttgaggcaatcaagggaaaa aactatttttctagtatgattagaatgttgatccaggtatgcctgtatttttactgtaaa tttctgtggcgctgcctgaaatttgtaatgaggaagctaactggaagatgtgaactacaa cggatctgttataataccaagccgggagcttctagaaccatgaaaatcgaaacatcactg agggattctaaaagtaagttgtttggtgctatggtaacaagtaatgatgaaagtcattca ctgaatatgactttattaaagttcagtattcctaggctgtctgactgtcctgtacgtttg gagcaggcttgccttctgcaaatcgttgggtacaggaaccttattgcagatgtggaaaaa ctgcgtagagaggcctatgattctgataatccccaacatgaagaaatgcttttgaagtta tggaaattcttgaagcccaatactccactggaatctcggatttctaagcagtggtgtgaa attggtttccaaggtgatgatcctaaaacagactttcgaggaatgggacttctgggactg tacaatttgcagtatttcgcggaaagggatgccacagcagctcagcaggtcctgtctgac tctcttcatccgaaatgcaggtactcatttgcaattgtgggcatcaatataactgacctg gcatataatctactggtcagcggagctctaaaaacccatttctacaatatcgccccagaa gctccaacattgtctcactttcagcaaacattctgctatttgatgcatgaatttcataag ttttggatcgaagaggaccccatggacataatggaatttaatcgtgtgagggagaaattc cgcaagaggatcatcaaacagctgcagaacccagacatggcgctgtgcccacattttgct gcctcggaagtgttttcatctcttggtcataattccgagatccccagagaccactgtttc tggagtatctgtcatccagtgactgctcatattgtggcattatgcagtgttacatcttgc cttgatacccaggagaggaaggctgcaaatctcctttctgctcccttgcaggggaaacct cagcaaggtgggtttgcaggacttgtgcgatggcaagctcaggccggaagctacacggca tttatctctgtggaaggagaaacacgcagggctttgggcatgtgcgcgcttcataaataa >gi568815587r:107607838_107807930|GENSCAN_predicted_peptide_2|311_aa MNHRIQPMAYTFLKIPVSPTQCEDEKEEDLYDDPLLLNEYSQGKKAGGRGEYMKRASLAT DQAQELSTDSQRMSFSHLRMYMVMVGTSTFPYSGLKEPKYVYVRSLQLLDEELILAPKLS CSVSCCSSKEARRVELQKAATCSGLWHSTKAKANTLSLNRLWKSTSGCVGIPKSSSRKDF KERICVPEYKQTCYTTPFTTWLLPLSLAPASFLVRGGQTKAGRSAEVLRDLGYVQLTTLT SLGCGSCIYGPRRQMSHPVYSSKKNLETKEQRGKGHGAHSEKGAICRPGRRPSPEPDHAG TLTWDLRPPEL >gi568815587r:107607838_107807930|GENSCAN_predicted_CDS_2|936_bp atgaatcaccgcatccagccgatggcatatacattcttaaaaatccccgtttcgcctact caatgtgaagatgaaaaggaggaagacctttatgatgacccacttctacttaatgaatat tctcagggtaaaaaggcgggtggacgaggggagtacatgaaacgagctagtttagcaacc gatcaggcacaagagctcagcactgattctcaaaggatgtcattttcacacctgagaatg tatatggtgatggtgggcacaagcacctttccttattctgggctgaaagagccaaaatat gtgtatgtcagaagcctccagctccttgatgaagagctgattcttgcaccaaaattaagc tgcagtgtcagctgctgcagcagcaaagaagccaggagggttgagctgcagaaggcagca acctgctcagggctctggcatagcaccaaagccaaggccaacacactctctctcaatcgt ttgtggaagagcacatctggatgtgtgggtattcctaaatcctcctccaggaaagatttc aaagaaaggatctgtgttcctgaatacaagcagacctgttataccacacccttcacgacc tggcttctgcctttgtctctggctcctgcctcatttctcgttagaggaggacagaccaaa gctggtagatctgctgaggttctcagggatctaggctatgtccaactcaccactctgaca tccctaggatgtggctcttgtatttatggtccaagaaggcaaatgtcacatccagtttat agcagcaagaagaacctagaaacaaaggagcagagagggaaaggccatggggcacacagt gagaagggggccatctgcaggccaggaagaaggccctcaccagaacccgaccatgctggc acactgacctgggacttgcggcctccagaactgtga >gi568815587r:107607838_107807930|GENSCAN_predicted_peptide_3|164_aa MNNFIEIFSQKTNLIDKISWRIFKEEREYFDEMKEYGPIHILWLVSEEDLVDTLKDVDSC IDRCCKATEKWKSGLSEALLPVVHEYVLYSEMLMGVMKRRDQIQAELDSKVEALTYKKTD SDLLTEEIGKLEDKVEYVNNALKAEWERWQQNMQNDIVSIYRYG >gi568815587r:107607838_107807930|GENSCAN_predicted_CDS_3|495_bp atgaataactttattgaaatttttagccagaaaacaaatttgatagataaaatatcttgg agaattttcaaggaagaaagggaatattttgatgaaatgaaagaatatggtccaattcat attctgtggttagtgtcagaagaggatctggttgatactctaaaagatgttgacagctgc attgacagatgctgtaaggccactgaaaaatggaagtctggactctcagaggccctgctt cctgttgtacatgagtacgtgctttatagtgaaatgttaatgggtgttatgaaaagaaga gaccaaatacaagcagaactggattccaaagttgaagctttgacctataaaaagacagat agtgatctgcttacagaggagattggaaaacttgaagataaagtagaatatgttaataat gccctgaaagcagaatgggaaagatggcaacaaaatatgcaaaatgatatcgttagcatt tacagatatggctga >gi568815587r:107607838_107807930|GENSCAN_predicted_peptide_4|172_aa MGLLVEVLALDEDEDDLEVFSNYASLMDMNSFSTMMPTSPLSMINQIKFEDERDLKEHFI TVDKPESYVTTIETFITYSIITKSSCGEFDSSEFEVRRRYQGFLWLKRKLEEAQPTLNIP PFPEKFIVKGMVEHFKDDFIETCRKALHKFLSRTADHPTLTFNKVFKIFLTA >gi568815587r:107607838_107807930|GENSCAN_predicted_CDS_4|519_bp atggggctcttggtggaggtgctggctctggacgaggacgaggatgacctggaggttttc agcaactatgcctcactgatggacatgaattccttcagcactatgatgccaacatcccct ttatcaatgataaaccaaatcaagtttgaagatgaacgagatttaaaggaacacttcatt acagttgataaacctgaaagttatgttactacaatagaaactttcattacgtatagcatt attactaagtcatcttgtggagaatttgactccagtgaatttgaagttaggagacggtat caaggtttcctttggttgaagagaaaacttgaagaagcacaacccactctgaatattcca ccatttccagaaaaatttatagtaaaaggaatggtggaacactttaaagatgacttcatt gagacatgcaggaaggctttacataaatttttgagccgaactgctgatcatccaacttta acatttaataaggtcttcaaaatttttctcactgcataa >gi568815587r:107607838_107807930|GENSCAN_predicted_peptide_5|406_aa MASSFTYVAAKNMISLLLWLHSEMGKVPLSPSQGVRWGCDSLLQCPAAQTSMGGMQTGRL WGSDPKAVSRGQRAFCIPGFLPWCTGRIGSHVGLETEYKVLLSGSSSQQLGEPEGRWFSP GIWPFSSAGSPPPTLSKLHVLPVDGLLTCRRLSVGNLGCTQLKSSGNCLLPSRAEEVTVP VSLISQEGSPGFVHMAAGEFPRFLKMVCPEFVPSDVQMCPEFIPSGGFTVLLSSGVKPQT FAVSVTALKGGVSRVVRSSWWVRGLADFRSKAADLHTNIMLNEEKLKTFPLTTGTRQGCT LSPLLFNLVLEILARAIRQEKEIKGIQISEEEVKLSLFADYMIIYLENPKDSCRKLLELT KEFCEVSGYKINVCKSVALLYTNSDQAENQSKSSTPFTIAEKRRNT >gi568815587r:107607838_107807930|GENSCAN_predicted_CDS_5|1221_bp atggcctctagcttcacctacgttgctgcaaagaacatgatctcactgcttttatggctt catagtgaaatgggaaaagttcccttgtccccctcgcagggtgtgcgatggggatgtgac tcgcttcttcagtgtcccgctgctcaaacctctatggggggcatgcagacgggcaggctg tggggctccgaccccaaggcagtgtctaggggacagagggctttctgtatccctgggttc ttgccttggtgtactggaagaatcggatcacacgtgggcttggagactgagtacaaggtt ttgttgagtggaagtagctctcagcagttgggggagccagaagggagatggttttcccct ggaatctggccattcagcagcgcaggctctcctcctccgaccctgtccaaactccatgtt ctgccggtcgatggcctgctgacctgccggcgtctgtcggttggcaatttgggctgcact cagctgaagagttctgggaattgtctactgccatctagagctgaggaggtaacagtgcca gtgtctctcattagccaggagggtagcccaggctttgttcacatggcagcaggcgagttc ccaaggttcttaaagatggtgtgtccggagtttgttccttcagatgttcagatgtgccca gagtttattccttctggtgggttcacggtcttgctgtcttcaggagtgaagccacagacc tttgcagtgagtgttacagctcttaaaggtggcgtgtccagagttgttcgttcttcctgg tgggttcgtggtcttgctgacttcaggagtaaagccgcagaccttcacaccaacataatg ctgaatgaggaaaagttgaaaacattccctttgacaaccggaacgagacaaggatgcaca ctctcaccactccttttcaacttagtactggaaattctagccagagcaatcagacaagag aaagaaataaagggcatccaaatcagtgaagaggaagtcaaactgtcactgtttgctgac tatatgatcatttaccttgaaaaccctaaagactcctgcagaaagctcctagaactgacg aaagaattctgtgaagtttctggatacaagattaatgtatgcaaatcagtagctcttcta tataccaacagtgaccaagcagagaatcaaagcaagagctcaaccccttttacaatagct gaaaaaagaagaaatacttag >gi568815587r:107607838_107807930|GENSCAN_predicted_peptide_6|236_aa LLDCFGIPVLMALSWFILHARYRVIHFIAVAVCLLGVGTMVGADILAGREDNSGSDVLIG DILVLLGASLYAISNVCEEYIVKKLSRQEFLGMVGLFGTIISGIQLLIVEYKDIASIHWD WKIALLFVAFALCMFCLYSFMPLVIKVTSATSVNLGILTADLYSLFVGLFLFGYKFSGLY ILSFTVIMVGFILYCSTPTRTAEPAESSVPPVTSIGIDNLGLKLEENLQETHSAVL >gi568815587r:107607838_107807930|GENSCAN_predicted_CDS_6|711_bp cttttggattgctttgggattcctgtgttgatggctctgtcatggtttattcttcatgca agatacagagtgatccacttcatcgccgtggctgtctgtctgttgggtgtaggaaccatg gttggtgcagacatactagcagggagggaagacaattcagggagtgatgtattgattggt gacatcttggtccttcttggggcttccctctatgccatttcaaatgtttgtgaggaatac atcgtgaagaagctgagcagacaggagtttttaggaatggtgggcctgtttggaacaatt atcagtggtatacagctattgattgtggaatataaggatattgccagcattcattgggac tggaaaattgccctgctgttcgtggcatttgccctgtgtatgttttgcctgtacagcttc atgccattggtgattaaagtcactagtgccacttccgtcaacctgggcatcctgacagcg gacctctacagcctttttgttggactctttctgtttggctataagttttcaggactctac atcctgtccttcactgtcatcatggtggggtttatcctgtactgctccacccctactcgc acggccgagccggctgaaagcagcgtgcctccagtcaccagcattgggattgacaacctg gggctgaagctggaggagaacctccaggagacccactctgctgtcttgtag